Adding time collection per layer.
coding dense networks
speeding up volume copy functions
speeding up AVGPool
Speeding up maxpooling.
Speeding up Maxpooling.
Collecting timings per layer.
Updating OpenCL experiment.