Bug fixing with regard to interleaved convolution.
testcnnalgo now compiles again.
Making convolutions a bit faster.
Making CAI more portable.
Preparing code for future (faster) dot product engines.
Preparing more potent dot product engines.
Small fix on OpenCL dot product test.
Working on OpenCL implementation.
Making the code more portable.
Adding volume tiling test.