On Sat, Mar 12, 2011 at 1:24 PM, Ian Scott <firstname.lastname@example.org>
It previous tests, core vnl BLAS-like routines i.e. operator*(vnl_vector, vnl_matrix) were at least as fast as tuned BLAS libraries, since BLAS has to handle things like skip intervals, etc.
On the other hand the LAPACK libraries provided in $VXLSRC/v3p just use standard BLAS libraries in $VXLSRC/v3p that haven't been tuned at all. This is the cause of the situation you found. It would be really useful if someone wanted to sort out the v3p CMake config so that you could use a system-provided (and tuned) BLAS library.
If someone does this, please let me know and I'll try the timing tests again and report the results.