terrible serial SYRK performance
Brought to you by:
rwhaley,
tonyc040457
The 3.11.38 (and prior releases, not sure how far back) all have a stupid error that will roughly halve serial SYRK/HERK performance (may have some affect on parallel perf, not sure).
This is fixed in 3.11.39, but in the meantime, you can fix this error by adding the line
to ATLAS/include/atlas_kern3.h