move files
remove some warnings
remove prefetch instructions, since they appear to only eat cycles
more explicit instantiation
fighting with icc weirdness
doesn't work with icc
make it compile with icc
add some instantiations
add more instantiations because icc wants them
adapt code to icc