Providing stretch and blit routines for SDL surfaces. These are optimized for speed including lots of assembler parts in the general routines and dynamic cpu native code generation for applications to compile specialized stretch-and-blit routines at runti