From: Felix <fx...@gm...> - 2005-01-17 23:05:13
|
Hi, the attached patch optimizes the spantmp2.h template by avoiding redundant address computations for every pixel in linear spans. Since the assembler read functions make the same assumptions that should not affect existing users of the template. Unfortunately there is hardly any measurable speedup on the Savage. But I suspect that's a hardware limitation with CPU access to tiled surfaces (linear mapping of tiled frame buffer). If the effect of my patch is more significant on other hardware I'll commit this patch to CVS. A good test should be the glxgears framerate with no_rast=3Dtrue. (I only tested it with 16bpp so far, before committing anything I'd make sure that 32bpp works as well.) Regards, Felix --=20 | Felix K=FChling <fx...@gm...> http://fxk.de.vu | | PGP Fingerprint: 6A3C 9566 5B30 DDED 73C3 B152 151C 5CC1 D888 E595 | |