- Use MMX for motion compensation. Although the code only
uses MMX registers, it requires SSE2, so you need to
#define HAVE_SSE2 to enable it.
- Some other tweaks to mot_comp, mostly to improve
cache locality.
- Motion compensation output is now clamped to the
(0,1020) bounds, so only I frames need to be
clipped in frame_decompress.cpp.
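
To illustrate the clamping item above, here is a minimal scalar sketch of folding the clamp into the compensation step. It is not the code from the patch: ClampSample and AddPredictionRow are hypothetical names, the bounds are assumed to be inclusive, and the real code works on the codec's own array types rather than std::vector.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper (not from the patch): clamp one sample to 0..1020,
// treating both bounds as inclusive.
inline short ClampSample(int value)
{
    return static_cast<short>(std::min(1020, std::max(0, value)));
}

// Hypothetical helper: add a row of prediction values into the output row and
// clamp each result immediately, so no separate clipping pass is needed later.
void AddPredictionRow(std::vector<short>& out, const std::vector<int>& pred)
{
    const std::size_t n = std::min(out.size(), pred.size());
    for (std::size_t i = 0; i < n; ++i)
        out[i] = ClampSample(out[i] + pred[i]);
}
```

Clamping inside the loop touches each sample once while it is still in cache, instead of walking the whole component a second time.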
This includes the contents of my previous patch (making
array rows contiguous in memory).
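
For reference, "contiguous rows" means something like the sketch below: one allocation holds every row back to back, and row access is pointer arithmetic into it. This is only an illustration under my own assumptions. ContiguousArray2D is a hypothetical class, not the array type used in the codebase, and the short element type is assumed.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical 2D array (not the codec's own class): all rows live in a single
// allocation, so row r starts exactly cols elements after row r - 1.
class ContiguousArray2D
{
public:
    ContiguousArray2D(std::size_t rows, std::size_t cols)
        : m_rows(rows), m_cols(cols), m_data(rows * cols, 0) {}

    // Return a pointer to the first element of the requested row.
    short* operator[](std::size_t row) { return m_data.data() + row * m_cols; }
    const short* operator[](std::size_t row) const { return m_data.data() + row * m_cols; }

    std::size_t Rows() const { return m_rows; }
    std::size_t Cols() const { return m_cols; }

private:
    std::size_t m_rows;
    std::size_t m_cols;
    std::vector<short> m_data;  // single block of rows * cols elements
};
```

With this layout a block that spans several rows is read from one predictable address range, which is friendlier to the cache than separately allocated rows.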
Profiled on an Athlon64 3200, Windows 2000, VS.NET compiler
unopt 49126 samples
opt 33809 samples
CompensateBlock + children:
CompensateComponent + ClipComponent
1 sample ~= 0.1 ms