[Ffmpeg-devel] a little optim for a SSE version of H263_LOOP_FILTER
Zuxy Meng
zuxy.meng
Mon Nov 6 14:58:18 CET 2006
Hi,
2006/11/5, skal <skal65535 at orange.fr>:
>
> Hi everybody,
>
> in case, it seems to me a SSE version of
> H263_LOOP_FILTER is possible by replacing
> "psubusb %%mm4, %%mm2 \n\t"\
> "movq %%mm2, %%mm3 \n\t"\
> "psubusb %%mm4, %%mm3 \n\t"\
> "psubb %%mm3, %%mm2 \n\t"\
> at dsputil_mmx.c:587 (fresh cvs), by:
> "psubusb %%mm4, %%mm2 \n\t"\
> "pminub %%mm4, %%mm2 \n\t"\
>
> +maybe a little re-org of the loop (mm3 is gone).
>
> Well, this is just for the fun of it, since the speed-up
> (if any) might not be worth a special version...
What about rewrite it with purely SSE2? I'll try when I have time...
--
Zuxy
Beauty is truth,
While truth is beauty.
PGP KeyID: E8555ED6
More information about the ffmpeg-devel
mailing list