[FFmpeg-devel] [PATCH] NEON add_signed_pixels_clamped
Måns Rullgård
mans
Sat Apr 4 15:27:19 CEST 2009
David Conrad <lessen42 at gmail.com> writes:
> Hi,
>
> 3% overall wmv3 decoding speedup.
>
> Also, is it possible to have something like
>
> .macro reg=2
> d\(\reg*2)
>
> evaluate to d4? Or any other ideas to put the repeated sections in a
> macro that isn't ugly?
I don't think there is a simple way, but read on.
The attached version is 1 cycle faster on Cortex-A8, 5 cycles faster
on A9. Unfortunately it's somewhat more difficult to read than the
original. That's the price you pay for speed.
--
M?ns Rullg?rd
mans at mansr.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: add_pixels_clamped_neon.patch
Type: text/x-patch
Size: 3313 bytes
Desc: not available
URL: <http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/attachments/20090404/c2a3e863/attachment.bin>
More information about the ffmpeg-devel
mailing list