[FFmpeg-devel] [PATCH] VC-1: Faster C loopfilter
David Conrad
lessen42
Sun Jan 18 06:48:41 CET 2009
On Jan 18, 2009, at 12:39 AM, Jason Garrett-Glaser wrote:
> On Sat, Jan 17, 2009 at 11:54 PM, Jason Garrett-Glaser
> <darkshikari at gmail.com> wrote:
>> Yuvi requested I optimize this one, so I did. This patch can be
>> summed up as follows:
>>
>> 13:20 <@mru> I see a lot of xor
>> 13:20 <@mru> that's usually a sign of magic
>>
>> 338 -> 315 cycles for vc1_loop_filter on Core 2 Duo.
>> Overall decoding 12.047s -> 11.781s
>>
>> Dark Shikari
>>
>
> Also, it seems forcing inlining of vc1_filter_line improves
> performance greatly (315 -> 275 cycles from the above); is it fine to
> av_always_inline it?
I am working on mmx/sse versions you know :P
Anyways you should get most of the inlining benefits with separate h/v and
size=4/8/16 functions (h/v much more so than 4/8/16), e.g. with the
attached (I'll split/update it once your optimizations hit svn)
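
To make the idea concrete, here is a rough sketch of that layout. It is not the attached patch: filter_line() is a stand-in smoothing op rather than the real VC-1 filter, and the names (ALWAYS_INLINE, loop_filter, lf_v4 ... lf_h16) are made up for illustration. The only point is that once step, advance, and len are compile-time constants in each wrapper, the compiler can inline and specialize the per-line code without a runtime h/v branch.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for av_always_inline, so the sketch builds alone. */
#if defined(__GNUC__)
#define ALWAYS_INLINE inline __attribute__((always_inline))
#else
#define ALWAYS_INLINE inline
#endif

/* Placeholder per-line filter (NOT the real VC-1 filter): pulls the two
 * pixels that straddle the edge toward their rounded average.
 * step is the distance between pixels across the edge. */
static ALWAYS_INLINE void filter_line(uint8_t *src, ptrdiff_t step)
{
    int a   = src[-step];
    int b   = src[0];
    int avg = (a + b + 1) >> 1;

    src[-step] = (uint8_t)((a + avg + 1) >> 1);
    src[0]     = (uint8_t)((b + avg + 1) >> 1);
}

/* Generic body: step crosses the edge, advance walks along it. */
static ALWAYS_INLINE void loop_filter(uint8_t *src, ptrdiff_t step,
                                      ptrdiff_t advance, int len)
{
    for (int i = 0; i < len; i++)
        filter_line(src + i * advance, step);
}

/* "v" (assumed convention): the edge is horizontal, so the pixels being
 * filtered are vertical neighbours: step = stride, advance = 1. */
void lf_v4 (uint8_t *src, ptrdiff_t stride) { loop_filter(src, stride, 1,  4); }
void lf_v8 (uint8_t *src, ptrdiff_t stride) { loop_filter(src, stride, 1,  8); }
void lf_v16(uint8_t *src, ptrdiff_t stride) { loop_filter(src, stride, 1, 16); }

/* "h" (assumed convention): the edge is vertical, so the pixels being
 * filtered are horizontal neighbours: step = 1, advance = stride. */
void lf_h4 (uint8_t *src, ptrdiff_t stride) { loop_filter(src, 1, stride,  4); }
void lf_h8 (uint8_t *src, ptrdiff_t stride) { loop_filter(src, 1, stride,  8); }
void lf_h16(uint8_t *src, ptrdiff_t stride) { loop_filter(src, 1, stride, 16); }
```

Callers would then pick one of the six entry points (e.g. through function pointers in a dsp context) instead of passing step/stride/len to one generic function.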
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: vc1-dsp_lf.txt
URL: <http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/attachments/20090118/f1469610/attachment.txt>