[Ffmpeg-devel] [RFC] smallcpy for h264
Luca Barbato
lu_zero
Sat Oct 7 14:18:52 CEST 2006
Michael Niedermayer wrote:
> but before i will agree to this i want
> 1. to know why we spend a significant time doing small memcpys
Loren do you have time to have a look on it? The on x86simd codepath has
many of them...
> 2. why ppc doesnt inline memcpy like x86 does
inlined memcpy are triggered with -O3 iirc, so having them doesn't help
speed at all (see the threads about avoiding -O3 to get better speed)
I'll dig glibc to see if we have inlined variants available.
>
> furthermore these aligment related changes must be split,reviewed
> and applied before any benchmarking makes sense (= your benchmark
> of missaliged arrays with memcpy vs. your code with aligned arrays
> might show more the speed difference of alignment and less that
> of the actual code)
please check the attached code.
lu
--
Luca Barbato
Gentoo/linux Gentoo/PPC
http://dev.gentoo.org/~lu_zero
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ffmpeg_h264_align.patch
Type: text/x-patch
Size: 3158 bytes
Desc: not available
URL: <http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/attachments/20061007/2d565355/attachment.bin>
More information about the ffmpeg-devel
mailing list