[Ffmpeg-devel] [PATCH] lowres chroma bug
Oleg Metelitsa
oleg
Thu Feb 8 09:09:46 CET 2007
Hello Trent,
>> Of course that would only be done for avg_h264_chroma_mc2_mmx2, not for
>> avg_h264_chroma_mc{4,8}_mmx2. Maybe this is faster than the using the
>> 16-bit move? The same can be done for the put version too:
>>
>> @@ -1376,1 +1376,2 @@
>> -#define H264_CHROMA_OP4(S,D,T)
>> +#define H264_CHROMA_OP4(S,D,T) "movd 2+" #S ", " #T "\n\t"\
>> + "punpcklwd " #T ", " #D "\n\t"
Why do not use one SSE integer instruction instead of two MMX
instructions?
So we will have:
#define H264_CHROMA_OP2(S,D,T) "pinsrw $1, 2+" #S ", " #D " \n\t"
instead of
>> +#define H264_CHROMA_OP2(S,D,T) "movd 2+" #S ", " #T "\n\t"\
>> + "punpcklwd " #T ", " #D "\n\t"
Oleg
More information about the ffmpeg-devel
mailing list