[FFmpeg-devel] [PATCH 0/2] x86: hevc_mc: port to SSSE3
James Almer
jamrial at gmail.com
Sat Aug 23 16:48:12 CEST 2014
On 23/08/14 11:07 AM, Mickaël Raulet wrote:
> For 10bits and 12bits, they should stay sse4 as well because of packusdw. You need some instructions to convert it to ssse3 see below
>
>
> static av_always_inline __m128i _MM_PACKUS_EPI32( __m128i a, __m128i b )
> {
> a = _mm_slli_epi32 (a, 16);
> a = _mm_srai_epi32 (a, 16);
> b = _mm_slli_epi32 (b, 16);
> b = _mm_srai_epi32 (b, 16);
> a = _mm_packs_epi32 (a, b);
> return a;
> }
There's a PACK macro in lavfi/x86/yasm-16.asm that does this without intrinsics.
More information about the ffmpeg-devel
mailing list