[FFmpeg-devel] swscale/swscale_unscaled : add X86_64 (SSE2, AVX) for uyvyto422
Martin Vignali
martin.vignali at gmail.com
Tue Apr 3 00:26:41 EEST 2018
Hello,
In attach patch to add SSE and AVX SIMD (x86_64) for uyvyto422
Write using external ASM (keep a similar logic, than the MMX Inline version)
also add checkasm test
uyvy is use by some devices (like decklink, to provide raw data)
Around 20% faster (on a "benchmark cmd", who test pix_fmt conversion)
(4.2s with the patch, 5.2s without)
Pass fate test for me.
Checkasm result :
uyvytoyuv422_c: 14146.6
uyvytoyuv422_mmx: 13696.4
uyvytoyuv422_mmxext: 19395.9
uyvytoyuv422_sse2: 6284.1
uyvytoyuv422_avx: 5379.6
Martin
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-swscale-swscale_unscaled-add-X86_64-SSE2-and-AVX-for.patch
Type: application/octet-stream
Size: 9896 bytes
Desc: not available
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20180402/43e86b6a/attachment.obj>
More information about the ffmpeg-devel
mailing list