[FFmpeg-devel] avcodec/utvideodec : add x86 SIMD (SSSE3) for gradient prediction
Martin Vignali
martin.vignali at gmail.com
Sun Nov 26 23:15:54 EET 2017
Hello,
Patch in attach add SIMD (SSSE 3) for gradient prediction
and a checkasm test
Checkasm result (width = 1024) (kaby lake, macos 10.12)
add_gradient_pred_c: 1708.8
add_gradient_pred_ssse3: 533.0
Benchmark on a 3 min HD File in gradient (422)
without SIMD :
bench: utime=102.695s
bench: maxrss=102592512kB
with SIMD
bench: utime=91.712s
bench: maxrss=102543360kB
i will add AVX2 version later (need more clean before submitting, and will
conflict with another patch add_left_pred avx2 version)
This new dsp func, can probably also be use by magicyuv decoder
i'm not sure about the best asm way for load an uint8_t to all part of an
xmm,
Comment welcome
Martin
Jokyo Images
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-avcodec-utvideodec-add-SIMD-SSSE3-for-gradient_pred.patch
Type: application/octet-stream
Size: 6180 bytes
Desc: not available
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20171126/237dc19f/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0002-checkasm-llviddsp-add-test-for-add_gradient_pred.patch
Type: application/octet-stream
Size: 1892 bytes
Desc: not available
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20171126/237dc19f/attachment-0001.obj>
More information about the ffmpeg-devel
mailing list