[FFmpeg-devel] [WIP] [PATCH 0/5] sse2/avx functions for 8-bit simple_idct
James Darnley
jdarnley at obe.tv
Sat Jun 10 14:46:39 EEST 2017
So here is my second work-in-progress patch set. These are the functions from
the second half of my previous set.
With Ronald's rounding contribution, I am much closer to finishing. As I
mentioned on IRC, I think the speed of decoding an MPEG-2 HD sample has increased
from 215 to 235 fps.
I still have a small optimisation to make and I need to use the correct
coefficients. This will require a large change to the macros. I am sending
this so that people can nitpick my changes.
James Darnley (5):
  avcodec/x86: cleanup simple_idct10
  avcodec/x86: add x86-64 8-bit simple_idct function
  more cleanup
  avcodec/x86: add x86-64 8-bit simple_idct put function
  avcodec/x86: add x86-64 8-bit simple_idct add function

 libavcodec/tests/x86/dct.c                |   2 +
 libavcodec/x86/idctdsp_init.c             |  23 +++++++
 libavcodec/x86/proresdsp.asm              |   2 +-
 libavcodec/x86/simple_idct.h              |   9 +++
 libavcodec/x86/simple_idct10.asm          | 100 ++++++++++++++++++++++++++++--
 libavcodec/x86/simple_idct10_template.asm |  80 +++++++++++-------------
 6 files changed, 167 insertions(+), 49 deletions(-)
--
2.13.0