[FFmpeg-devel] [PATCH] Moves yuv2yuvX_sse3 to yasm, unrolls main loop and other small optimizations for ~20% speedup.
Alan Kelly
alankelly at google.com
Tue Oct 27 15:06:49 EET 2020
Thanks for the feedback, Anton.
The second patch incorporates changes suggested by James Almer:
- avx2 instructions are wrapped in if cpuflag(avx2), and movddup is restored
- mm1 is replaced by m1 on x86_32
On Tue, Oct 27, 2020 at 10:40 AM Anton Khirnov <anton at khirnov.net> wrote:
> Hi,
> Quoting Alan Kelly (2020-10-27 10:10:14)
> > ---
> > libswscale/x86/Makefile | 1 +
> > libswscale/x86/swscale.c | 75 ++++---------------------
> > libswscale/x86/yuv2yuvX.asm | 109 ++++++++++++++++++++++++++++++++++++
> > 3 files changed, 120 insertions(+), 65 deletions(-)
> > create mode 100644 libswscale/x86/yuv2yuvX.asm
> >
>
> No comments on the code itself (yet?), but as for your submission:
> - when you send multiple iterations of the same patch, it is helpful to
> mention what changed, e.g. with git send-email --annotate
> - the commit message should follow the standard format of:
> * swscale: short summary of the change
>
> Extended description of the commit, if needed.
>
> --
> Anton Khirnov