[FFmpeg-devel] [PATCH] lavc/aarch64: Add pix_abs16_x2 neon implementation

Martin Storsjö martin at martin.st
Wed Jul 13 23:29:39 EEST 2022


On Tue, 12 Jul 2022, Hubert Mazur wrote:

> Provide neon implementation for pix_abs16_x2 function.
>
> Performance tests of implementation are below.
> - pix_abs_0_1_c: 283.5
> - pix_abs_0_1_neon: 39.0
>
> Benchmarks and tests run with checkasm tool on AWS Graviton 3.
>
> Signed-off-by: Hubert Mazur <hum at semihalf.com>
> ---
> libavcodec/aarch64/me_cmp_init_aarch64.c |  3 +
> libavcodec/aarch64/me_cmp_neon.S         | 75 ++++++++++++++++++++++++
> 2 files changed, 78 insertions(+)

Thanks, I think this looks good enough to me, thus pushed.

// Martin



More information about the ffmpeg-devel mailing list