[FFmpeg-devel] [PATCH] avcodec/aarch64/me_cmp: add dotprod implementations of sse16 and vsse_intra16
Martin Storsjö
martin at martin.st
Fri Aug 16 14:06:17 EEST 2024
On Thu, 15 Aug 2024, Ramiro Polla wrote:
> checkasm --bench for Raspberry Pi 5 Model B Rev 1.0:
> sse_0_c: 241.5
> sse_0_neon: 37.2
> sse_0_dotprod: 22.2
> vsse_4_c: 148.7
> vsse_4_neon: 31.0
> vsse_4_dotprod: 15.7
> ---
> libavcodec/aarch64/me_cmp_init_aarch64.c | 14 +++
> libavcodec/aarch64/me_cmp_neon.S | 114 +++++++++++++++++++++++
> 2 files changed, 128 insertions(+)
LGTM, thanks!
// Martin
More information about the ffmpeg-devel
mailing list