[FFmpeg-devel] [PATCH] avcodec/aarch64/me_cmp: add dotprod implementations of sse16 and vsse_intra16
Ramiro Polla
ramiro.polla at gmail.com
Sat Aug 17 16:33:32 EEST 2024
On Fri, Aug 16, 2024 at 1:16 PM Martin Storsjö <martin at martin.st> wrote:
> On Thu, 15 Aug 2024, Ramiro Polla wrote:
> > checkasm --bench for Raspberry Pi 5 Model B Rev 1.0:
> > sse_0_c: 241.5
> > sse_0_neon: 37.2
> > sse_0_dotprod: 22.2
> > vsse_4_c: 148.7
> > vsse_4_neon: 31.0
> > vsse_4_dotprod: 15.7
> > ---
> > libavcodec/aarch64/me_cmp_init_aarch64.c | 14 +++
> > libavcodec/aarch64/me_cmp_neon.S | 114 +++++++++++++++++++++++
> > 2 files changed, 128 insertions(+)
>
> LGTM, thanks!
Thanks. Pushed.