[FFmpeg-devel] [PATCH 0/5] Provide neon implementation for me_cmp functions
Hubert Mazur
hum at semihalf.com
Thu Aug 18 12:24:55 EEST 2022
Thanks for the review and pointing out the issues. I will check out the
other patches for such things and fix them if needed.
Regards
On Thu, Aug 18, 2022 at 11:08 AM Martin Storsjö <martin at martin.st> wrote:
> On Tue, 16 Aug 2022, Hubert Mazur wrote:
>
> > Add arm64 neon implementation for functions from motion estimation
> > family. All of them were tested and benchmarked using checkasm tool.
> > The rare code paths, e.g. when filter_size % 4 != 0 were also tested.
>
>
> > Instructions were manualy deinterleaved to reach best performance.
>
> You probably mean "interleaved", as deinterleaved would be how it was
> initially, which is detrimental for performance.
>
> Overall I think this patchset is close enough now. There were a bunch of
> minor details left on the patches, but I'll fix that up locally and push
> them, instead of doing yet another round of these. I'll comment and point
> out the details I changed - please pay attention to them for future
> patches though!
>
> // Martin
>
>
More information about the ffmpeg-devel
mailing list