[FFmpeg-devel] [PATCH v2 0/7] arm64 neon implementation for 8bits functions

Grzegorz Bernacki gjb at semihalf.com
Mon Oct 3 17:10:13 EEST 2022


Changes since v1:

- changed tabs to spaces
- modified branch instruction in vsse8
- apply Martin's patches with improved instructions scheduling 

Grzegorz Bernacki (4):
  lavc/aarch64: Add neon implementation for pix_abs8 functions.
  lavc/aarch64: Provide neon implementation of nsse8
  lavc/aarch64: Provide optimized implementation of vsse8 for arm64.
  lavc/aarch64: Add neon implementation for vsse_intra8

Martin Storsjö (3):
  aarch64: me_cmp: Improve scheduling in ff_pix_abs8_y2_neon
  aarch64: me_cmp: Fix up the prologue of ff_pix_abs8_xy2_neon
  aarch64: me_cmp: Improve scheduling in vsse_intra8

 libavcodec/aarch64/me_cmp_init_aarch64.c |  33 ++
 libavcodec/aarch64/me_cmp_neon.S         | 414 +++++++++++++++++++++++
 2 files changed, 447 insertions(+)

-- 
2.37.1



More information about the ffmpeg-devel mailing list