[FFmpeg-devel] [PATCH 3/3] aarch64/vvc: Add put_qpel_hv

Martin Storsjö martin at martin.st
Wed Sep 11 15:28:01 EEST 2024


On Wed, 11 Sep 2024, Zhao Zhili wrote:

> From: Zhao Zhili <zhilizhao at tencent.com>
>
> With Apple M1 (no i8mm):
>
> put_luma_hv_8_4x4_c:                                     2.2 ( 1.00x)
> put_luma_hv_8_4x4_neon:                                  0.8 ( 3.00x)
> put_luma_hv_8_8x8_c:                                     7.0 ( 1.00x)
> put_luma_hv_8_8x8_neon:                                  0.8 ( 9.33x)
> put_luma_hv_8_16x16_c:                                  22.8 ( 1.00x)
> put_luma_hv_8_16x16_neon:                                2.5 ( 9.10x)
> put_luma_hv_8_32x32_c:                                  84.8 ( 1.00x)
> put_luma_hv_8_32x32_neon:                                9.5 ( 8.92x)
> put_luma_hv_8_64x64_c:                                 333.0 ( 1.00x)
> put_luma_hv_8_64x64_neon:                               35.5 ( 9.38x)
> put_luma_hv_8_128x128_c:                              1294.5 ( 1.00x)
> put_luma_hv_8_128x128_neon:                            137.8 ( 9.40x)
>
> With Pixel 8 Pro:
>
> put_luma_hv_8_4x4_c:                                     5.0 ( 1.00x)
> put_luma_hv_8_4x4_neon:                                  0.8 ( 6.67x)
> put_luma_hv_8_4x4_i8mm:                                  0.2 (20.00x)
> put_luma_hv_8_8x8_c:                                    13.2 ( 1.00x)
> put_luma_hv_8_8x8_neon:                                  1.2 (10.60x)
> put_luma_hv_8_8x8_i8mm:                                  1.2 (10.60x)
> put_luma_hv_8_16x16_c:                                  44.2 ( 1.00x)
> put_luma_hv_8_16x16_neon:                                4.5 ( 9.83x)
> put_luma_hv_8_16x16_i8mm:                                4.2 (10.41x)
> put_luma_hv_8_32x32_c:                                 160.8 ( 1.00x)
> put_luma_hv_8_32x32_neon:                               17.5 ( 9.19x)
> put_luma_hv_8_32x32_i8mm:                               16.0 (10.05x)
> put_luma_hv_8_64x64_c:                                 611.2 ( 1.00x)
> put_luma_hv_8_64x64_neon:                               68.0 ( 8.99x)
> put_luma_hv_8_64x64_i8mm:                               62.2 ( 9.82x)
> put_luma_hv_8_128x128_c:                              2384.8 ( 1.00x)
> put_luma_hv_8_128x128_neon:                            268.8 ( 8.87x)
> put_luma_hv_8_128x128_i8mm:                            245.8 ( 9.70x)
> ---
> libavcodec/aarch64/h26x/dsp.h       |   8 ++
> libavcodec/aarch64/h26x/qpel_neon.S | 140 ++++++++++++++++++++++++++++
> libavcodec/aarch64/vvc/dsp_init.c   |  14 +++
> 3 files changed, 162 insertions(+)

Ok

// Martin



More information about the ffmpeg-devel mailing list