[FFmpeg-devel] [PATCH 0/5] RISC-V: Improve H264 decoding performance using RVV intrinsic
Arnie Chang
arnie.chang at sifive.com
Tue May 9 12:50:25 EEST 2023
We are submitting a set of patches that significantly improve H.264 decoding performance
by utilizing RVV intrinsic code. The average speedup(FPS) achieved by these patches is more than 2x,
as experimented on 720P videos running on an internal FPGA board.
Patch1: add support for RVV intrinsic code in the configure file
Patch2: optimize chroma motion compensation
Patch3: optimize luma motion compensation
Patch4: optimize dsp functions, such as IDCT, in-loop filtering, and weighed filtering
Patch5: optimize intra prediction
Arnie Chang (5):
configure: Add detection of RISC-V vector intrinsic support
lavc/h264chroma: Add vectorized implementation of chroma MC for RISC-V
lavc/h264qpel: Add vectorized implementation of luma MC for RISC-V
lavc/h264dsp: Add vectorized implementation of DSP functions for
RISC-V
lavc/h264pred: Add vectorized implementation of intra prediction for
RISC-V
configure | 2 +
libavcodec/h264chroma.c | 2 +
libavcodec/h264chroma.h | 1 +
libavcodec/h264dsp.c | 2 +
libavcodec/h264dsp.h | 3 +-
libavcodec/h264pred.c | 2 +
libavcodec/h264pred.h | 3 +-
libavcodec/h264qpel.c | 2 +
libavcodec/h264qpel.h | 1 +
libavcodec/riscv/Makefile | 11 +
libavcodec/riscv/h264_chroma_init_riscv.c | 45 +
libavcodec/riscv/h264_dsp_init_riscv.c | 68 ++
libavcodec/riscv/h264_idct.c | 482 +++++++++
libavcodec/riscv/h264_idct.h | 46 +
libavcodec/riscv/h264_inloop.c | 669 ++++++++++++
libavcodec/riscv/h264_inloop.h | 47 +
libavcodec/riscv/h264_lowpass.h | 249 +++++
libavcodec/riscv/h264_mc_chroma.c | 821 ++++++++++++++
libavcodec/riscv/h264_mc_chroma.h | 40 +
libavcodec/riscv/h264_mc_luma.c | 412 +++++++
libavcodec/riscv/h264_mc_luma.h | 101 ++
libavcodec/riscv/h264_mc_luma_avg16.h | 1183 +++++++++++++++++++++
libavcodec/riscv/h264_mc_luma_avg8.h | 773 ++++++++++++++
libavcodec/riscv/h264_mc_luma_put16.h | 963 +++++++++++++++++
libavcodec/riscv/h264_mc_luma_put8.h | 648 +++++++++++
libavcodec/riscv/h264_pred.c | 884 +++++++++++++++
libavcodec/riscv/h264_pred.h | 53 +
libavcodec/riscv/h264_pred_init_riscv.c | 67 ++
libavcodec/riscv/h264_qpel_init_riscv.c | 107 ++
libavcodec/riscv/h264_utility.h | 75 ++
libavcodec/riscv/h264_weighted_sum.c | 273 +++++
libavcodec/riscv/h264_weighted_sum.h | 47 +
32 files changed, 8080 insertions(+), 2 deletions(-)
create mode 100644 libavcodec/riscv/h264_chroma_init_riscv.c
create mode 100644 libavcodec/riscv/h264_dsp_init_riscv.c
create mode 100644 libavcodec/riscv/h264_idct.c
create mode 100644 libavcodec/riscv/h264_idct.h
create mode 100644 libavcodec/riscv/h264_inloop.c
create mode 100644 libavcodec/riscv/h264_inloop.h
create mode 100644 libavcodec/riscv/h264_lowpass.h
create mode 100644 libavcodec/riscv/h264_mc_chroma.c
create mode 100644 libavcodec/riscv/h264_mc_chroma.h
create mode 100644 libavcodec/riscv/h264_mc_luma.c
create mode 100644 libavcodec/riscv/h264_mc_luma.h
create mode 100644 libavcodec/riscv/h264_mc_luma_avg16.h
create mode 100644 libavcodec/riscv/h264_mc_luma_avg8.h
create mode 100644 libavcodec/riscv/h264_mc_luma_put16.h
create mode 100644 libavcodec/riscv/h264_mc_luma_put8.h
create mode 100644 libavcodec/riscv/h264_pred.c
create mode 100644 libavcodec/riscv/h264_pred.h
create mode 100644 libavcodec/riscv/h264_pred_init_riscv.c
create mode 100644 libavcodec/riscv/h264_qpel_init_riscv.c
create mode 100644 libavcodec/riscv/h264_utility.h
create mode 100644 libavcodec/riscv/h264_weighted_sum.c
create mode 100644 libavcodec/riscv/h264_weighted_sum.h
--
2.17.1
More information about the ffmpeg-devel
mailing list