[FFmpeg-devel] [PATCH] af_afir: RISC-V V fcmul_add
Rémi Denis-Courmont
remi at remlab.net
Tue Sep 26 21:34:58 EEST 2023
Le tiistaina 26. syyskuuta 2023, 12.24.58 EEST flow gg a écrit :
> benchmark:
> fcmul_add_c: 19.7
> fcmul_add_rvv_f32: 6.7
Nit: please pad mnemonics to at least 8 columns for consistency.
I'm a bit surprised that the performance improves this much, considering that
the C910 is notoriously bad at both segmented strided loads. It might be that
the C versions is just very bad due to lack of aliasing optimisations. Oh
well.
Note that you could do the double versions with very little extra efforts.
--
雷米‧德尼-库尔蒙
http://www.remlab.net/
More information about the ffmpeg-devel
mailing list