[FFmpeg-devel] [PATCH 5/7] lavc/me_cmp: R-V V vsse vsad

flow gg hlefthleft at gmail.com
Thu Feb 22 05:04:58 EET 2024


.macro vabsaddu dst src tmp
        - vneg.v          \tmp, \src
        - vmax.vv         \tmp, \src, \tmp
        + vfabs.v             \tmp, \src
        vwaddu.wv       \dst, \dst, \tmp
.endm

After making this change, the tests did not pass. I'm not quite clear on
how to understand the differences..

checkasm: 4 of 21 tests have failed
benchmarking with native FFmpeg timers
nop: 7.0
func: vsad_0, x=46 y=13 h=4, error: asm=720974 c=6162
func: vsad_1, x=16 y=14 h=10, error: asm=1146753 c=9353
func: vsad_4, x=13 y=32 h=12, error: asm=2654565 c=14573
func: vsad_5, x=32 y=1 h=8, error: asm=917745 c=3865

Rémi Denis-Courmont <remi at remlab.net> 于2024年2月22日周四 02:07写道:

> Le tiistaina 6. helmikuuta 2024, 17.56.32 EET flow gg a écrit :
> >
>
> Did you try to compute integral absolute values with the ad-hoc (floating
> point) instruction instead of vneg/vmax? It should work since the sign is
> in
> the same place, though I don't know if it will be faster.
>
> --
> レミ・デニ-クールモン
> http://www.remlab.net/
>
>
>
> _______________________________________________
> ffmpeg-devel mailing list
> ffmpeg-devel at ffmpeg.org
> https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
>
> To unsubscribe, visit link above, or email
> ffmpeg-devel-request at ffmpeg.org with subject "unsubscribe".
>


More information about the ffmpeg-devel mailing list