[FFmpeg-devel] [PATCH v2] aarch64/opusdsp: implement NEON accelerated postfilter and deemphasis
Lynne
dev at lynne.ee
Sat Apr 6 17:08:15 EEST 2019
Apr 6, 2019, 2:42 PM by ceffmpeg at gmail.com:
> 2019-04-02 13:47 GMT+02:00, Lynne <> dev at lynne.ee <mailto:dev at lynne.ee>> >:
>
>> 153372 UNITS in postfilter_c, 65536 runs, 0 skips
>> 73164 UNITS in postfilter_neon, 65536 runs, 0 skips -> 2.1x speedup
>>
>> 80591 UNITS in deemphasis_c, 131072 runs, 0 skips
>> 43969 UNITS in deemphasis_neon, 131072 runs, 0 skips -> 1.83x speedup
>>
>> Total decoder speedup: ~15% on a Raspberry Pi 3 (from
>> 28.1x to 33.5x realtime)
>>
>
> How can I reproduce this?
> On something called "X-Gene" Linux, I see a 2,5% overall
> speedup (126x to 129x)
>
> The issue is that there is some probability that your patch will
> break mac and Windows compilation, so I would at least
> like to understand why I can not reproduce your speedup.
>
I tested this on real hardware.
How can this break mac support at all when it doesn't support aarch64? And I can't find a windows aarch64 compiler either.
More information about the ffmpeg-devel
mailing list