[FFmpeg-devel] lavc/aarch64: add simple idct neon functions
Matthieu Bouron
matthieu.bouron at gmail.com
Thu Feb 23 17:59:16 EET 2017
Hello,
The following patchset add the ff_simple_idct function neon functions for the
aarch64 platform. It's ported from armv7 simple_idct_neon with some improvements:
* the source idct blocks are now loaded once and kept in v24-v31
* the source idct blocks are no longer overriden in idct_col4_top
* the destination is now written in one pass at the end of
ff_simple_idct{,_put,_add}_neon
It is bitexact with the armv7 neon implementation.
Here are some results (reported by {START,STOP}_TIMER) on an Odroid-C2 (Cortex
A53):
Functions IDCT: simple IDCT: simpleneon
ff_simple_idct_put 9795 units 3170 units
ff_simple_idct_add 10227 units 3302 units
Matthieu
More information about the ffmpeg-devel
mailing list