[FFmpeg-devel] lavc/aarch64: add simple idct neon functions
Matthieu Bouron
matthieu.bouron at gmail.com
Mon Mar 6 16:48:57 EET 2017
On Thu, Feb 23, 2017 at 04:59:16PM +0100, Matthieu Bouron wrote:
> Hello,
>
> The following patchset add the ff_simple_idct function neon functions for the
> aarch64 platform. It's ported from armv7 simple_idct_neon with some improvements:
> * the source idct blocks are now loaded once and kept in v24-v31
> * the source idct blocks are no longer overriden in idct_col4_top
> * the destination is now written in one pass at the end of
> ff_simple_idct{,_put,_add}_neon
>
> It is bitexact with the armv7 neon implementation.
>
> Here are some results (reported by {START,STOP}_TIMER) on an Odroid-C2 (Cortex
> A53):
>
> Functions IDCT: simple IDCT: simpleneon
> ff_simple_idct_put 9795 units 3170 units
> ff_simple_idct_add 10227 units 3302 units
>
Ping.
More information about the ffmpeg-devel
mailing list