[FFmpeg-devel] [PATCH 2/2] h264: integrate clear_blocks calls with IDCT.
Michael Niedermayer
michaelni at gmx.at
Tue Feb 19 16:41:13 CET 2013
On Mon, Feb 18, 2013 at 09:03:02PM -0800, Ronald S. Bultje wrote:
> From: "Ronald S. Bultje" <rsbultje at gmail.com>
>
> The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700
> to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb
> (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip
> tested (cathedral), i.e. almost 30 cycles per mb faster.
> ---
> libavcodec/arm/h264idct_neon.S | 29 +++++++---
> libavcodec/h264.c | 14 ++---
> libavcodec/h264_mb_template.c | 11 +---
> libavcodec/h264addpx_template.c | 4 ++
> libavcodec/h264dsp.c | 4 +-
> libavcodec/h264dsp.h | 4 +-
> libavcodec/h264idct_template.c | 16 ++++--
> libavcodec/h264pred.h | 8 +--
> libavcodec/h264pred_template.c | 28 ++++++----
> libavcodec/ppc/h264_altivec.c | 3 ++
> libavcodec/svq3.c | 4 +-
> libavcodec/x86/h264_idct.asm | 108 ++++++++++++++++++++++++++++---------
> libavcodec/x86/h264_idct_10bit.asm | 53 ++++++++++++++++--
> 13 files changed, 209 insertions(+), 77 deletions(-)
applied
thanks
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
It is dangerous to be right in matters on which the established authorities
are wrong. -- Voltaire
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: Digital signature
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20130219/8238fb53/attachment.asc>
More information about the ffmpeg-devel
mailing list