[FFmpeg-devel] [PATCH] Avoid integer to float point domain crossing penalties
Adrian Tong
adriantong1024 at gmail.com
Thu Jun 27 09:01:07 EEST 2019
Anyone interested in reviewing this patch ?
Thanks
-Adrian
On Mon, 24 Jun 2019 at 13:57, <adriantong1024 at gmail.com> wrote:
> From: Adrian Tong <adriantong1024 at gmail.com>
>
> On internal benchmark, I see a noisy-level difference (more likely to be
> an improvement) in ff_h264_decode_mb_cabac which calls this function.
> ---
> libavutil/x86/intreadwrite.h | 4 ++--
> 1 file changed, 2 insertions(+), 2 deletions(-)
>
> diff --git a/libavutil/x86/intreadwrite.h b/libavutil/x86/intreadwrite.h
> index 4061d19231..df0bf45ae1 100644
> --- a/libavutil/x86/intreadwrite.h
> +++ b/libavutil/x86/intreadwrite.h
> @@ -68,8 +68,8 @@ static av_always_inline void AV_COPY128(void *d, const
> void *s)
> {
> struct v {uint64_t v[2];};
>
> - __asm__("movaps %1, %%xmm0 \n\t"
> - "movaps %%xmm0, %0 \n\t"
> + __asm__("movdqa %1, %%xmm0 \n\t"
> + "movdqa %%xmm0, %0 \n\t"
> : "=m"(*(struct v*)d)
> : "m" (*(const struct v*)s)
> : "xmm0");
> --
> 2.20.1 (Apple Git-117)
>
>
More information about the ffmpeg-devel
mailing list