[FFmpeg-devel] [PATCH] avcodec/x86/hevc: fix luma 12b overflow

Sun Feb 25 18:41:39 EET 2024

Hi,

On Sun, Feb 25, 2024 at 11:28 AM James Almer <jamrial at gmail.com> wrote:

> On 2/25/2024 1:22 PM, Ronald S. Bultje wrote:
> > On Sun, Feb 25, 2024 at 10:56 AM Ronald S. Bultje <rsbultje at gmail.com>
> > wrote:
> >
> >> Hi,
> >>
> >> On Sun, Feb 25, 2024 at 3:28 AM J. Dekker <jdek at itanimul.li> wrote:
> >>
> >>> Weak filter can overflow in delta0 calculation before >> 4 in int16.
> >>>
> >>> Signed-off-by: J. Dekker <jdek at itanimul.li>
> >>> ---
> >>>
> >>>   I do not know x86 simd at all, so this is just an attempt to fix
> >>>   the implementation rather than write extremely performant code.
> >>>
> >>>   Suggestions welcome.
> >>>
> >>
> >> https://pastebin.com/KvcbQ2nK
> >>
> >
> > Attached a slightly adjusted version which does sse2 in 16bit also.
> >
> > Ronald
>
> > diff --git a/libavcodec/x86/hevc_deblock.asm
> b/libavcodec/x86/hevc_deblock.asm
> > index 85ee4800bb..869301caff 100644
> > --- a/libavcodec/x86/hevc_deblock.asm
> > +++ b/libavcodec/x86/hevc_deblock.asm
> > @@ -31,6 +31,7 @@ cextern pw_1023
> >  pw_pixel_max_12: times 8 dw ((1 << 12)-1)
> >  pw_m2:           times 8 dw -2
> >  pd_1 :           times 4 dd  1
> > +pd_8 :           times 8 dd  8
>
> This is unused.
>

Fixed.

Ronald
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-hevc-x86-deblock-fix-12bit-overflow.patch
Type: application/octet-stream
Size: 2294 bytes
Desc: not available
URL: <https://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20240225/ef516978/attachment.obj>