[FFmpeg-devel] [PATCH v3] checkasm/h264dsp: Fix stack overflow in check_idct_dequant
Zhao Zhili
quinkblack at foxmail.com
Mon Jun 16 14:49:25 EEST 2025
> On Jun 16, 2025, at 19:03, Andreas Rheinhardt <andreas.rheinhardt at outlook.com> wrote:
>
> Zhao Zhili:
>>
>>
>>> On Jun 16, 2025, at 17:46, Andreas Rheinhardt <andreas.rheinhardt at outlook.com> wrote:
>>>
>>> Zhao Zhili:
>>>> From: Zhao Zhili <zhilizhao at tencent.com>
>>>>
>>>> ---
>>>> tests/checkasm/h264dsp.c | 20 +++++++++++++++-----
>>>> 1 file changed, 15 insertions(+), 5 deletions(-)
>>>>
>>>> diff --git a/tests/checkasm/h264dsp.c b/tests/checkasm/h264dsp.c
>>>> index f5f9650224..a0f8fd858a 100644
>>>> --- a/tests/checkasm/h264dsp.c
>>>> +++ b/tests/checkasm/h264dsp.c
>>>> @@ -328,25 +328,35 @@ static void check_idct_multiple(void)
>>>> static void check_idct_dequant(void)
>>>> {
>>>> static const int depths[5] = { 8, 9, 10, 12, 14 };
>>>> - LOCAL_ALIGNED_16(int16_t, src, [16]);
>>>> - /* Ensure dst buffers are large enough to hold dctcoefs of all bit-depths. */
>>>> + /* Ensure buffers are large enough to hold dctcoefs of all bit-depths. */
>>>> + LOCAL_ALIGNED_16(uint8_t, src_buf, [16 * sizeof(int32_t)]);
>>>> LOCAL_ALIGNED_16(uint8_t, dst0, [16 * 16 * sizeof(int32_t)]);
>>>> LOCAL_ALIGNED_16(uint8_t, dst1, [16 * 16 * sizeof(int32_t)]);
>>>> + int16_t *src = (int16_t *)src_buf;
>>>> int16_t *dst_ref = (int16_t *)dst0;
>>>> int16_t *dst_new = (int16_t *)dst1;
>>>> H264DSPContext h;
>>>> int bit_depth, i, qmul;
>>>> declare_func_emms(AV_CPU_FLAG_MMX | AV_CPU_FLAG_SSE2, void, int16_t *output, int16_t *input, int qmul);
>>>>
>>>> - for (int j = 0; j < 16; j++)
>>>> - src[j] = (rnd() % 512) - 256;
>>>> -
>>>> qmul = rnd() % 4096;
>>>>
>>>> for (i = 0; i < FF_ARRAY_ELEMS(depths); i++) {
>>>> bit_depth = depths[i];
>>>> ff_h264dsp_init(&h, bit_depth, 1);
>>>>
>>>> + if (bit_depth == 8) {
>>>> + for (size_t j = 0; j < 16; j++) {
>>>> + int16_t r = (rnd() % 512) - 256;
>>>> + AV_WN16A(&src_buf[j << 1], r);
>>>> + }
>>>> + } else {
>>>> + for (size_t j = 0; j < 16; j++) {
>>>> + int32_t r = (rnd() % (1 << (bit_depth + 1))) - (1 << bit_depth);
>>>> + AV_WN32A(&src_buf[j << 2], r);
>>>> + }
>>>> + }
>>>> +
>>>> memset(dst0, 0, 16 * 16 * SIZEOF_COEF);
>>>> memset(dst1, 0, 16 * 16 * SIZEOF_COEF);
>>>>
>>>
>>> This still has an effective-type violation: src_buf is of type uint8_t,
>>> yet the ff_h264_luma_dc_dequant_idct functions will read it as
>>> int16_t/int32_t. It also still has the downside that buffer overflows
>>> for the 8bit case can go undetected.
>>
>> A bunch of template has cast like
>>
>> pixel *dst = (pixel *)_dst;
>> const pixel *src = (const pixel *)_src;
>>
>> then read and write as int16_t.
>>
>> And a bunch of checkasm use uint8_t[] array on stack as src and dst,
>> which leading to UB.
>>
>> This patch isn’t specific. And this patch add zero UB (it’s there before the patch,
>> both src and dst are accessed as int32_t/int16_t while they are int16_t and uint8_t).
>>
>
> This patch adds UB: src was int16_t before, so that the accesses in the
> eight bit function were fine, but are not with this patch. Anyway, it is
> irrelevant now.
Why it suddenly becomes a big problem access to properly aligned uint8_t *?
I don’t mind to discuss the rules regarding to these violating of strict aliasing,
especially in checkasm. But why it suddenly becomes a rule blocking a patch
trying to fix a fate failure.
I don’t buy the reason "the accesses in the eight bit function were fine”.
>
> - Andreas
>
> _______________________________________________
> ffmpeg-devel mailing list
> ffmpeg-devel at ffmpeg.org
> https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
>
> To unsubscribe, visit link above, or email
> ffmpeg-devel-request at ffmpeg.org with subject "unsubscribe".
More information about the ffmpeg-devel
mailing list