[FFmpeg-devel] [FFmpeg-cvslog] swscale/aarch64: Fix rgb24toyv12 only works with aligned width

Martin Storsjö martin at martin.st
Wed Sep 25 22:53:48 EEST 2024


On Wed, 25 Sep 2024, Zhao Zhili wrote:

>> On Sep 25, 2024, at 16:01, Martin Storsjö <martin at martin.st> wrote:
>> 
>> On Tue, 24 Sep 2024, Zhao Zhili wrote:
>> 
>>> ffmpeg | branch: master | Zhao Zhili <zhilizhao at tencent.com> | Wed Sep 18 21:11:44 2024 +0800| [e18b46d95fadcbaaf450bda9f1871849f2b0c586] | committer: Zhao Zhili
>>> 
>>> swscale/aarch64: Fix rgb24toyv12 only works with aligned width
>>> 
>>> Since c0666d8b, rgb24toyv12 is broken for width non-aligned to 16.
>>> Add a simple wrapper to handle the non-aligned part.
>>> 
>>> Co-authored-by: johzzy <hellojinqiang at gmail.com>
>>> Signed-off-by: Zhao Zhili <zhilizhao at tencent.com>
>>> 
>>>> http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commit;h=e18b46d95fadcbaaf450bda9f1871849f2b0c586
>>> ---
>>> 
>>> libswscale/aarch64/rgb2rgb.c | 23 ++++++++++++++++++++++-
>>> tests/checkasm/sw_rgb.c      |  2 +-
>>> 2 files changed, 23 insertions(+), 2 deletions(-)
>>> --- a/tests/checkasm/sw_rgb.c
>>> +++ b/tests/checkasm/sw_rgb.c
>>> @@ -129,7 +129,7 @@ static int cmp_off_by_n(const uint8_t *ref, const uint8_t *test, size_t n, int a
>>> 
>>> static void check_rgb24toyv12(struct SwsContext *ctx)
>>> {
>>> -    static const int input_sizes[] = {16, 128, 512, MAX_LINE_SIZE, -MAX_LINE_SIZE};
>>> +    static const int input_sizes[] = {2, 16, 128, 540, MAX_LINE_SIZE, -MAX_LINE_SIZE};
>>>
>>>    LOCAL_ALIGNED_32(uint8_t, src, [BUFSIZE * 3]);
>>>    LOCAL_ALIGNED_32(uint8_t, buf_y_0, [BUFSIZE]);
>> 
>> These new test cases fail on x86_32; we have got a version of rgb24toyv12 which is specific to "#if ARCH_X86_32 && HAVE_7REGS".
>> 
>> Can you have a look?
>
> Sorry for the break. I’m on a short vacation without access to x86_32 
> test environment. And I’m not familiar with x86 asm. I’m afraid removing 
> the new test is what I can do for now, if that’s an option.

Thanks - yeah I think that's the practically best thing to do at the 
moment. I guess this assembly has existed in this form for a very long 
time already, so while it probably is incorrect for these cases, it 
doesn't seem to be an urgent thing. (But I guess whatever case that was 
noted on aarch64 also would be noted on x86_32?) So silencing the test for 
now probably is simplest, until the assembly can be fixed.

Or we could ifdef out these uneven cases for ARCH_X86_32, but that's also 
kinda ugly...

// Martin


More information about the ffmpeg-devel mailing list