[FFmpeg-devel] [PATCH] arm32/neon: Avoid using bge/beq for function calls

Martin Storsjö martin at martin.st
Mon Jan 9 23:48:11 EET 2023


On Mon, 9 Jan 2023, Martin Storsjö wrote:

> Hi Rui,
>
> Long time no see!
>
> On Sat, 7 Jan 2023, Rui Ueyama wrote:
>
>> It looks like compiler-generated code always uses `b`, `bl` or `blx`
>> instructions for function calls. These instructions have a 24-bit
>> immediate and therefore can jump anywhere between PC +- 16 MiB.
>> 
>> This hand-written assembly code instead uses `bge` and `beq` for
>> interprocedural jumps. Since these instructions have only a 19-bit
>> immediate (we have less bits for condition code), they can jump only
>> within PC +- 512 KiB. This sometimes causes a "relocation R_ARM_THM_JUMP19
>> out of range" error when linked with the mold linker. This error can
>> easily be avoided by using `b` instead of `bge` or `beq`.
>
> Can you add a bit more explanation about what happens in mold in this case 
> and context about the setup - I don't quite understand how this can happen 
> (even if the code admittedly is a bit unusual)?
>
> Since .L_swri_oldapi_conv_flt_to_s16_neon and 
> .L_swri_oldapi_conv_fltp_to_s16_2ch_neon are local symbols, they don't get 
> emitted by the assembler, and the branch instructions are encoded with fixed 
> offsets and no relocations. And even if there would be a relocation, the 
> destination is within the same text section chunk in the object file, so it 
> shouldn't be possible for it to be out of range.
>
> The only possibility for this to be out of range, is if the destination is 
> treated as a global and routed via the PLC?
>
> What am I missing here?

In particular, it seems like the commits 
b22db4f465c9adb2cf1489e04f7b65ef6bb55b8b and 
e84212b78e00df17799e01be1e153a073eb8f689 were introduced to fix exactly 
this issue - by converting references from using the external global 
symbols into local labels instead.

// Martin


More information about the ffmpeg-devel mailing list