[FFmpeg-devel] Discrepancy between comments for AVX512 flags
James Darnley
jdarnley at obe.tv
Sat Aug 27 01:01:52 EEST 2022
While cherry-picking some stuff for avx512 I have noticed that ffmpeg
has a discrepancy in the comments for the two avx512 flags.
Let's start with the public header:
> libavutil/cpu.h
> 56│ #define AV_CPU_FLAG_AVX512 0x100000 ///< AVX-512 functions: requires OS support even if YMM/ZMM registers aren't used
> 57│ #define AV_CPU_FLAG_AVX512ICL 0x200000 ///< F/CD/BW/DQ/VL/VNNI/IFMA/VBMI/VBMI2/VPOPCNTDQ/BITALG/GFNI/VAES/VPCLMULQDQ
This seems to imply that the first flag only detects ZMM support and that
the second groups all the instruction sets together. That appears to differ
from what we imply in the internal code:
> libavutil/x86/cpu.c
> 151│ #if HAVE_AVX512 /* F, CD, BW, DQ, VL */
> libavutil/x86/x86inc.asm
> 840│ %assign cpuflags_avx512 (1<<20)| cpuflags_avx2 ; F, CD, BW, DQ, VL
The detection code itself has:
> libavutil/x86/cpu.c
> 151│ #if HAVE_AVX512 /* F, CD, BW, DQ, VL */
> 152│ if ((xcr0_lo & 0xe0) == 0xe0) { /* OPMASK/ZMM state */
> 153│ if ((rval & AV_CPU_FLAG_AVX2) && (ebx & 0xd0030000) == 0xd0030000) {
> 154│ rval |= AV_CPU_FLAG_AVX512;
> 155│ #if HAVE_AVX512ICL
> 156│ if ((ebx & 0xd0200000) == 0xd0200000 && (ecx & 0x5f42) == 0x5f42)
> 157│ rval |= AV_CPU_FLAG_AVX512ICL;
If you decode the bits being checked, you'll see that the base avx512
flag checks ebx for F, DQ, CD, BW and VL, while avx512icl checks ebx for
IFMA, CD, BW and VL, and ecx for VBMI, VBMI2, GFNI, VAES, VPCLMULQDQ, VNNI,
BITALG and VPOPCNTDQ. The base avx512 check matches what the internal
comments imply, not what the public header says.
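
For anyone who wants to repeat the decoding, here is a minimal sketch in C. It is not FFmpeg code: the tables and the decode_mask() helper are hypothetical names made up for illustration, and the bit positions are the CPUID leaf 7 (sub-leaf 0) assignments as I understand them from the Intel SDM.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* One named feature bit of a CPUID output register. */
struct feature_bit {
    unsigned    bit;
    const char *name;
};

/* CPUID leaf 7, sub-leaf 0, EBX: only the AVX-512 bits relevant here. */
static const struct feature_bit leaf7_ebx[] = {
    { 16, "F"  }, { 17, "DQ" }, { 21, "IFMA" },
    { 28, "CD" }, { 30, "BW" }, { 31, "VL"   },
};

/* CPUID leaf 7, sub-leaf 0, ECX: only the bits relevant here. */
static const struct feature_bit leaf7_ecx[] = {
    {  1, "VBMI"       }, {  6, "VBMI2" }, {  8, "GFNI"   }, {  9, "VAES"      },
    { 10, "VPCLMULQDQ" }, { 11, "VNNI"  }, { 12, "BITALG" }, { 14, "VPOPCNTDQ" },
};

/* Hypothetical helper: write the names of all bits set in mask to out,
 * lowest bit first, separated by spaces. A bit that is not listed in the
 * table is written as "bitN". */
static void decode_mask(uint32_t mask, const struct feature_bit *table,
                        size_t count, char *out, size_t out_size)
{
    size_t len = 0;

    assert(out_size > 0);
    out[0] = '\0';

    for (unsigned bit = 0; bit < 32; bit++) {
        const char *name = NULL;

        if (!(mask & (UINT32_C(1) << bit)))
            continue;

        for (size_t i = 0; i < count; i++)
            if (table[i].bit == bit)
                name = table[i].name;

        if (name)
            len += snprintf(out + len, out_size - len, "%s%s",
                            len ? " " : "", name);
        else
            len += snprintf(out + len, out_size - len, "%sbit%u",
                            len ? " " : "", bit);

        if (len >= out_size) /* output truncated, stop here */
            return;
    }
}
```

Passing the EBX masks 0xd0030000 and 0xd0200000 together with leaf7_ebx, and the ECX mask 0x5f42 together with leaf7_ecx, should reproduce the three feature lists given above.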
Part of the difference is my fault and dates from when the flag was
first added.
Has there been a discussion about which features should go with which flag?