[FFmpeg-devel] [PATCH] libavutil: add clean aperture (CLAP) side data.

Kieran O Leary kieran.o.leary at gmail.com
Tue May 5 15:10:44 EEST 2020


Hi,

I broke the threading with my last reply, i apologise. Here goes another
attempt:

On Tue, Apr 28, 2020 at 6:23 PM Neil Birkbeck <neil.birkbeck at gmail.com>
wrote:

> On Tue, Apr 28, 2020 at 3:18 AM Nicolas George <george at nsup.org> wrote:
>
> > Andreas Rheinhardt (12020-04-28):
> > > That's expected. The patch provided only provides the structure in
> which
> > > the values are intended to be exported; it does not add any demuxing or
> > > muxing capabilities for mov or mkv (as you can see from the fact that
> > > none of these (de)muxers have been changed in the patch).
> >
> > Which is something I intended to comment on: adding code without users
> > is rarely a good idea. I suggest we do not commit until at least one
> > demuxer use it, preferably at least two. Otherwise, we may realize that
> > “oh crap, it doesn't work” because of a tiny unforeseen detail.
>
>
> Thanks for the feedback. I also have patches for the demux (MOV/MKV) and
> mux (MOV/MKV).
>
> As there is still the alternative of using the fields in the
> AVCodecParameters/AVCodecContext, my intention was to keep the first patch
> small to resolve discussion on that point.
>
> I've included the patches, if you'd like to try test it, Kieren. I see on
> your test file that there may be some slight rounding error making output
> crop 704 not 703 (MKV file ends up with pixel_crop_{left,right} = 8).
>
> /ffprobe ../testdata/clap.mov 2>&1 | grep -A1 "Side"
>     Side data:
>       Clean aperture:[width 41472/59 height:576/1 h_offset:0/1
> v_offset:0/1]
> ./ffmpeg -i ../testdata/clap.mov  -vcodec copy -acodec copy /tmp/clap.mkv
> ./ffprobe /tmp/clap.mkv 2>&1 | grep -A1 "Side"
>     Side data:
>       Clean aperture:[width 704/1 height:576/1 h_offset:0/1 v_offset:0/1]
>

I have to look deeper into the MKV side of things and most likely raise it
with the cellar mailing list so that better minds than mine can weigh in. I
do see that the rounding up to 704 could be an issue alright.
As for MOV, your patch appears to generate the same output clap values as
the input, so that's really great! command line and mediainfo trace below:
$ ./ffmpeg -i /mnt/c/Users/blaaa/Downloads/clap.mov -c:v v210 newv210.mov
&& mediainfo --Details=1 newv210.mov |grep -i clap -n10
ffmpeg version N-97506-g2fae000994 Copyright (c) 2000-2020 the FFmpeg
developers
  built with gcc 7 (Ubuntu 7.5.0-3ubuntu1~18.04)
  configuration:
  libavutil      56. 43.100 / 56. 43.100
  libavcodec     58. 82.100 / 58. 82.100
  libavformat    58. 42.101 / 58. 42.101
  libavdevice    58.  9.103 / 58.  9.103
  libavfilter     7. 79.100 /  7. 79.100
  libswscale      5.  6.101 /  5.  6.101
  libswresample   3.  6.100 /  3.  6.100
Guessed Channel Layout for Input Stream #0.1 : stereo
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from
'/mnt/c/Users/blaaa/Downloads/clap.mov':
  Metadata:
    major_brand     : qt
    minor_version   : 537199360
    compatible_brands: qt
    creation_time   : 2018-09-13T08:48:41.000000Z
  Duration: 00:00:01.00, start: 0.000000, bitrate: 80686 kb/s
    Stream #0:0(eng): Video: v210 (v210 / 0x30313276),
yuv422p10le(smpte170m/bt470bg/bt709, top coded first (swapped)), 720x576,
79626 kb/s, SAR 59:54 DAR 295:216, 9 fps, 25 tbr, 25k tbn, 25k tbc (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : ?Apple Video Media Handler
      encoder         : Uncompressed 10-Bit YUV
      timecode        : 00:00:00:00
    Side data:
      Clean aperture:[width 41472/59 height:576/1 h_offset:0/1 v_offset:0/1]
    Stream #0:1(eng): Audio: pcm_s24le (in24 / 0x34326E69), 48000 Hz,
stereo, s32 (24 bit), 2304 kb/s (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : ?Apple Sound Media Handler
      timecode        : 00:00:00:00
    Stream #0:2(eng): Data: none (tmcd / 0x64636D74), 0 kb/s (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : Time Code Media Handler
      reel_name       : 001
      timecode        : 00:00:00:00
File 'newv210.mov' already exists. Overwrite? [y/N] y
Stream mapping:
  Stream #0:0 -> #0:0 (v210 (native) -> v210 (native))
  Stream #0:1 -> #0:1 (pcm_s24le (native) -> aac (native))
Press [q] to stop, [?] for help
Output #0, mov, to 'newv210.mov':
  Metadata:
    major_brand     : qt
    minor_version   : 537199360
    compatible_brands: qt
    encoder         : Lavf58.42.101
    Stream #0:0(eng): Video: v210 (v210 / 0x30313276), yuv422p10le(top
coded first (swapped)), 720x576 [SAR 59:54 DAR 295:216], q=2-31, 221184
kb/s, 0.04 fps, 12800 tbn, 25 tbc (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : ?Apple Video Media Handler
      timecode        : 00:00:00:00
      encoder         : Lavc58.82.100 v210
    Side data:
      Clean aperture:[width 41472/59 height:576/1 h_offset:0/1 v_offset:0/1]
    Stream #0:1(eng): Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz,
stereo, fltp (24 bit), 128 kb/s (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : ?Apple Sound Media Handler
      timecode        : 00:00:00:00
      encoder         : Lavc58.82.100 aac
frame=   25 fps=0.0 q=-0.0 Lsize=   27009kB time=00:00:00.96
bitrate=230455.2kbits/s dup=16 drop=0 speed=19.8x
video:27000kB audio:6kB subtitle:0kB other streams:0kB global headers:0kB
muxing overhead: 0.008794%
[aac @ 0x7fffe91cc680] Qavg: 171.537
226-1A5FBA4         detail:                         9 (0x09)
227-1A5FBA5        Pixel Aspect Ratio (16 bytes)
228-1A5FBA5         Header (8 bytes)
229-1A5FBA5          Size:                          16 (0x00000010)
230-1A5FBA9          Name:                          pasp
231-1A5FBAD         hSpacing:                       59 (0x0000003B)
232-1A5FBB1         vSpacing:                       54 (0x00000036)
233-1A5FBB5        Clean Aperture (40 bytes)
234-1A5FBB5         Header (8 bytes)
235-1A5FBB5          Size:                          40 (0x00000028)
236:1A5FBB9          Name:                          clap
237-1A5FBBD         apertureWidth_N:                41472 (0x0000A200)
238-1A5FBC1         apertureWidth_D:                59 (0x0000003B)
239-1A5FBC5         apertureHeight_N:               576 (0x00000240)
240-1A5FBC9         apertureHeight_D:               1 (0x00000001)
241-1A5FBCD         horizOff_N:                     0 (0x00000000)
242-1A5FBD1         horizOff_D:                     1 (0x00000001)
243-1A5FBD5         vertOff_N:                      0 (0x00000000)
244-1A5FBD9         vertOff_D:                      1 (0x00000001)
245-1A5FBDD      Time to Sample (24 bytes)
246-1A5FBDD       Header (8 bytes)


and here's similar info for mkv:

 ./ffmpeg -i /mnt/c/Users/blaaa/Downloads/clap.mov -c:v v210 newv210.mkv &&
m
ediainfo --Details=1 newv210.mkv |grep -i pixel -n10
ffmpeg version N-97506-g2fae000994 Copyright (c) 2000-2020 the FFmpeg
developers
  built with gcc 7 (Ubuntu 7.5.0-3ubuntu1~18.04)
  configuration:
  libavutil      56. 43.100 / 56. 43.100
  libavcodec     58. 82.100 / 58. 82.100
  libavformat    58. 42.101 / 58. 42.101
  libavdevice    58.  9.103 / 58.  9.103
  libavfilter     7. 79.100 /  7. 79.100
  libswscale      5.  6.101 /  5.  6.101
  libswresample   3.  6.100 /  3.  6.100
Guessed Channel Layout for Input Stream #0.1 : stereo
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from
'/mnt/c/Users/blaaa/Downloads/clap.mov':
  Metadata:
    major_brand     : qt
    minor_version   : 537199360
    compatible_brands: qt
    creation_time   : 2018-09-13T08:48:41.000000Z
  Duration: 00:00:01.00, start: 0.000000, bitrate: 80686 kb/s
    Stream #0:0(eng): Video: v210 (v210 / 0x30313276),
yuv422p10le(smpte170m/bt470bg/bt709, top coded first (swapped)), 720x576,
79626 kb/s, SAR 59:54 DAR 295:216, 9 fps, 25 tbr, 25k tbn, 25k tbc (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : ?Apple Video Media Handler
      encoder         : Uncompressed 10-Bit YUV
      timecode        : 00:00:00:00
    Side data:
      Clean aperture:[width 41472/59 height:576/1 h_offset:0/1 v_offset:0/1]
    Stream #0:1(eng): Audio: pcm_s24le (in24 / 0x34326E69), 48000 Hz,
stereo, s32 (24 bit), 2304 kb/s (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : ?Apple Sound Media Handler
      timecode        : 00:00:00:00
    Stream #0:2(eng): Data: none (tmcd / 0x64636D74), 0 kb/s (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : Time Code Media Handler
      reel_name       : 001
      timecode        : 00:00:00:00
File 'newv210.mkv' already exists. Overwrite? [y/N] y
Stream mapping:
  Stream #0:0 -> #0:0 (v210 (native) -> v210 (native))
  Stream #0:1 -> #0:1 (pcm_s24le (native) -> ac3 (native))
Press [q] to stop, [?] for help
Output #0, matroska, to 'newv210.mkv':
  Metadata:
    major_brand     : qt
    minor_version   : 537199360
    compatible_brands: qt
    encoder         : Lavf58.42.101
    Stream #0:0(eng): Video: v210 (v210 / 0x30313276), yuv422p10le(top
coded first (swapped)), 720x576 [SAR 59:54 DAR 295:216], q=2-31, 221184
kb/s, 25 fps, 1k tbn, 25 tbc (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : ?Apple Video Media Handler
      timecode        : 00:00:00:00
      encoder         : Lavc58.82.100 v210
    Side data:
      Clean aperture:[width 41472/59 height:576/1 h_offset:0/1 v_offset:0/1]
    Stream #0:1(eng): Audio: ac3 ([0] [0][0] / 0x2000), 48000 Hz, stereo,
fltp (24 bit), 192 kb/s (default)
    Metadata:
      creation_time   : 2018-09-13T08:48:41.000000Z
      handler_name    : ?Apple Sound Media Handler
      timecode        : 00:00:00:00
      encoder         : Lavc58.82.100 ac3
frame=    9 fps=0.0 q=-0.0 Lsize=    9731kB time=00:00:00.41
bitrate=193962.2kbits/s speed=10.6x
video:9720kB audio:10kB subtitle:0kB other streams:0kB global headers:0kB
muxing overhead: 0.015487%
193-000144     Data:                               40000000 (0x02625A00)
194-000148    CodecID - V_MS/VFW/FOURCC (17 bytes)
195-000148     Header (2 bytes)
196-000148      Name:                              6 (0x06)
197-000149      Size:                              15 (0x0F)
198-00014A     Data:                               V_MS/VFW/FOURCC
199-000159    Video (52 bytes)
200-000159     Header (9 bytes)
201-000159      Name:                              96 (0x60)
202-00015A      Size:                              43 (0x0000000000002B)
203:000162     PixelWidth - 720 (0x2D0) (4 bytes)
204-000162      Header (2 bytes)
205-000162       Name:                             48 (0x30)
206-000163       Size:                             2 (0x02)
207-000164      Data:                              720 (0x02D0)
208:000166     PixelHeight - 576 (0x240) (4 bytes)
209-000166      Header (2 bytes)
210-000166       Name:                             58 (0x3A)
211-000167       Size:                             2 (0x02)
212-000168      Data:                              576 (0x0240)
213-00016A     FlagInterlaced - 1 (0x1) (3 bytes)
214-00016A      Header (2 bytes)
215-00016A       Name:                             26 (0x1A)
216-00016B       Size:                             1 (0x01)
217-00016C      Data:                              1 (0x01)
218-00016D     FieldOrder - 9 (0x9) (3 bytes)
--
228-000175     DisplayHeight - 216 (0xD8) (4 bytes)
229-000175      Header (3 bytes)
230-000175       Name:                             5306 (0x14BA)
231-000177       Size:                             1 (0x01)
232-000178      Data:                              216 (0xD8)
233-000179     DisplayUnit - 3 (0x3) (4 bytes)
234-000179      Header (3 bytes)
235-000179       Name:                             5298 (0x14B2)
236-00017B       Size:                             1 (0x01)
237-00017C      Data:                              3 (0x03)
238:00017D     PixelCropLeft - 8 (0x8) (4 bytes)
239-00017D      Header (3 bytes)
240-00017D       Name:                             5324 (0x14CC)
241-00017F       Size:                             1 (0x01)
242-000180      Data:                              8 (0x08)
243:000181     PixelCropRight - 8 (0x8) (4 bytes)
244-000181      Header (3 bytes)
245-000181       Name:                             5341 (0x14DD)
246-000183       Size:                             1 (0x01)
247-000184      Data:                              8 (0x08)
248:000185     PixelCropBottom - 0 (0x0) (4 bytes)
249-000185      Header (3 bytes)
250-000185       Name:                             5290 (0x14AA)
251-000187       Size:                             1 (0x01)
252-000188      Data:                              0 (0x00)
253:000189     PixelCropTop - 0 (0x0) (4 bytes)
254-000189      Header (3 bytes)
255-000189       Name:                             5307 (0x14BB)
256-00018B       Size:                             1 (0x01)
257-00018C      Data:                              0 (0x00)
258-00018D    CodecPrivate - Copy of vids (43 bytes)
259-00018D     Header (3 bytes)
260-00018D      Name:                              9122 (0x23A2)
261-00018F      Size:                              40 (0x28)
262-000190     Size:                               40 (0x00000028)
263-000194     Width:                              720 (0x000002D0)

Best,

Kieran O'Leary
Irish Film Institute


More information about the ffmpeg-devel mailing list