[FFmpeg-user] Chroma subsampling formats
Jim DeLaHunt
list+ffmpeg-user at jdlh.com
Sat Jul 19 23:14:21 EEST 2025
Mark:
On 2025-07-19 12:32, Mark Filipak wrote:
> I've made quite a few simplifying revisions. What do 'you' think of it?
>
> Below is my preliminary message to Charles Poynton, which I plan to send
> in a week if there are no further revisions.
>
> --Mark.
>
> =====
>
> Dear Dr. Poynton,
>
> Thank you for providing valuable guidance to so many people for so many
> years.
>
> You may know that Wikipedia contains a rather tortured description of
> chroma subsampling. I seek to streamline the description and to expand
> the scope to notations not currently covered.
As I read this draft, I don't see you asking any questions. You are just
stating things as you understand them. What kind of response do you want
from Poynton? I suggest you insert here a list of questions which
afford concise answers. Number the questions.
e.g.
Q1. Is 4:4:2 subsampling correctly depicted in my diagram "4:4:2
conjecture" below?
Q2. For 4:2:2 subsampling, I read a difference between your description
and Wikipedia's. I have depicted them below in diagrams "4:2:2,
Wikipedia" and "4:2:2 (BT.601), Poynton". As I read it, you imply that
4:2:2 subsampling involves sampling at a half-pixel offset from the
luminance pixel locations, while Wikipedia implies sampling at luminance
pixel locations. Do my diagrams depict you correctly? Am I correct that,
in your understanding, 4:2:2 subsampling involves sampling at a
half-pixel offset?
[I do not claim that this is the right wording. I am only giving an
example of a list of questions.]
> I'm associated with the ffmpeg-user mailing list. And though I speak
> solely for myself, I have the entirety of ffmpeg as an audience. I would
> be delighted to receive the opinions of an authority. All comments are
> welcome. I will not post any replies to ffmpeg-user or anywhere else
> without your expressed permission. However, I have posted this message
> to ffmpeg-user for comments.
>
> Warm Regards,
> Mark Filipak.
>
> References.
> Poynton:https://www.poynton.ca/PDFs/Chroma_subsampling_notation.pdf
> Wikipedia:https://en.wikipedia.org/wiki/Chroma_subsampling
>
> KEY:
> • : denotes a luma pixel.
> © : denotes a chroma pixel.
> ©== : denotes a single Cb Cr pair that spans 2 pixels.
> ©====== : denotes a single Cb Cr pair that spans 4 pixels.
> ——[1]—— : denotes that there is no chroma subpicture there [1].
Your notation also apparently represents half pixel offsets like this:
| ©== ©==
|©== ©==
When I saw this difference, I thought it was a formatting mistake. I
suggest you add this notation to your key.
> Shown next is the top-left corner of the sampled film.
> +—————————
> | • • • • <== The sample loci in subpicture 0 (frame or field).
> | • • • • <== The sample loci in subpicture 1 (frame or field).
> | The above is repeated across the pixel data's columns and rows.
>
> Shown next is the loci of Y, Cb, and Cr separated as element 'planes'.
> | Y Cb Cr
> | • • • • | © © © © | © © © © 4:4:4, Poynton, Wikipedia.
> | • • • • | © © © © | © © © ©
>
> | • • • • | © © © © | © © © © 4:4:2, conjecture.
> | • • • • | ©== ©== | ©== ©==
…[elided]…
> | • • • • | ©== ©== | ©== ©== 4:2:2, Wikipedia.
> | • • • • | ©== ©== | ©== ©==
>
> | • • • • |©== ©== |©== ©== 4:2:2 (BT.601), Poynton.
> | • • • • |©== ©== |©== ©== [2]
…[elided]…
> | • • • • ©====== | ©====== 4:1:1 (480i), Poynton.
> | • • • • ©====== | ©====== [2]
There is no vertical line to the left of the Cb section in the diagram
for "4:1:1 (480i), Poynton". Is that difference intentional? If so, what
does it signify?
> | • • • • | ©====== | ©====== 4:1:0, conjecture.
> | • • • • | ——[1]—— | ——[1]——
>
> | • • • • | ——[1]—— | ——[1]—— 4:0:0, (monochrome) conjecture.
> | • • • • | ——[1]—— | ——[1]——
>
> | • • • | ©==== | ©==== 3:1:1 (Sony), Poynton,
> | • • • | ©==== | ©==== Wikipedia.
>
> [1] The chroma subpicture is void. If both chroma subpictures 0 and 1
> are void (e.g. 4:0:0), then there are no chroma subpictures and the
> composited pixels carry luma only. Otherwise, the decoder copies chroma
> subpicture 0 to chroma subpicture 1. Note that though it is possible,
> the condition: chroma subpicture 0 void and chroma subpicture 1
> populated, has not been seen in the wild.
>
> [2] The chroma pixels are shown left-shifted by one-half pixel,
> presumably due to sampling one-half pixel to the left. Since they cannot
> be moved, the pixels must be obtained by mixing (e.g. C.n = (C.n +
> C.n+1)/2) in un-coded pictures. Note that the appearance of the left
> edges of chroma spans are slightly blurred by simple mixing. Note also
> that chroma spans are assumed to be truncated as needed at the left edge
> of film edges (e.g. in 4:1:1) and that the mixing is most accurately
> accomplished in frame pictures.
I hope you are successful in getting expert clarification, and in
updating the Wikipedia article.
I suggest you ask a followup question of Poynton: for any differences
with the WIkipedia article which he identifies, ask if he can suggest
reliable sources[3] which documents those differences. What you are
doing here sounds like what Wikipedia would call Original Research[4].
Wikipedia deals in verifiability, rather than the truth[5], for deep
philosophical reasons. Thus, your improvements to Wikipedia are of most
value if you can back them up with citations to reliable sources.
[3] <https://en.wikipedia.org/wiki/Wikipedia:Reliable_sources>
[4] <https://en.wikipedia.org/wiki/Wikipedia:No_original_research>
[5] <https://en.wikipedia.org/wiki/Wikipedia:Verifiability,_not_truth>
Best regards,
—Jim DeLaHunt
More information about the ffmpeg-user
mailing list