[FFmpeg-devel] [PATCH] avutil/hwcontext_cuda: fix YUV420P cuda_get_buffer
Timo Rothenpieler
timo at rothenpieler.org
Tue May 15 12:08:50 EEST 2018
On 14.05.2018 23:29, Marton Balint wrote:
> Regression since ece068a771ac3f725e854c681ecbef08e792addc.
>
> Signed-off-by: Marton Balint <cus at passwd.hu>
> ---
> libavutil/hwcontext_cuda.c | 9 ++++++---
> 1 file changed, 6 insertions(+), 3 deletions(-)
>
> diff --git a/libavutil/hwcontext_cuda.c b/libavutil/hwcontext_cuda.c
> index cb5d15c9d9..e16d0a2b4b 100644
> --- a/libavutil/hwcontext_cuda.c
> +++ b/libavutil/hwcontext_cuda.c
> @@ -159,10 +159,13 @@ static int cuda_get_buffer(AVHWFramesContext *ctx, AVFrame *frame)
> return res;
>
> // YUV420P is a special case.
> - // Nvenc expects the U/V planes in swapped order from how ffmpeg expects them.
> + // Nvenc expects the U/V planes in swapped order from how ffmpeg expects them, also chroma is half-aligned
> if (ctx->sw_format == AV_PIX_FMT_YUV420P) {
> - FFSWAP(uint8_t*, frame->data[1], frame->data[2]);
> - FFSWAP(int, frame->linesize[1], frame->linesize[2]);
> + int aligned_width = FFALIGN(ctx->width, CUDA_FRAME_ALIGNMENT);
> + frame->data[2] = frame->data[0] + aligned_width * ctx->height;
> + frame->data[1] = frame->data[2] + aligned_width * ctx->height / 4;
> + frame->linesize[1] = aligned_width / 2;
> + frame->linesize[2] = aligned_width / 2;
> }
Are you sure it isn't already that way? I tested specially YUV420P and
could not make out any issues.
And if this is really necessary, please don't use the old aligned_width
term, it's incredibly confusing and made the old code a mess.
linesize[0] should be identical to it if I'm not mistaken.
So something like
frame->linesize[1] = frame->linesize[2] = frame->linesize[0] / 2;
frame->data[2] = frame->data[1];
frame->data[1] = frame->data[2] + frame->linesize[2] * ctx->height / 2;
More information about the ffmpeg-devel
mailing list