[FFmpeg-devel] [PATCH 1/4] avfilter/scale_vt: implement frame crop

Zhao Zhili quinkblack at foxmail.com
Sun Nov 24 18:57:38 EET 2024



> On Nov 21, 2024, at 00:40, Koushik Dutta <koushd at gmail.com> wrote:
> 
> Is anyone reviewing videotoolbox or qsv filters? The scale_cuda
> version of this patch was merged.
> 
> On Sat, Oct 19, 2024 at 10:58 PM Koushik Dutta <koushd at gmail.com> wrote:
>> 
>> The crop filter has no effect on scale_vt:
>> 
>> -vf crop=100:100,scale_vt=300x300
>> 
>> Hardware frames (AV_PIX_FMT_FLAG_HWACCEL) are expected to use the crop_* properties,
>> as seen in the implementation vf_crop.c.
>> 
>> The current workaround is to hwdownload the full frame
>> and perform the crop on CPU.
>> 
>> Signed-off-by: Koushik Dutta <koushd at gmail.com>
>> ---
>> libavfilter/vf_scale_vt.c | 50 +++++++++++++++++++++++++++++++++++++++
>> 1 file changed, 50 insertions(+)
>> 
>> diff --git a/libavfilter/vf_scale_vt.c b/libavfilter/vf_scale_vt.c
>> index 05f4e7b797..3da46a6cd5 100644
>> --- a/libavfilter/vf_scale_vt.c
>> +++ b/libavfilter/vf_scale_vt.c
>> @@ -109,6 +109,8 @@ static av_cold int scale_vt_init(AVFilterContext *avctx)
>>         VTSessionSetProperty(s->transfer, kVTPixelTransferPropertyKey_DestinationYCbCrMatrix, value);
>>     }
>> 
>> +    VTSessionSetProperty(s->transfer, kVTPixelTransferPropertyKey_ScalingMode, kVTScalingMode_CropSourceToCleanAperture);
>> +
>>     return 0;
>> }
>> 
>> @@ -132,6 +134,18 @@ static int scale_vt_filter_frame(AVFilterLink *link, AVFrame *in)
>>     CVPixelBufferRef src;
>>     CVPixelBufferRef dst;
>> 
>> +    int left;
>> +    int top;
>> +    int width;
>> +    int height;
>> +    CFNumberRef crop_width_num;
>> +    CFNumberRef crop_height_num;
>> +    CFNumberRef crop_offset_left_num;
>> +    CFNumberRef crop_offset_top_num;
>> +    const void *clean_aperture_keys[4];
>> +    const void *source_clean_aperture_values[4];
>> +    CFDictionaryRef source_clean_aperture;
>> +
>>     AVFrame *out = ff_get_video_buffer(outlink, outlink->w, outlink->h);
>>     if (!out) {
>>         ret = AVERROR(ENOMEM);
>> @@ -153,8 +167,43 @@ static int scale_vt_filter_frame(AVFilterLink *link, AVFrame *in)
>>     if (s->colour_matrix != AVCOL_SPC_UNSPECIFIED)
>>         out->colorspace = s->colour_matrix;
>> 
>> +    width = (in->width - in->crop_right) - in->crop_left;
>> +    height = (in->height - in->crop_bottom) - in->crop_top;
>> +    // The crop offsets are relative to the center of the frame.
>> +    // the crop width and crop height are relative to the center of the crop rect, not top left as normal.
>> +    left = in->crop_left - in->width / 2 + width / 2;
>> +    top = in->crop_top - in->height / 2 + height / 2;
>> +    crop_width_num = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &width);
>> +    crop_height_num = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &height);
>> +    crop_offset_left_num = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &left);
>> +    crop_offset_top_num = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &top);
>> +
>> +    clean_aperture_keys[0] = kCVImageBufferCleanApertureWidthKey;
>> +    clean_aperture_keys[1] = kCVImageBufferCleanApertureHeightKey;
>> +    clean_aperture_keys[2] = kCVImageBufferCleanApertureHorizontalOffsetKey;
>> +    clean_aperture_keys[3] = kCVImageBufferCleanApertureVerticalOffsetKey;
>> +
>> +    source_clean_aperture_values[0] = crop_width_num;
>> +    source_clean_aperture_values[1] = crop_height_num;
>> +    source_clean_aperture_values[2] = crop_offset_left_num;
>> +    source_clean_aperture_values[3] = crop_offset_top_num;
>> +
>> +    source_clean_aperture = CFDictionaryCreate(kCFAllocatorDefault,
>> +                                                    clean_aperture_keys,
>> +                                                    source_clean_aperture_values,
>> +                                                    4,
>> +                                                    &kCFTypeDictionaryKeyCallBacks,
>> +                                                    &kCFTypeDictionaryValueCallBacks);
>> +
>> +    CFRelease(crop_width_num);
>> +    CFRelease(crop_height_num);
>> +    CFRelease(crop_offset_left_num);
>> +    CFRelease(crop_offset_top_num);
>> +
>>     src = (CVPixelBufferRef)in->data[3];
>>     dst = (CVPixelBufferRef)out->data[3];
>> +    CVBufferSetAttachment(src, kCVImageBufferCleanApertureKey,
>> +                           source_clean_aperture, kCVAttachmentMode_ShouldPropagate);

After the crop is applied, the output frame still holds the crop info copied from the
input frame via
    ret = av_frame_copy_props(out, in);

The output frame's crop info should be reset.
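
A minimal sketch of the reset, not taken from the patch. CropInfo and
reset_crop() are hypothetical stand-ins so the snippet compiles without
FFmpeg headers; only the four crop_* field names mirror AVFrame.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the four crop fields of AVFrame
 * (crop_top, crop_bottom, crop_left, crop_right). */
typedef struct CropInfo {
    size_t crop_top;
    size_t crop_bottom;
    size_t crop_left;
    size_t crop_right;
} CropInfo;

/* Clear the crop rectangle once the crop has already been applied
 * to the pixel data, so downstream filters do not crop again. */
static void reset_crop(CropInfo *f)
{
    assert(f != NULL);
    f->crop_top    = 0;
    f->crop_bottom = 0;
    f->crop_left   = 0;
    f->crop_right  = 0;
}
```

In the filter itself this would presumably amount to zeroing out->crop_top,
out->crop_bottom, out->crop_left and out->crop_right after the
av_frame_copy_props() call.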


>>     ret = VTPixelTransferSessionTransferImage(s->transfer, src, dst);
>>     if (ret != noErr) {
>>         av_log(ctx, AV_LOG_ERROR, "transfer image failed, %d\n", ret);
>> @@ -162,6 +211,7 @@ static int scale_vt_filter_frame(AVFilterLink *link, AVFrame *in)
>>         goto fail;
>>     }
>> 
>> +    CFRelease(source_clean_aperture);

This should be released earlier. On error, the goto fail path above skips it and
the dictionary leaks.
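
One way to picture the ordering, as a sketch only. Obj, attach() and
filter_frame() are hypothetical stand-ins for the CFDictionaryRef, the
attachment call and the filter function; the sketch assumes the buffer
keeps its own reference to the attachment, so the local reference can be
dropped before the call that may fail.

```c
#include <assert.h>

/* Hypothetical refcounted object standing in for the dictionary. */
typedef struct Obj { int refs; } Obj;

static void obj_retain(Obj *o)  { o->refs++; }
static void obj_release(Obj *o) { assert(o->refs > 0); o->refs--; }

/* Stand-in for attaching the dictionary to the source buffer:
 * the buffer takes its own reference (assumption). */
static void attach(Obj *o) { obj_retain(o); }

/* Returns 0 on success, -1 on failure. The local reference is
 * dropped before the fallible step, so neither path leaks it. */
static int filter_frame(Obj *dict, int transfer_fails)
{
    attach(dict);
    obj_release(dict);      /* release early, before anything can fail */

    if (transfer_fails)
        return -1;          /* error path: nothing left to clean up */
    return 0;
}
```

With this ordering the only remaining reference after either path is the
one held by the buffer.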

>>     av_frame_free(&in);
>> 
>>     return ff_filter_frame(outlink, out);
>> --
>> 2.39.5 (Apple Git-154)
>> 


