[FFmpeg-devel] [RFC]] swscale modernization proposal
Michael Niedermayer
michael at niedermayer.cc
Sat Jul 6 00:34:06 EEST 2024
On Fri, Jul 05, 2024 at 08:31:17PM +0200, Niklas Haas wrote:
> On Wed, 03 Jul 2024 15:25:58 +0200 Niklas Haas <ffmpeg at haasn.xyz> wrote:
> > On Tue, 02 Jul 2024 15:27:00 +0200 Niklas Haas <ffmpeg at haasn.xyz> wrote:
> >
> > > 1. Is this a good idea, or too confusing / complex to be worth the gain?
> > > Specifically, I am worried about confusion arising due to differences
> > > in behavior, and implemented options, between all of the above.
> > >
> > > That said, I think there is a big win to be had from unifying all of
> > > the different scaling and/or conversion filters we have in e.g.
> > > libavfilter, as well as making it trivial for users of this API to
> > > try using e.g. GPU scaling instead of CPU scaling.
> >
> > After prototyping this approach a bit (using an internal struct
> > AVScaleBackend), I think I like it. It specifically makes handling
> > unscaled special converters pretty straightforward, for example - the
> > "unscaled" backend can be separate from the generic/scaling backend.
> >
> > We could also trivially plug in something like libyuv, or some other
> > limited-use-case fast path, without the user really noticing.
>
> Small update: I decided to scrap the idea of separate user-visible
> "backends" for now, but preserved the internal API boundary between the
> avscale_* "front-end" and the actual back-end implementation, which
> I have called 'AVScaleGraph' for now.
>
> The idea is that this will grow into a full colorspace <-> colorspace
> "solver", but for now it is just hooked up to sws_createContext().
>
> Attached is my revised working draft of <avscale.h>.
I dont agree to the renaming of swscale, that is heading toward
[...]
> /**
> * The exact interpretation of these quality presets depends on the backend
> * used, but the backend-invariant common settings are derived as follows:
> */
> enum AVScaleQuality {
> AV_SCALE_ULTRAFAST = 1, /* no dither, nearest+nearest */
> AV_SCALE_SUPERFAST = 2, /* no dither, bilinear+nearest */
> AV_SCALE_VERYFAST = 3, /* no dither, bilinear+bilinear */
> AV_SCALE_FASTER = 4, /* bayer dither, bilinear+bilinear */
> AV_SCALE_FAST = 5, /* bayer dither, bicubic+bilinear */
> AV_SCALE_MEDIUM = 6, /* bayer dither, bicubic+bicubic */
> AV_SCALE_SLOW = 7, /* bayer dither, lanczos+bicubic */
> AV_SCALE_SLOWER = 8, /* full dither, lanczos+bicubic */
> AV_SCALE_VERYSLOW = 9, /* full dither, lanczos+lanczos */
> AV_SCALE_PLACEBO = 10, /* full dither, lanczos+lanczos */
I dont think its a good idea to hardcode dither and the "FIR" filter to the quality level in the API
[...]
> /**
> * Like `avscale_frame`, but operates only on the (source) range from `ystart`
> * to `height`.
> *
> * @param ctx The scaling context.
> * @param dst The destination frame. The data buffers may either be already
> * allocated by the caller or left clear, in which case they will
> * be allocated by the scaler. The latter may have performance
> * advantages - e.g. in certain cases some (or all) output planes
> * may be references to input planes, rather than copies.
> * @param src The source frame. If the data buffers are set to NULL, then
> * this function behaves identically to `avscale_frame_setup`.
> * @param slice_start First row of slice, relative to `src`. Must be a
> * multiple of avscale_slice_alignment(src).
> * @param slice_height Number of (source) rows in the slice. Must be a
> * multiple of avscale_slice_alignment(src).
> *
> * @return 0 on success, a negative AVERROR code on failure.
> */
> int avscale_frame_slice(AVScaleContext *ctx, AVFrame *dst,
> const AVFrame *src, int slice_start, int slice_height);
>
> /**
> * Like `avscale_frame`, but without actually scaling. It will instead merely
> * initialize internal state that *would* be required to perform the operation,
> * as well as returning the correct error code for unsupported frame
> * combinations.
> *
> * @param ctx The scaling context.
> * @param dst The destination frame to consider.
> * @param src The source frame to consider.
> * @return 0 on success, a negative AVERROR code on failure.
> */
> int avscale_frame_setup(AVScaleContext *ctx, const AVFrame *dst,
> const AVFrame *src);
somewhat off topic as this is public API but
the swscale filtering code could internally use libavutil/executor.h
having filters and slices interdepend and need to execute them "in order"
and parallel, maybe that API is usefull, not sure, just wanted to mention it
thx
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
Why not whip the teacher when the pupil misbehaves? -- Diogenes of Sinope
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 195 bytes
Desc: not available
URL: <https://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20240705/1243a840/attachment.sig>
More information about the ffmpeg-devel
mailing list