[FFmpeg-devel] [PATCH 4/4] Make the crop filter accept parametric expressions.

Sun Sep 12 20:30:11 CEST 2010

On date Sunday 2010-09-12 14:23:24 +0200, Michael Niedermayer encoded:
> On Sun, Sep 12, 2010 at 01:51:23AM +0200, Stefano Sabatini wrote:
> > On date Saturday 2010-09-11 19:05:46 +0200, Michael Niedermayer encoded:
> > > On Sat, Sep 11, 2010 at 11:35:58AM +0200, Stefano Sabatini wrote:
> > > > On date Friday 2010-09-10 23:13:45 +0200, Michael Niedermayer encoded:
> > > > > On Fri, Sep 10, 2010 at 06:33:36PM +0200, Stefano Sabatini wrote:
> > > > > [...]
> > > > > > > also i think the values should be cliped into sane integer range and maybe a
> > > > > > > NAN check is needed to
> > > > > > 
> > > > > > The clipping is already performed, and NAN check added.
> > > > > 
> > > > > no its not, you dont clip the double before converting to int
> > > > > and if iam not mistaken C doesnt gurantee not representable types for that
> > > > > not just crashing your program
> > > > 
> > > > Check attached and tell me if you like it.
> > > [...]
> > > > +# trembling effect
> > > > +crop='10+10*sin(n/10):20+20*sin(n/5):w-2*x:h-2*y
> > > > +
> > > > +# erratic camera effect depending on timestamp
> > > > +crop='20+20*sin(3*t):10+10*sin(2*t):w-2*x:h-2*y'
> > > > +
> > > > +# set x depending on the value of y
> > > > +crop='y:10+10*sin(n/10):w-2*x:h-2*y'
> > > > + at end example
> > > 
> > > if x and y change per frame but w/h cannot then this looks odd
> > 
> > If you prefer we can use x0/y0 in place of x/y for the w and h
> > expressions.
> 
> for filters like above this makes as much sense as x/y
> what you mean (but dont conciously realize) is the maximum x/y that the
> x/y expressions can generate for any input.
> sadly we dont know these and its not easy to calculate for a computer

We could let the user specify w/h *before* x/y, and then use w/W/h/H
in x/y expressions.

This looks more useful than allowing the user to specify w/h in
function of the initial x/y values.

> > > [...]
> > > >  static int config_input(AVFilterLink *link)
> > > >  {
> > > >      AVFilterContext *ctx = link->dst;
> > > >      CropContext *crop = ctx->priv;
> > > >      const AVPixFmtDescriptor *pix_desc = &av_pix_fmt_descriptors[link->format];
> > > > +    int ret;
> > > > +    const char *expr;
> > > > +    double res;
> > > > +
> > > > +    crop->var_values[E  ]   = M_E;
> > > > +    crop->var_values[PHI]   = M_PHI;
> > > > +    crop->var_values[PI ]   = M_PI;
> > > > +    crop->var_values[X  ]   = NAN;
> > > > +    crop->var_values[Y  ]   = NAN;
> > > > +    crop->var_values[W  ]   = ctx->inputs[0]->w;
> > > > +    crop->var_values[H  ]   = ctx->inputs[0]->h;
> > > > +    crop->var_values[N  ]   = 0;
> > > >  
> > > >      av_image_fill_max_pixsteps(crop->max_step, NULL, pix_desc);
> > > >      crop->hsub = av_pix_fmt_descriptors[link->format].log2_chroma_w;
> > > >      crop->vsub = av_pix_fmt_descriptors[link->format].log2_chroma_h;
> > > >  
> > > > -    if (crop->w == 0)
> > > > -        crop->w = link->w - crop->x;
> > > > -    if (crop->h == 0)
> > > > -        crop->h = link->h - crop->y;
> > > > +    if ((ret = av_parse_expr(&crop->x_pexpr, crop->x_expr, var_names,
> > > > +                             NULL, NULL, NULL, NULL, 0, ctx)) < 0 ||
> > > > +        (ret = av_parse_expr(&crop->y_pexpr, crop->y_expr, var_names,
> > > > +                             NULL, NULL, NULL, NULL, 0, ctx)) < 0)
> > > > +        return AVERROR(EINVAL);
> > > > +
> > > > +    crop->var_values[X] = av_eval_expr(crop->x_pexpr, crop->var_values, NULL);
> > > > +    crop->var_values[Y] = av_eval_expr(crop->y_pexpr, crop->var_values, NULL);
> > > > +    /* evaluate again x as it may depend on y */
> > > > +    crop->var_values[X] = av_eval_expr(crop->x_pexpr, crop->var_values, NULL);
> > > >  
> > > > +    if (normalize_double(&crop->x, crop->var_values[X]) < 0 ||
> > > > +        normalize_double(&crop->y, crop->var_values[Y]) < 0) {
> > > > +        av_log(ctx, AV_LOG_ERROR,
> > > > +               "Too big value or invalid expression for x or y. "
> > > > +               "Maybe the expression for x:'%s' or for y:'%s' is self-referencing.\n",
> > > > +               crop->x_expr, crop->y_expr);
> > > > +        return AVERROR(EINVAL);
> > > 
> > > this could trigger for unknown pos/ts
> > 
> > ???
> 
> NAN in (unknown ts/pos), NAN out

I can set var_values[T] = NAN if pts == NAN.

The we still need to define a behavior in case the computed value for
x/y is NAN or out-bound.
The currently implemented behavior:

NAN       => use the last valid value
out-bound => approximate the value to the nearest in-bound value

Is it OK to keep this behavior?

> > > [...]
> > > > +    /* FIXME: when the TB will be settable */
> > > > +    crop->var_values[T]   = (double)picref->pts / AV_TIME_BASE;
> > > 
> > > AV_NOPTS_VALUE
> > 
> > How do you suggest to deal with that case? One option would be to use
> > the last valid values
> 
> we could have a filter that fills in missing timestamps by repeating the last
> or extrapolating
> 
> 
> >  (as I'm already doing with x/y).
> 
> i was unaware of that

Regards.
-- 
FFmpeg = Fancy and Fanciful Multimedia Programmable Emblematic Guide