[MPlayer-dev-eng] [PATCH] Make vo_gl / fragment programs work with Mesa
Reimar Döffinger
Reimar.Doeffinger at stud.uni-karlsruhe.de
Sun Mar 11 17:35:07 CET 2007
Hello,
On Wed, Feb 28, 2007 at 09:27:24PM +0100, Reimar D?ffinger wrote:
> On Wed, Feb 28, 2007 at 12:57:55PM +0100, Matthias Hopf wrote:
> > > > No, probably not. The chips *should* be capable of that. I'll ask Keith
> > > > later today.
> >
> > Ok, now I know that the chips only support a maximum of 4 dependent
> > texture lookups *in* *hardware*. That still could be enough, if I move
> > the calculation of the cubic texture lookup offsets outside the fragment
> > program. That should be doable, with two more interpolants.
>
> I think that any such solution will mean much more instructions and
> probably give speed problems for fullscreen videos.
> How about attached patch instead? It adds an interpolator that is
> cubic only in X-direction. Doesn't look too bad IMO.
> It could be further simplified if texture swizzles were supported *g*
Applied.
How about attached patch that avoids the extra lookup texture? (you
still need to "fix" two swizzles).
It is a lot more instructions more and thus a lot slower on my hardware
of course?
Greetings,
Reimar Döffinger
-------------- next part --------------
Index: libvo/gl_common.c
===================================================================
--- libvo/gl_common.c (revision 22509)
+++ libvo/gl_common.c (working copy)
@@ -708,6 +708,34 @@
"MUL cdelta.yw, parmy.rrgg, {0, -1, 0, 1};"
BICUB_FILT_MAIN("RECT");
+#define CALCWEIGHTS(t, s) \
+ "MAD "t", {-0.5, 0.1666, 0.3333, -0.3333}, "s", {1, 0, -0.5, 0.5};" \
+ "MAD "t", "t", "s", {0, 0, -0.5, 0.5};" \
+ "MAD "t", "t", "s", {-0.6666, 0, 0.8333, 0.1666};" \
+ "RCP a.x, "t".z;" \
+ "RCP a.y, "t".w;" \
+ "MAD "t".xy, "t".xyxy, a.xyxy, {1, 1, 0, 0};" \
+ "ADD "t".x, "t".xxxx, "s";" \
+ "SUB "t".y, "t".yyyy, "s";"
+
+static const char *bicub_notex_filt_template_2D =
+ "MAD coord.xy, fragment.texcoord[%c], {%f, %f}, {0.5, 0.5};"
+ "FRC coord.xy, coord.xyxy;"
+ CALCWEIGHTS("parmx", "coord.xxxx")
+ "MUL cdelta.xz, parmx.rrgg, {-%f, 0, %f, 0};"
+ CALCWEIGHTS("parmy", "coord.yyyy")
+ "MUL cdelta.yw, parmy.rrgg, {0, -%f, 0, %f};"
+ BICUB_FILT_MAIN("2D");
+
+static const char *bicub_notex_filt_template_RECT =
+ "ADD coord, fragment.texcoord[%c], {0.5, 0.5};"
+ "FRC coord.xy, coord.xyxy;"
+ CALCWEIGHTS("parmx", "coord.xxxx")
+ "MUL cdelta.xz, parmx.rrgg, {-1, 0, 1, 0};"
+ CALCWEIGHTS("parmy", "coord.yyyy")
+ "MUL cdelta.yw, parmy.rrgg, {0, -1, 0, 1};"
+ BICUB_FILT_MAIN("RECT");
+
#define BICUB_X_FILT_MAIN(textype) \
"ADD coord, fragment.texcoord[%c].xyxy, cdelta.xyxw;" \
"ADD coord2, fragment.texcoord[%c].xyxy, cdelta.zyzw;" \
@@ -992,6 +1020,18 @@
texs[0], (float)1.0 / texw, (float)1.0 / texw,
in_tex, in_tex, in_tex, in_tex, out_comp);
break;
+ case YUV_SCALER_BICUB_NOTEX:
+ if (rect)
+ snprintf(*prog_pos, *remain, bicub_notex_filt_template_RECT,
+ in_tex,
+ in_tex, in_tex, in_tex, in_tex, in_tex, in_tex, out_comp);
+ else
+ snprintf(*prog_pos, *remain, bicub_notex_filt_template_2D,
+ in_tex, (float)texw, (float)texh,
+ (float)1.0 / texw, (float)1.0 / texw,
+ (float)1.0 / texh, (float)1.0 / texh,
+ in_tex, in_tex, in_tex, in_tex, in_tex, in_tex, out_comp);
+ break;
}
*remain -= strlen(*prog_pos);
*prog_pos += strlen(*prog_pos);
Index: libvo/gl_common.h
===================================================================
--- libvo/gl_common.h (revision 22509)
+++ libvo/gl_common.h (working copy)
@@ -241,6 +241,8 @@
#define YUV_SCALER_BICUB 1
//! use cubic scaling in X and normal linear scaling in Y direction
#define YUV_SCALER_BICUB_X 2
+//! use cubic scaling without additional lookup texture
+#define YUV_SCALER_BICUB_NOTEX 3
//! mask for conversion type
#define YUV_CONVERSION_MASK 0xF
//! mask for scaler type
More information about the MPlayer-dev-eng
mailing list