[FFmpeg-devel] [PATCH] SPARC VIS simple_idct try#6
Balatoni Denes
dbalatoni
Wed Aug 29 01:13:17 CEST 2007
Hi!
On Wednesday 29 August 2007 at 00:13, Michael Niedermayer wrote:
> > > Indeed, I didn't take that into account. So if I fix that 25% and the
> > > clamping part, will you accept the patch?
> >
> > Better yet: that would be 4 instructions. How about I gain 4 clocks in
> > some other way instead - how, let it be my secret. Okay?
>
> hmm no but you have to do that secret optimization too now at minimum for
> it to be considered for svn
The 4 instructions dealing with f46 (the fzero and ldd in the macros) can be
eliminated if f60 or f62 is also put to use.
> ill investigate the register shortage vs. avoidable load/stores vs. latency
> after (the unlikely) case that you do correct the undisputed
> suboptimalities
Argh, I checked what it would involve to fix the 25% overlap. Half of the
registers would have to be carefully renamed all over the place.
While the suboptimalities are not disputed, whether the benefit from fixing
them outweighs the cost (in time and code beauty) is heavily disputed.
> [...]
bye
Denes