[FFmpeg-devel] [PATCH 05/11] dca dsp C: factorize scaling in lfe_fir
Christophe Gisquet
christophe.gisquet at gmail.com
Thu Feb 6 01:41:56 CET 2014
Scaling on output leads to respectively 32 or 64 multiplications, while doing
it on the input is respectively 8 or 4 multiplications. As a side note,
the trick can be used in other implementations.
Win64: 728 to 562 cycles.
---
libavcodec/dcadsp.c | 10 +++++++---
1 file changed, 7 insertions(+), 3 deletions(-)
diff --git a/libavcodec/dcadsp.c b/libavcodec/dcadsp.c
index b14c435..6600fef 100644
--- a/libavcodec/dcadsp.c
+++ b/libavcodec/dcadsp.c
@@ -39,19 +39,23 @@ dca_lfe_fir(float *out, const float *in, const float *coefs,
float *out2 = out + decifactor;
const float *cf0 = coefs;
const float *cf1 = coefs + 256;
+ float scaled_in[8];
int j, k;
+ for (j = 0; j < 256 / decifactor; j++)
+ scaled_in[j] = in[-j] * scale;
+
/* One decimated sample generates 2*decifactor interpolated ones */
for (k = 0; k < decifactor; k++) {
float v0 = 0.0;
float v1 = 0.0;
for (j = 0; j < 256 / decifactor; j++) {
- float s = in[-j];
+ float s = scaled_in[j];
v0 += s * *cf0++;
v1 += s * *--cf1;
}
- *out++ = v0 * scale;
- *out2++ = v1 * scale;
+ *out++ = v0;
+ *out2++ = v1;
}
}
--
1.8.0.msysgit.0
More information about the ffmpeg-devel
mailing list