[FFmpeg-devel] [PATCH 07/10] x86: float dsp: unroll SSE versions
Michael Niedermayer
michaelni at gmx.at
Sat Feb 15 19:01:06 CET 2014
On Fri, Feb 14, 2014 at 04:00:51PM +0000, Christophe Gisquet wrote:
> vector_fmul and vector_fmac_scalar are guaranteed that they can process in
> batch of 16 elements, but their SSE versions only does 8 at a time.
>
> Therefore, unroll them a bit.
> 299 to 261c for 256 elements in vector_fmac_scalar on Arrandale/Win64.
> ---
> libavutil/x86/float_dsp.asm | 40 ++++++++++++++++++++++++----------------
> 1 file changed, 24 insertions(+), 16 deletions(-)
applied
thanks
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
He who knows, does not speak. He who speaks, does not know. -- Lao Tsu
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: Digital signature
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20140215/b604206d/attachment.asc>
More information about the ffmpeg-devel
mailing list