Hi Guys, In stream_decoder.c when assigning lpc restore function, only IA32 processor benefits from SS2 and SSE4.1 optimization. Shouldn't it be the case for x86_64 processor as well ? Thanks, -- Olivier TRISTAN uvi.net -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.xiph.org/pipermail/flac-dev/attachments/20140311/1d49b5c2/attachment.htm
Olivier Tristan wrote:> In stream_decoder.c when assigning lpc restore function, > only IA32 processor benefits from SS2 and SSE4.1 optimization. > > Shouldn't it be the case for x86_64 processor as well ?I tried, and it didn't make decoding faster. (And even SSE4.1 for IA-32 is... questionable) OTOH, flac decoding is really very fast. It's very hard to make it even faster.
oki doki Flac decoding is already really fast indeed but as I'm streaming hundreds of FLAC file from the HD in realtime, I wouldn't mind some speed up if available :) On Wed, Mar 12, 2014 at 5:40 PM, lvqcl <lvqcl.mail at gmail.com> wrote:> Olivier Tristan wrote: > > > In stream_decoder.c when assigning lpc restore function, > > only IA32 processor benefits from SS2 and SSE4.1 optimization. > > > > Shouldn't it be the case for x86_64 processor as well ? > > I tried, and it didn't make decoding faster. (And even SSE4.1 for IA-32 > is... questionable) > > OTOH, flac decoding is really very fast. It's very hard to make it even > faster. > _______________________________________________ > flac-dev mailing list > flac-dev at xiph.org > http://lists.xiph.org/mailman/listinfo/flac-dev >-- Olivier TRISTAN uvi.net -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.xiph.org/pipermail/flac-dev/attachments/20140316/2f3ad2ca/attachment.htm
Seemingly Similar Threads
- x86_64 SSE2/SSE41 optim not used
- [PATCH 5/5] SIMD: remove outdated SSE2 code
- [PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
- [PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
- [PATCHv2] SSE2/SSSE3 optimized version of get_checksum1() for x86-64