search for: ia32_bswap

Displaying 4 results from an estimated 4 matches for "ia32_bswap".

2013 Jun 01
2
Performance checks
...see a small > improvement in decoding speed with assembly optimizations turned on, > but I think the difference used to be larger. Perhaps the compilers > got better or MMX is slower relative to normal code on current CPUs. > > Disabling the FLAC__bitreader_read_rice_signed_block_asm_ia32_bswap > function seems to help a bit. (there is an #if disabling the function > with comment "OPT: not clearly faster, needs more testing" in the > src/libFLAC/stream_decoder.c file) > > Here is the relative decoding speed with -5 and -8: > -5 -8 > no asm 99.0% 97.0%...
2013 May 29
2
Performance checks
On 28-05-13 20:09, Janne Hyv?rinen wrote: > On Windows the 32-bit NASM enabled compiles are always fastest. If you > can run 32-bit code on your Linux box you should compile with assembly > optimizations. That depends on the way you define speed. For decoding this doesn't seem to be true. I reran my tests, it took a little longer because I couldn't believe the results I got.
2013 May 31
0
Performance checks
...8 gcc on a Core 2 CPU I see a small improvement in decoding speed with assembly optimizations turned on, but I think the difference used to be larger. Perhaps the compilers got better or MMX is slower relative to normal code on current CPUs. Disabling the FLAC__bitreader_read_rice_signed_block_asm_ia32_bswap function seems to help a bit. (there is an #if disabling the function with comment "OPT: not clearly faster, needs more testing" in the src/libFLAC/stream_decoder.c file) Here is the relative decoding speed with -5 and -8: -5 -8 no asm 99.0% 97.0% asm 100.0% 100.0% asm (no ia32...
2013 Jun 01
0
Performance checks
...improvement in decoding speed with assembly optimizations turned on, >> but I think the difference used to be larger. Perhaps the compilers >> got better or MMX is slower relative to normal code on current CPUs. >> >> Disabling the FLAC__bitreader_read_rice_signed_block_asm_ia32_bswap >> function seems to help a bit. (there is an #if disabling the function >> with comment "OPT: not clearly faster, needs more testing" in the >> src/libFLAC/stream_decoder.c file) >> >> Here is the relative decoding speed with -5 and -8: >> -5 -8 &g...