Hi to all of vorbis developers! I think it is time to do core optimizations to vorbis-coder/decoder in asm. I`d like to help. What you think about that?
Yann Guidon wrote:> Aleksey wrote: > >> Hi to all of vorbis developers! >> >> I think it is time to do core optimizations to vorbis-coder/decoder >> in asm. I`d like to help. What you think about that? > > > higher level optimisations + good compiler switches is safer and far > less painful IMHO. > > unless you are already a P3 or P4 programming guru > but then, others having non-x86 computers will be disapointed. > > YG > >I mean rewrite critical core functions using x86 *simd* instructions. It significaly increase speed on x86(and new x86-64) machines IMHO, and precision will not change significaly. I just want to do vorbis faster for most users, who have x86. They paid for simd instructions and probably will be disappointed too, if vorbis will be slower than could be on their machines. And why others who have non-x86 computers will by disappointed? They simply couldn't use these optimizations and will use pure vorbis C code, as it is at this time. Why do not to use it if it exists ;)
Aleksey wrote:> Hi to all of vorbis developers! > > I think it is time to do core optimizations to vorbis-coder/decoder in > asm. I`d like to help. What you think about that?I think this may be interesting for you (Archer - a SSE optimized Vorbis encoder): http://homepage3.nifty.com/blacksword/ Discussion: http://www.hydrogenaudio.org/forums/index.php?showtopic=29161 bye, Maik Merten
Aleksey schrieb:> Why do not to use it if it exists ;)Because the "significal increase" in speed may reduce CPU load for playback from 0.5% to 0.4%, give or take a fraction or two. Why bother? Tor
Tor-Einar Jarnbjo wrote:> Aleksey schrieb: > >> Why do not to use it if it exists ;) > > > Because the "significal increase" in speed may reduce CPU load for > playback from 0.5% to 0.4%, give or take a fraction or two. Why bother? > > Tor > >And how about the encoder? I don`t believe that it will be about 1%
Aleksey schrieb:> And how about the encoder? I don`t believe that it will be about 1%When encoding with q4 in realtime, I have a CPU load of ~6.5%. How much do you expect to reduce this with your assembler optimizations and when do you expect a practical advantage of the performance boost? Tor