Lancer 20060722 (based on aoTuV 4.51) homepage3.nifty.com/blacksword Changes: * inline assembly replaces as much as possible to intrinsic * abolish original memory transfer code in block.c * bitreverse use looking up table * fix speed down vorbis_book_decodevv_add's regression in lancer 20060529 * remove optimization prevention code in vorbis_book_decodevv_add * pre-calculate tables for triggers in mdct * simplifying a code in which high frequency removed by mdct_backward * add decode only funcs: mdct_butterflies_backward, dct_butterfly_first_backward * improve SSE optimization: bark_noise_hybridmp * add SSE optimization: render_line, vorbis_noise_normalize, _vp_noise_normalize * add SSE3 optimization: mdct_bitreverse * add pre-calculation code: seed_loop, max_seeds * optimize: seed_chase * add SORT16 to psy.c * auto loop unrolling: SORT8, SORT32 in psy.c * use lddqu in non SSE environment for unaligned memory load * improve loop condiution code in inline assembly code * add t option for oggdec benchmarks (without outputting file)