Displaying 4 results from an estimated 4 matches for "oc_frag_recon_inter2_mmx".
2007 Dec 30
2
Patch: fragment reconstruction MMX for GCC
Hi again,
I measured my fragment reconstructions against the compiler output from
GCC and well - the new codes perform better, so I brushed up my gcc
inline assembler skills and made a port.
Code is here: http://torus.untergrund.net/code/mmxfrag.c
All routines perform much better now. Inter2 alone got a speedup of
factor 5 on Pentium-M. Athlon CPU's execute roughly 3 times faster.
2005 Mar 23
3
[PATCH] promised MMX patches rc1
Hello,
Here is my first speedup patch. Like 10-11%. No IDCT yet.
Please feel free to comment my code or even better think about
improvements. :) I belive my routines are not so bad, maybe
one day they will be even more faster.
What needs to be optimized is the loop filter fuction. I have
no ideas now how to do it. It does not leave much space for parallel
stuff, copying memory from lot of
2005 Jul 20
1
MMX IDCT for theora-exp
...1057 33746 8.6944 dump oc_state_loop_filter_frag_rows
57706 10.2185 9204 2.3713 libogg.so.0.5.2 (no symbols)
39182 6.9383 10146 2.6140 dump oc_state_frag_recon_mmx
31095 5.5062 38650 9.9578 dump oc_frag_recon_inter2_mmx
24133 4.2734 12945 3.3352 dump oc_frag_pred_dc
22053 3.9051 11120 2.8650 dump oc_huff_token_decode
12497 2.2129 163 0.0420 dump oc_idct8x8_mmx
10376 1.8374 22113 5.6972 dump oc_...
2005 Aug 17
2
MMX loop filter for theora-exp
...6 17.1511 160269 30.5682 libc-2.3.2.so (no symbols)
72543 10.4566 12407 2.3664 libogg.so.0.5.2 (no symbols)
56169 8.0964 46845 8.9348 dump oc_state_loop_filter_frag_rows_mmx
42864 6.1786 50895 9.7072 dump oc_frag_recon_inter2_mmx
36961 5.3277 11800 2.2506 dump oc_state_frag_recon_mmx
33020 4.7596 17433 3.3250 dump oc_frag_pred_dc
26958 3.8858 15152 2.8900 dump oc_huff_token_decode
15535 2.2393 31301 5.9701 dump...