Displaying 20 results from an estimated 200 matches similar to: "MMX IDCT for theora-exp"
2005 Mar 23
0
[PATCH]
Hello,
Here is my first speedup patch. Like 10-11%. No IDCT yet.
Please feel free to comment my code or even better think about
improvements. :) I belive my routines are not so bad, maybe
one day they will be even more faster.
What needs to be optimized is the loop filter fuction. I have
no ideas now how to do it. It does not leave much space for parallel
stuff, copying memory from lot of
2005 Mar 23
3
[PATCH] promised MMX patches rc1
Hello,
Here is my first speedup patch. Like 10-11%. No IDCT yet.
Please feel free to comment my code or even better think about
improvements. :) I belive my routines are not so bad, maybe
one day they will be even more faster.
What needs to be optimized is the loop filter fuction. I have
no ideas now how to do it. It does not leave much space for parallel
stuff, copying memory from lot of
2005 Aug 17
2
MMX loop filter for theora-exp
Hello,
I would like to announce the semi-optimized oc_state_loop_filter_frag_rows
It gains like 7% speedup. Unfortunately it has some issues:
1) wont compile on 64bit (I will fix it later hopefully)
2) is not yet fully optimized (instruction stalls)
Here are the results.
CPU: Athlon, speed 1466.91 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask
2005 Aug 20
0
[PATCH] remove some FZIGZAG
Hello,
As we discussed with derf some time ago, it seems it is not neccessary to enforce "forward" order of dct_coeffs.
This patch gains .99366902855226196000% so approx 1% speedup.
Meausurement method:
time nice -n -19 ./dump /mnt/disc4/theora/unix/gripen.ogg > /dev/null
Ogg logical stream 310b2968 is Theora 720x480 29.97 fps video
Encoded frame content is 720x480 with 0x0 offset
2004 Aug 24
5
MMX/mmxext optimisations
quite some speed improvement indeed.
attached the updated patch to apply to svn/trunk.
j
-------------- next part --------------
A non-text attachment was scrubbed...
Name: theora-mmx.patch.gz
Type: application/x-gzip
Size: 8648 bytes
Desc: not available
Url : http://lists.xiph.org/pipermail/theora-dev/attachments/20040824/5a5f2731/theora-mmx.patch-0001.bin
2004 Oct 22
5
theora-mmx_on_win32?
Hi. Has anyone tried http://svn.xiph.org/branches/theora-mmx this code on Win32 ?
I can compile it with very small modification,
304c304
< ogg_int16_t *const temp= (ogg_int16_t*)align_tmp;
---
> ogg_int16_t *const temp= (int16_t*)align_tmp;
but outputs seem terribly broken. -> ex. http://mycomputer.cc/temp/mmx-out.ogg
GCC version is 3.4.2.
$ gcc --version
gcc.exe (GCC) 3.4.2
2007 Mar 25
3
MMX patch to speed up Theora decoding
Hi,
Attached is a patch against 1.0alpha7 to speed up Theora decoding. It
is about 15~20% faster in my test. It consists of following things:
* MMX loop filter based on Rudolf Marek's patch in
http://lists.xiph.org/pipermail/theora-dev/2005-August/002838.html
* MMX IDCT based on Rudolf Marek's patch in
http://lists.xiph.org/pipermail/theora-dev/2005-July/002816.html
and the code in
2006 May 25
2
Compilation issues with s390
Hi all,
I'm trying to compile asterisk on the mainframe (s390 / s390x) and I am
running into issues. I was wondering if somebody could give a hand?
I'm thinking that I should be able to do this. I have noticed that Debian
even has binary RPM's out for Asterisk now. I'm trying to do this on SuSE
SLES8 (with the 2.4 kernel).
What I see is, an issue that arch=s390 isn't
2002 Dec 10
2
mingw compiling problem for libogg
(i hope this is correct m.list)
Hi,
there is a small compiling problem for mingw
when compiling on libogg..
in include/ogg/os_types.h :
ogg_int64_t, ogg_int32_t, etc are defined
correctly on cygwin and MSVC/Borland
but not on mingw...
i have attached a patch that will fix
this problem (i hope it attaches
correctly)
thx, Nehal
--- os_types.h.old Fri Jul 19 02:25:52 2002
+++ os_types.h Tue
2010 May 18
2
idct8x8 C version in libtheora1.1 release
When using the IDCT routines, the C version [ lib/idct.c:
oc_idct8x8_c(ogg_int16_t _y[64],int _last_zzi)] in libtheora 1.1.1, the
decoded image is garbled. Is it functionally equivalent to the MMX optimized
version [lib/x86/mmxidct.c: oc_idct8x8_mmx(ogg_int16_t _y[64],int
_last_zzi)] ?
I used some of the Theora video files from here:
http://wiki.xiph.org/index.php/List_of_Theora_videos for
2008 Mar 07
1
Bug in reference idct.
Hi
The Theora specification states, in section 7.9.3 ("The 1D Inverse DCT")
steps 14-16:
14. Assign T[5] the value T[4] - T[5].
15. Truncate T[5] to a 16-bit representation by dropping any higher-order
bits.
16. Assign T[5] the value C4 * (-T[5]) >> 16.
However, the relevant section of code in the reference decoder
(lib/dec/idct.c line 50) is:
2001 Apr 18
3
[Patch] OS/2 port
[Repost, maybe it will get noticed with a better subject]
So now beta 4 is out is there any chance you guys will have time to review
some of my patches? Here's a simple one. It just adds the standard integer
typedefs for EMX GCC. I've flattened the structure using #elif's so it doesn't
get a level deeper for every platform added but the result should be the same.
I realise
2006 May 30
2
16 bits, cast on idct function
Hi all,
Just a stupid question
The IDctSlow function on file idct.c has this line :
ip[0] = (ogg_int16_t)((_Gd + _Cd ) >> 0);
The ip[0] , _Gd and _Cd are of type ogg_int32_t
My question is:
The result of (_Gd + _Cd) can be a number with more than 16 bits ?
(yes, it can be because they are int32, but the algorithm could
guarantee something about that... I dont know...)
If
2010 Aug 02
0
[LLVMdev] Register Allocation ERROR! Ran out of registers during register allocation!
Hi all,
My Machine environment is Clang-2.8-svn on Linux-x86. When I build
ffmpeg-0.6 using Clang, error output:
CC libavcodec/x86/mpegvideo_mmx.o
fatal error: error in backend: Ran out of registers during register
allocation!
Please check your inline asm statement for invalid constraints:
INLINEASM <es:movd %eax, %xmm3
pshuflw $$0, %xmm3, %xmm3
punpcklwd %xmm3, %xmm3
2007 Oct 09
1
VC6 Patch
Here is a patch that gets the theora_static.dsp project for VC6 building
again.
Aaron
-------------- next part --------------
Index: win32/theora_static.dsp
===================================================================
--- win32/theora_static.dsp (revision 13945)
+++ win32/theora_static.dsp (working copy)
@@ -41,7 +41,7 @@
# PROP Intermediate_Dir "Static_Release"
# PROP
2012 Nov 28
0
[LLVMdev] [llvm-commits] [dragonegg] r168787 - in /dragonegg/trunk: src/x86/Target.cpp src/x86/x86_builtins test/validator/c/copysignp.c
Hi Pawel, can you please pull this dragonegg patch into 3.2. I am the
code owner for dragonegg.
Thanks a lot, Duncan.
On 28/11/12 13:44, Duncan Sands wrote:
> Author: baldrick
> Date: Wed Nov 28 06:44:50 2012
> New Revision: 168787
>
> URL: http://llvm.org/viewvc/llvm-project?rev=168787&view=rev
> Log:
> Add support for GCC's vector copysign builtins, fixing
2011 Jun 17
0
[LLVMdev] RFC: Integer saturation intrinsics
On Fri, Jun 17, 2011 at 4:22 PM, Evan Cheng <evan.cheng at apple.com> wrote:
>
> On Jun 17, 2011, at 3:42 PM, Eli Friedman wrote:
>
>> On Fri, Jun 17, 2011 at 3:08 PM, Evan Cheng <evan.cheng at apple.com> wrote:
>>> Hi all,
>>>
>>> I'm proposing integer saturation intrinsics.
>>>
>>> def int_ssat :
2000 Oct 05
4
Macintosh support (again)
In order to compile the latest Vorbis library on the Macintosh, the
change appended to this message needs to be made to os_types.h.in.
Then, anyone who wants to actually build it must first copy
os_types.h.in to os_types.h by hand, since MacOS does not include a
command line and thus cannot run configure.
It really bugs me that the whole "platforms which don't support
configure
2006 Apr 19
0
making ices under Mac OS X
After I got the libshout problem taken care of, I tried installing
installing ices. It gave the following error:
metadata.c:55: error: 'fd_set' undeclared (first use in this function)
I searched the internet for fd_set and found that select uses it,
which needs sys/time.h, which wasn't included in metadata.c, so I
added #include <sys/time.h> with the other includes, and it seems
2009 Aug 30
3
experimental patch for libtheora1.1beta3
Good morning in the Lord
Regarding the port of libtheora1.1beta3 for OpenBSD for amd64 and the
problem I described at:
http://lists.xiph.org/pipermail/theora/2009-August/002640.html
Attached is a patch for
libtheora/patches/patch-lib_x86_mmxencfrag_c
I can play videos with it. ?Does it work for you?
Best regards
--
Dios, gracias por tu amor infinito.