Displaying 20 results from an estimated 400 matches similar to: "[PATCH] promised MMX patches rc1"
2005 Mar 23
0
[PATCH]
Hello,
Here is my first speedup patch. Like 10-11%. No IDCT yet.
Please feel free to comment my code or even better think about
improvements. :) I belive my routines are not so bad, maybe
one day they will be even more faster.
What needs to be optimized is the loop filter fuction. I have
no ideas now how to do it. It does not leave much space for parallel
stuff, copying memory from lot of
2005 Jul 20
1
MMX IDCT for theora-exp
Hello,
I'm attaching IDCT MMX patch. I reused IDCT from theora-a3-MMXd.zip.
It should work on 64bit X86 platform too.
Here is most used functions when playing video with jet aircrafts (gripen)
Ogg logical stream 310b2968 is Theora 720x480 29.97 fps video
Encoded frame content is 720x480 with 0x0 offset
I can play this video with like 200-300 frame drops on Athlon XP 1700+
CPU load (with
2005 Aug 20
0
[PATCH] remove some FZIGZAG
Hello,
As we discussed with derf some time ago, it seems it is not neccessary to enforce "forward" order of dct_coeffs.
This patch gains .99366902855226196000% so approx 1% speedup.
Meausurement method:
time nice -n -19 ./dump /mnt/disc4/theora/unix/gripen.ogg > /dev/null
Ogg logical stream 310b2968 is Theora 720x480 29.97 fps video
Encoded frame content is 720x480 with 0x0 offset
2005 Aug 17
2
MMX loop filter for theora-exp
Hello,
I would like to announce the semi-optimized oc_state_loop_filter_frag_rows
It gains like 7% speedup. Unfortunately it has some issues:
1) wont compile on 64bit (I will fix it later hopefully)
2) is not yet fully optimized (instruction stalls)
Here are the results.
CPU: Athlon, speed 1466.91 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask
2007 Sep 26
1
Theora decoding problem on PowerPC
Hi,
I'm attempting to decode Theora videos on a PowerPC running a Linux
2.6.19 kernel. The version of GCC I'm cross-compiling from is 3.4.4.
The software versions I'm running are:
libogg-1.1.3
libpng-1.2.20
libtheora-1.0beta1
libvorbis-1.2.0
These are all the latest I was able to download.
Here's a back trace I got while running "dump_video" under
2009 Aug 30
3
experimental patch for libtheora1.1beta3
Good morning in the Lord
Regarding the port of libtheora1.1beta3 for OpenBSD for amd64 and the
problem I described at:
http://lists.xiph.org/pipermail/theora/2009-August/002640.html
Attached is a patch for
libtheora/patches/patch-lib_x86_mmxencfrag_c
I can play videos with it. ?Does it work for you?
Best regards
--
Dios, gracias por tu amor infinito.
2010 Jul 24
2
theorarm build
Hi all--
I tried building the ARM-optimized theora codec from the theorarm-
merge-branch, and encountered the following compile and runtime
problems before getting something to run. If there is another way to
build it, it would be nice to know, but I got the sense that its
current state in svn is incomplete.
I'm using a gcc cross-compiler for ARM on an x86 Linux PC. After
running
2004 Aug 24
5
MMX/mmxext optimisations
quite some speed improvement indeed.
attached the updated patch to apply to svn/trunk.
j
-------------- next part --------------
A non-text attachment was scrubbed...
Name: theora-mmx.patch.gz
Type: application/x-gzip
Size: 8648 bytes
Desc: not available
Url : http://lists.xiph.org/pipermail/theora-dev/attachments/20040824/5a5f2731/theora-mmx.patch-0001.bin
2004 Oct 22
5
theora-mmx_on_win32?
Hi. Has anyone tried http://svn.xiph.org/branches/theora-mmx this code on Win32 ?
I can compile it with very small modification,
304c304
< ogg_int16_t *const temp= (ogg_int16_t*)align_tmp;
---
> ogg_int16_t *const temp= (int16_t*)align_tmp;
but outputs seem terribly broken. -> ex. http://mycomputer.cc/temp/mmx-out.ogg
GCC version is 3.4.2.
$ gcc --version
gcc.exe (GCC) 3.4.2
2002 Dec 10
2
mingw compiling problem for libogg
(i hope this is correct m.list)
Hi,
there is a small compiling problem for mingw
when compiling on libogg..
in include/ogg/os_types.h :
ogg_int64_t, ogg_int32_t, etc are defined
correctly on cygwin and MSVC/Borland
but not on mingw...
i have attached a patch that will fix
this problem (i hope it attaches
correctly)
thx, Nehal
--- os_types.h.old Fri Jul 19 02:25:52 2002
+++ os_types.h Tue
2010 May 18
2
idct8x8 C version in libtheora1.1 release
When using the IDCT routines, the C version [ lib/idct.c:
oc_idct8x8_c(ogg_int16_t _y[64],int _last_zzi)] in libtheora 1.1.1, the
decoded image is garbled. Is it functionally equivalent to the MMX optimized
version [lib/x86/mmxidct.c: oc_idct8x8_mmx(ogg_int16_t _y[64],int
_last_zzi)] ?
I used some of the Theora video files from here:
http://wiki.xiph.org/index.php/List_of_Theora_videos for
2008 Mar 07
1
Bug in reference idct.
Hi
The Theora specification states, in section 7.9.3 ("The 1D Inverse DCT")
steps 14-16:
14. Assign T[5] the value T[4] - T[5].
15. Truncate T[5] to a 16-bit representation by dropping any higher-order
bits.
16. Assign T[5] the value C4 * (-T[5]) >> 16.
However, the relevant section of code in the reference decoder
(lib/dec/idct.c line 50) is:
2011 Mar 28
3
DCT in Theora
> I put debug code in a function in C,
> but the function oc_enc_fdct8x8() not called.
> Why?
There is no function oc_enc_fdct8x8. It's a macro, which usually calls a
platform-specific version via _enc->opt_vtable.fdct8x8, though on some
platforms, it will call a specific version directly (e.g.,
oc_enc_fdct8x8_x86_64sse2 on x86-64). All of the functions with
platform-specific
2001 Apr 18
3
[Patch] OS/2 port
[Repost, maybe it will get noticed with a better subject]
So now beta 4 is out is there any chance you guys will have time to review
some of my patches? Here's a simple one. It just adds the standard integer
typedefs for EMX GCC. I've flattened the structure using #elif's so it doesn't
get a level deeper for every platform added but the result should be the same.
I realise
2006 May 30
2
16 bits, cast on idct function
Hi all,
Just a stupid question
The IDctSlow function on file idct.c has this line :
ip[0] = (ogg_int16_t)((_Gd + _Cd ) >> 0);
The ip[0] , _Gd and _Cd are of type ogg_int32_t
My question is:
The result of (_Gd + _Cd) can be a number with more than 16 bits ?
(yes, it can be because they are int32, but the algorithm could
guarantee something about that... I dont know...)
If
2007 Oct 09
1
VC6 Patch
Here is a patch that gets the theora_static.dsp project for VC6 building
again.
Aaron
-------------- next part --------------
Index: win32/theora_static.dsp
===================================================================
--- win32/theora_static.dsp (revision 13945)
+++ win32/theora_static.dsp (working copy)
@@ -41,7 +41,7 @@
# PROP Intermediate_Dir "Static_Release"
# PROP
2000 Oct 05
4
Macintosh support (again)
In order to compile the latest Vorbis library on the Macintosh, the
change appended to this message needs to be made to os_types.h.in.
Then, anyone who wants to actually build it must first copy
os_types.h.in to os_types.h by hand, since MacOS does not include a
command line and thus cannot run configure.
It really bugs me that the whole "platforms which don't support
configure
2006 Apr 19
0
making ices under Mac OS X
After I got the libshout problem taken care of, I tried installing
installing ices. It gave the following error:
metadata.c:55: error: 'fd_set' undeclared (first use in this function)
I searched the internet for fd_set and found that select uses it,
which needs sys/time.h, which wasn't included in metadata.c, so I
added #include <sys/time.h> with the other includes, and it seems
2003 Feb 17
2
LccWin32 and OGG
Hello,
I'm new to OGG and I must say than it's impressive. I plan to use it
in my project (BASIC programming langage) but I use LccWin32 as main
C compiler (free and very powerful) and OGG has some difficulties to
compile. I would like to know if it was already reported. If not,
here it is:
- Inline ASM functions for vorbis_itoa() isn't ok for LCC (not the
same syntax than VC). I
2001 Jan 29
1
Mac OS X/Darwin patch for libvorbis
This is also pretty simple. Just a small patch to configure.in to set the right compiler flags and changes to os.h to (a) not have deeply nested ifdefs, making the file needlessly hard to read, (b) break up the sections by functionality and (c) don't assume that every non-Win32 has the floating versions of math library functions.
-tim
Index: configure.in