search for: lpc_intrin_sse41

Displaying 12 results from an estimated 12 matches for "lpc_intrin_sse41".

2014 Jan 24
2
PATCH for lpc_intrin_sse41.c: faster shifts
It turns out that int64 shift is quite slow... This patch changes the code from: (FLAC__int32)(xmm.m128i_i64[0] >> lp_quantization) into: _mm_cvtsi128_si32(_mm_srli_epi64(xmm, lp_quantization)); Encoding of 24-bit .wav files with 32-bit FLAC became noticeably faster. The new code works only if quantization <= 32, but its max value is 15 so the code always work. (max_shiftlimit == (1
2014 Jan 30
0
PATCH for lpc_intrin_sse41.c: faster shifts
lvqcl wrote: > It turns out that int64 shift is quite slow... > > This patch changes the code from: > (FLAC__int32)(xmm.m128i_i64[0] >> lp_quantization) > into: > _mm_cvtsi128_si32(_mm_srli_epi64(xmm, lp_quantization)); > > Encoding of 24-bit .wav files with 32-bit FLAC became noticeably faster. > > > The new code works only if quantization <= 32,
2014 Sep 20
2
[PATCH 4/4] lpc_intrin_sse41 routines
This patch increases speed of FLAC__lpc_restore_signal_wide_intrin_sse41 (decoding of 24-bit FLAC files for 32-bit platform). -------------- next part -------------- A non-text attachment was scrubbed... Name: lpc_sse4.zip Type: application/zip Size: 3310 bytes Desc: not available Url : http://lists.xiph.org/pipermail/flac-dev/attachments/20140920/a3d8efb4/attachment.zip
2014 Jun 28
0
[PATCH 14] preprocessor macros in lpc_intrin_sseN.c
Currently both lpc_intrin_sse2.c and lpc_intrin_sse41.c define macros RESIDUAL_RESULT and DATA_RESULT. This patch changes their names so they become different. Reason: FLAC build systems don't apply specific options (such as -msse4.1) to specific files. So it makes little sense to have separate *_intrin_sseA.c and *_intrin_sseB.c files. IMHO it...
2017 Jan 29
1
[PATCH 1/2] for lpc_intrin_sseNN.c
This patch fixes bracket placement, extra space, etc in lpc_intrin_sse2.c and lpc_intrin_sse41.c -------------- next part -------------- A non-text attachment was scrubbed... Name: ident_fix.patch Type: application/octet-stream Size: 3480 bytes Desc: not available URL: <http://lists.xiph.org/pipermail/flac-dev/attachments/20170129/e59c42eb/attachment.obj>
2014 Jun 27
4
Lets work towards a new version
...te: > Like I reported just before the release of 1.3.0 (mail of Fri, > 05 Apr 2013 08:25:10 +0200, to be specific), compiling on > Raspbian (Debian Wheezy, GCC 4.6) returns quite some warnings of > the type -Wcast-align. > > > CC lpc_intrin_sse2.lo > > CC lpc_intrin_sse41.lo > > CC md5.lo > > md5.c: In function 'format_input_': > > md5.c:282:25: warning: cast increases required alignment of > > target type [-Wcast-align] > > md5.c:288:24: warning: cast increases required alignment of > > target type [-Wcast-align]...
2019 Jul 14
8
Prelease now available
Hi all, I have a new pre-reelase (with a GPG signature) up here: http://mega-nerd.com/tmp/flac-1.3.3rc1.tar.xz http://mega-nerd.com/tmp/flac-1.3.3rc1.tar.xz.asc This code is built from commit 10a28d482a8e48b806f61ab766992b2add98ec43 plus another commmit to change the version numbers which I will not be pushing to the public repo before the final release. Note that audio files encoded
2013 Sep 17
2
PATCH: x86-64 support and SSE intrinscis code
Erik de Castro Lopo <mle+la at mega-nerd.com> wrote: >> -msse for SSE code, -msse2 for SSE2 code, -msse4.1 for SSE4.1 code > > Yes, that was it. Brain fart. These flags were not needed on x86_64. > > Erik But now all C code is compiled with -msse2 and it won't work on older CPUs. Isn't it better to compile only necessary files with this flag?
2014 Jun 19
0
Lets work towards a new version
...forgotten or people would like to see? Like I reported just before the release of 1.3.0 (mail of Fri, 05 Apr 2013 08:25:10 +0200, to be specific), compiling on Raspbian (Debian Wheezy, GCC 4.6) returns quite some warnings of the type -Wcast-align. > CC lpc_intrin_sse2.lo > CC lpc_intrin_sse41.lo > CC md5.lo > md5.c: In function 'format_input_': > md5.c:282:25: warning: cast increases required alignment of > target type [-Wcast-align] > md5.c:288:24: warning: cast increases required alignment of > target type [-Wcast-align] > CC memory.lo >...
2013 Sep 28
4
PATCH: modify/add intrinsics code
The patch does the following: 1. splits lpc_x86intrin.c to lpc_intrin_sse.c and lpc_intrin_sse2.c 2. adds FLAC__lpc_compute_residual_from_qlp_coefficients_intrin_sse2() function to lpc_intrin_sse2.c 3. adds lpc_intrin_sse41.c with two ..._wide_intrin_sse41() functions (useful for 24-bit en-/decoding) 4. adds precompute_partition_info_sums_intrin_sse2() / ...ssse3() and disables precompute_partition_info_sums_32bit_asm_ia32_(). SSE2 version uses 4 SSE2 instructions instead of 1 SSSE3 instruction PABSD...
2014 Jun 19
10
Lets work towards a new version
Hi all, It sees that the most serious bug in the flac bug tracker: https://sourceforge.net/p/flac/bugs/413/ has been fixed in git. This fix alone is worth a new release so its time to work towards one. Things I need to do for this new release: * Deal with all current patches on the mailing list. * Review all bugs reported against 1.3.0 on the sf.net. * Testing and coordination of testing
2018 Jul 10
9
[PATCH 0/7] PowerPC64 performance improvements
The following series adds initial vector support for PowerPC64. On POWER9, flac --best is about 3.3x faster. Amitay Isaacs (2): Add m4 macro to check for C __attribute__ features Check if compiler supports target attribute on ppc64 Anton Blanchard (5): configure.ac: Remove SPE detection code configure.ac: Add VSX enable/disable configure.ac: Fix FLAC__CPU_PPC on little endian, and add