Displaying 12 results from an estimated 12 matches for "lpc_intrin_sse41".
2014 Jan 24
2
PATCH for lpc_intrin_sse41.c: faster shifts
It turns out that int64 shift is quite slow...
This patch changes the code from:
(FLAC__int32)(xmm.m128i_i64[0] >> lp_quantization)
into:
_mm_cvtsi128_si32(_mm_srli_epi64(xmm, lp_quantization));
Encoding of 24-bit .wav files with 32-bit FLAC became noticeably faster.
The new code works only if quantization <= 32, but its max value is 15 so the code always work.
(max_shiftlimit == (1
2014 Jan 30
0
PATCH for lpc_intrin_sse41.c: faster shifts
lvqcl wrote:
> It turns out that int64 shift is quite slow...
>
> This patch changes the code from:
> (FLAC__int32)(xmm.m128i_i64[0] >> lp_quantization)
> into:
> _mm_cvtsi128_si32(_mm_srli_epi64(xmm, lp_quantization));
>
> Encoding of 24-bit .wav files with 32-bit FLAC became noticeably faster.
>
>
> The new code works only if quantization <= 32,
2014 Sep 20
2
[PATCH 4/4] lpc_intrin_sse41 routines
This patch increases speed of FLAC__lpc_restore_signal_wide_intrin_sse41
(decoding of 24-bit FLAC files for 32-bit platform).
-------------- next part --------------
A non-text attachment was scrubbed...
Name: lpc_sse4.zip
Type: application/zip
Size: 3310 bytes
Desc: not available
Url : http://lists.xiph.org/pipermail/flac-dev/attachments/20140920/a3d8efb4/attachment.zip
2014 Jun 28
0
[PATCH 14] preprocessor macros in lpc_intrin_sseN.c
Currently both lpc_intrin_sse2.c and lpc_intrin_sse41.c
define macros RESIDUAL_RESULT and DATA_RESULT.
This patch changes their names so they become different.
Reason: FLAC build systems don't apply specific options (such as
-msse4.1) to specific files. So it makes little sense to have
separate *_intrin_sseA.c and *_intrin_sseB.c files.
IMHO it...
2017 Jan 29
1
[PATCH 1/2] for lpc_intrin_sseNN.c
This patch fixes bracket placement, extra space, etc
in lpc_intrin_sse2.c and lpc_intrin_sse41.c
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ident_fix.patch
Type: application/octet-stream
Size: 3480 bytes
Desc: not available
URL: <http://lists.xiph.org/pipermail/flac-dev/attachments/20170129/e59c42eb/attachment.obj>
2014 Jun 27
4
Lets work towards a new version
...te:
> Like I reported just before the release of 1.3.0 (mail of Fri,
> 05 Apr 2013 08:25:10 +0200, to be specific), compiling on
> Raspbian (Debian Wheezy, GCC 4.6) returns quite some warnings of
> the type -Wcast-align.
>
> > CC lpc_intrin_sse2.lo
> > CC lpc_intrin_sse41.lo
> > CC md5.lo
> > md5.c: In function 'format_input_':
> > md5.c:282:25: warning: cast increases required alignment of
> > target type [-Wcast-align]
> > md5.c:288:24: warning: cast increases required alignment of
> > target type [-Wcast-align]...
2019 Jul 14
8
Prelease now available
Hi all,
I have a new pre-reelase (with a GPG signature) up here:
http://mega-nerd.com/tmp/flac-1.3.3rc1.tar.xz
http://mega-nerd.com/tmp/flac-1.3.3rc1.tar.xz.asc
This code is built from commit 10a28d482a8e48b806f61ab766992b2add98ec43
plus another commmit to change the version numbers which I will
not be pushing to the public repo before the final release.
Note that audio files encoded
2013 Sep 17
2
PATCH: x86-64 support and SSE intrinscis code
Erik de Castro Lopo <mle+la at mega-nerd.com> wrote:
>> -msse for SSE code, -msse2 for SSE2 code, -msse4.1 for SSE4.1 code
>
> Yes, that was it. Brain fart. These flags were not needed on x86_64.
>
> Erik
But now all C code is compiled with -msse2 and it won't work on older CPUs.
Isn't it better to compile only necessary files with this flag?
2014 Jun 19
0
Lets work towards a new version
...forgotten or people would like to see?
Like I reported just before the release of 1.3.0 (mail of Fri,
05 Apr 2013 08:25:10 +0200, to be specific), compiling on
Raspbian (Debian Wheezy, GCC 4.6) returns quite some warnings of
the type -Wcast-align.
> CC lpc_intrin_sse2.lo
> CC lpc_intrin_sse41.lo
> CC md5.lo
> md5.c: In function 'format_input_':
> md5.c:282:25: warning: cast increases required alignment of
> target type [-Wcast-align]
> md5.c:288:24: warning: cast increases required alignment of
> target type [-Wcast-align]
> CC memory.lo
>...
2013 Sep 28
4
PATCH: modify/add intrinsics code
The patch does the following:
1. splits lpc_x86intrin.c to lpc_intrin_sse.c and lpc_intrin_sse2.c
2. adds FLAC__lpc_compute_residual_from_qlp_coefficients_intrin_sse2()
function to lpc_intrin_sse2.c
3. adds lpc_intrin_sse41.c with two ..._wide_intrin_sse41() functions
(useful for 24-bit en-/decoding)
4. adds precompute_partition_info_sums_intrin_sse2() / ...ssse3() and
disables precompute_partition_info_sums_32bit_asm_ia32_().
SSE2 version uses 4 SSE2 instructions instead of 1 SSSE3 instruction PABSD...
2014 Jun 19
10
Lets work towards a new version
Hi all,
It sees that the most serious bug in the flac bug tracker:
https://sourceforge.net/p/flac/bugs/413/
has been fixed in git. This fix alone is worth a new release so its
time to work towards one.
Things I need to do for this new release:
* Deal with all current patches on the mailing list.
* Review all bugs reported against 1.3.0 on the sf.net.
* Testing and coordination of testing
2018 Jul 10
9
[PATCH 0/7] PowerPC64 performance improvements
The following series adds initial vector support for PowerPC64.
On POWER9, flac --best is about 3.3x faster.
Amitay Isaacs (2):
Add m4 macro to check for C __attribute__ features
Check if compiler supports target attribute on ppc64
Anton Blanchard (5):
configure.ac: Remove SPE detection code
configure.ac: Add VSX enable/disable
configure.ac: Fix FLAC__CPU_PPC on little endian, and add