search for: pitch_sse

Displaying 20 results from an estimated 24 matches for "pitch_sse".

2016 Sep 01
1
[PATCH] vs2015: include files added in 76674fea
...j index d9aaa03..f55ea37 100644 --- a/win32/VS2015/opus.vcxproj +++ b/win32/VS2015/opus.vcxproj @@ -852,6 +852,7 @@ <ClInclude Include="..\..\celt\vq.h" /> <ClInclude Include="..\..\celt\x86\celt_lpc_sse.h" /> <ClInclude Include="..\..\celt\x86\pitch_sse.h" /> + <ClInclude Include="..\..\celt\x86\vq_sse.h" /> <ClInclude Include="..\..\celt\x86\x86cpu.h" /> <ClInclude Include="..\..\celt\_kiss_fft_guts.h" /> <ClInclude Include="..\..\include\opus.h" /> @@ -913,...
2015 Mar 13
1
[RFC PATCH v3] Intrinsics/RTCD related fixes. Mostly x86.
...t_unit_dft.c | 4 +- celt/tests/test_unit_mathops.c | 11 +- celt/tests/test_unit_mdct.c | 4 +- celt/tests/test_unit_rotation.c | 11 +- celt/x86/celt_lpc_sse.c | 4 + celt/x86/celt_lpc_sse.h | 12 +- celt/x86/pitch_sse.c | 334 +++++++++++++------------------ celt/x86/pitch_sse.h | 256 ++++++++++------------- celt/x86/pitch_sse2.c | 95 +++++++++ celt/x86/pitch_sse4_1.c | 195 ++++++++++++++++++ celt/x86/x86_celt_map.c...
2015 Mar 12
1
[RFC PATCHv2] Intrinsics/RTCD related fixes. Mostly x86.
...t_unit_dft.c | 4 +- celt/tests/test_unit_mathops.c | 11 +- celt/tests/test_unit_mdct.c | 4 +- celt/tests/test_unit_rotation.c | 11 +- celt/x86/celt_lpc_sse.c | 4 + celt/x86/celt_lpc_sse.h | 12 +- celt/x86/pitch_sse.c | 334 +++++++++++++------------------ celt/x86/pitch_sse.h | 256 ++++++++++------------- celt/x86/pitch_sse2.c | 95 +++++++++ celt/x86/pitch_sse4_1.c | 195 ++++++++++++++++++ celt/x86/x86_celt_map.c...
2015 Mar 02
13
Patch cleaning up Opus x86 intrinsics configury
The attached patch cleans up Opus's x86 intrinsics configury. It: * Makes ?enable-intrinsics work with clang and other non-GCC compilers * Enables RTCD for the floating-point-mode SSE code in Celt. * Disables use of RTCD in cases where the compiler targets an instruction set by default. * Enables the SSE4.1 Silk optimizations that apply to the common parts of Silk when Opus is built in
2015 Aug 03
0
[PATCH 00/10] Patched cleaning up Opus x86 intrinsics configury
...| 4 +- celt/pitch.h | 19 +- celt/tests/test_unit_mathops.c | 9 +- celt/tests/test_unit_rotation.c | 9 +- celt/x86/celt_lpc_sse.c | 4 + celt/x86/celt_lpc_sse.h | 12 +- celt/x86/pitch_sse.c | 334 +++++++++++++------------------ celt/x86/pitch_sse.h | 261 ++++++++++-------------- celt/x86/pitch_sse2.c | 95 +++++++++ celt/x86/pitch_sse4_1.c | 195 ++++++++++++++++++ celt/x86/x86_celt_map.c...
2015 Mar 18
5
[RFC PATCH v1 0/4] Enable aarch64 intrinsics/Ne10
...ft.c | 55 +++-- celt/tests/test_unit_mathops.c | 21 +- celt/tests/test_unit_mdct.c | 85 ++++--- celt/tests/test_unit_rotation.c | 21 +- celt/x86/celt_lpc_sse.c | 4 + celt/x86/celt_lpc_sse.h | 12 +- celt/x86/pitch_sse.c | 334 ++++++++++--------------- celt/x86/pitch_sse.h | 256 ++++++++------------ celt/x86/pitch_sse2.c | 95 ++++++++ celt/x86/pitch_sse4_1.c | 195 +++++++++++++++ celt/x86/x86_celt_map.c | 76 +++++-...
2015 Mar 31
6
[RFC PATCH v1 0/5] aarch64: celt_pitch_xcorr: Fixed point series
...ft.c | 56 +++-- celt/tests/test_unit_mathops.c | 22 +- celt/tests/test_unit_mdct.c | 88 ++++--- celt/tests/test_unit_rotation.c | 22 +- celt/x86/celt_lpc_sse.c | 4 + celt/x86/celt_lpc_sse.h | 12 +- celt/x86/pitch_sse.c | 334 ++++++++++--------------- celt/x86/pitch_sse.h | 256 ++++++++------------ celt/x86/pitch_sse2.c | 95 ++++++++ celt/x86/pitch_sse4_1.c | 195 +++++++++++++++ celt/x86/x86_celt_map.c | 76 +++++-...
2015 Oct 20
2
AVX Optimizations
Hello, I'm currently looking to see if Opus could benefit from AVX/AVX2 instructions and I've made some experiments optimizing xcorr_kernel_* function in celt using AVX2 intrinsics and got about 1.5x-1.6x speedup over SSE floating point version in pitch_sse.c. Have AVX optimizations been tried in Opus? Is there someone I could talk to about optimizing compute intensive tasks in Opus using AVX? Thanks, Radu -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.xiph.org/pipermail/opus/attachments/20151020/d565fef...
2016 Sep 13
4
[PATCH 12/15] Replace call of celt_inner_prod_c() (step 1)
Should call celt_inner_prod(). --- celt/bands.c | 7 ++++--- celt/bands.h | 2 +- celt/celt_encoder.c | 6 +++--- celt/pitch.c | 2 +- src/opus_multistream_encoder.c | 2 +- 5 files changed, 10 insertions(+), 9 deletions(-) diff --git a/celt/bands.c b/celt/bands.c index bbe8a4c..1ab24aa 100644 --- a/celt/bands.c +++ b/celt/bands.c
2015 May 08
8
[RFC PATCH v2]: Ne10 fft fixed and previous 0/8]
...ft.c | 62 +++-- celt/tests/test_unit_mathops.c | 22 +- celt/tests/test_unit_mdct.c | 88 ++++--- celt/tests/test_unit_rotation.c | 22 +- celt/x86/celt_lpc_sse.c | 4 + celt/x86/celt_lpc_sse.h | 12 +- celt/x86/pitch_sse.c | 334 ++++++++++--------------- celt/x86/pitch_sse.h | 256 ++++++++------------ celt/x86/pitch_sse2.c | 95 ++++++++ celt/x86/pitch_sse4_1.c | 195 +++++++++++++++ celt/x86/x86_celt_map.c | 76 +++++-...
2015 May 15
11
[RFC V3 0/8] Ne10 fft fixed and previous
...ft.c | 62 +++-- celt/tests/test_unit_mathops.c | 22 +- celt/tests/test_unit_mdct.c | 88 ++++--- celt/tests/test_unit_rotation.c | 22 +- celt/x86/celt_lpc_sse.c | 4 + celt/x86/celt_lpc_sse.h | 12 +- celt/x86/pitch_sse.c | 334 ++++++++++--------------- celt/x86/pitch_sse.h | 256 ++++++++------------ celt/x86/pitch_sse2.c | 95 ++++++++ celt/x86/pitch_sse4_1.c | 195 +++++++++++++++ celt/x86/x86_celt_map.c | 76 +++++-...
2015 Apr 28
10
[RFC PATCH v1 0/8] Ne10 fft fixed and previous
...ft.c | 62 +++-- celt/tests/test_unit_mathops.c | 22 +- celt/tests/test_unit_mdct.c | 88 ++++--- celt/tests/test_unit_rotation.c | 22 +- celt/x86/celt_lpc_sse.c | 4 + celt/x86/celt_lpc_sse.h | 12 +- celt/x86/pitch_sse.c | 334 ++++++++++--------------- celt/x86/pitch_sse.h | 256 ++++++++------------ celt/x86/pitch_sse2.c | 95 ++++++++ celt/x86/pitch_sse4_1.c | 195 +++++++++++++++ celt/x86/x86_celt_map.c | 76 +++++-...
2015 Jan 29
2
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
...t_arm_ne10.h \ > celt/arm/armcpu.h \ Wrong indentation. > celt/arm/fixed_armv4.h \ > celt/arm/fixed_armv5e.h \ > celt/arm/kiss_fft_armv4.h \ > celt/arm/kiss_fft_armv5e.h \ > celt/arm/pitch_arm.h \ > +celt/arm/fft_arm.h \ > +celt/arm/mdct_arm.h \ > celt/x86/pitch_sse.h \ Etc.
2015 Jan 29
0
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
...; > Wrong indentation. > >> celt/arm/fixed_armv4.h \ >> celt/arm/fixed_armv5e.h \ >> celt/arm/kiss_fft_armv4.h \ >> celt/arm/kiss_fft_armv5e.h \ >> celt/arm/pitch_arm.h \ >> +celt/arm/fft_arm.h \ >> +celt/arm/mdct_arm.h \ >> celt/x86/pitch_sse.h \ > > > Etc.
2014 Nov 09
0
[RFC PATCH v1] arm: kf_bfly4: Introduce ARM neon intrinsics
...ft_state *st, diff --git a/celt_headers.mk b/celt_headers.mk index 5bb193e..db4c2f3 100644 --- a/celt_headers.mk +++ b/celt_headers.mk @@ -37,5 +37,6 @@ celt/arm/fixed_armv5e.h \ celt/arm/kiss_fft_armv4.h \ celt/arm/kiss_fft_armv5e.h \ celt/arm/pitch_arm.h \ +celt/arm/kiss_fft_neon.h \ celt/x86/pitch_sse.h \ celt/x86/x86cpu.h diff --git a/celt_sources.mk b/celt_sources.mk index 20b1b1b..b27bf3e 100644 --- a/celt_sources.mk +++ b/celt_sources.mk @@ -32,3 +32,6 @@ celt/arm/celt_pitch_xcorr_arm.s CELT_AM_SOURCES_ARM_ASM = \ celt/arm/armopts.s.in + +CELT_SOURCES_ARM_NEON = \ +celt/arm/kiss_fft_neo...
2014 Nov 09
3
[RFC PATCH v1] arm: kf_bfly4: Introduce ARM neon intrinsics
Hello, This patch introduces ARM NEON Intrinsics to optimize kf_bfly4 routine in celt part of libopus. Using NEON optimized kf_bfly4(_neon) routine helped improve performance of opus_fft_impl function by about 21.4%. The end use case was decoding a music opus ogg file. The end use case saw performance improvement of about 4.47%. This patch has 2 components i. Actual neon code to improve
2015 May 15
0
[RFC V3 7/8] armv7, armv8: Optimize fixed point fft using NE10 library
...+#endif #include "celt_lpc.c" #include "pitch.c" -#include "arm/celt_neon_intr.c" #include "arm/arm_celt_map.c" -#endif #elif defined(OPUS_X86_MAY_HAVE_SSE2) || defined(OPUS_X86_MAY_HAVE_SSE4_1) #include "x86/x86cpu.c" #include "celt/x86/pitch_sse.c" diff --git a/celt/tests/test_unit_mdct.c b/celt/tests/test_unit_mdct.c index 53258fe..5d9a8f5 100644 --- a/celt/tests/test_unit_mdct.c +++ b/celt/tests/test_unit_mdct.c @@ -49,16 +49,16 @@ #if defined(OPUS_ARM_MAY_HAVE_NEON_INTR) || defined(OPUS_ARM_ASM) #include "arm/armcpu.c&quot...
2015 Jan 20
6
[RFC PATCH v1 0/2] Encode optimize using libNE10
Hello opus-dev, I've been cooking up this patchset to integrate NE10 library into opus. Current patchset focuses on encode use case mainly effecting performance of clt_mdct_forward() and opus_fft() (for float only) Glad to report the following on Encode use case: (Measured on my Beaglebone Black Cortex-A8 board) - Performance improvement for encode use case ~= 12.34% (Based on time -p
2015 Feb 04
0
[RFC PATCH v2] armv7(float): Optimize encode usecase using NE10 library
.../celt/tests/test_unit_mathops.c @@ -49,6 +49,8 @@ #include "cwrs.c" #include "pitch.c" #include "celt_lpc.c" +#include "kiss_fft.c" +#include "mdct.c" #if defined(OPUS_X86_MAY_HAVE_SSE4_1) || defined(OPUS_X86_MAY_HAVE_SSE2) #include "x86/pitch_sse.c" @@ -60,6 +62,10 @@ || defined(OPUS_ARM_NEON_INTR)) #if defined(OPUS_ARM_NEON_INTR) #include "arm/celt_neon_intr.c" +#if defined(HAVE_ARM_NE10) +#include "arm/celt_ne10_fft.c" +#include "arm/celt_ne10_mdct.c" +#endif #endif #include "arm/arm_cel...
2015 Mar 03
0
[RFC PATCHv3] armv7(float): Optimize encode usecase using NE10 library
.../celt/tests/test_unit_mathops.c @@ -49,6 +49,8 @@ #include "cwrs.c" #include "pitch.c" #include "celt_lpc.c" +#include "kiss_fft.c" +#include "mdct.c" #if defined(OPUS_X86_MAY_HAVE_SSE4_1) || defined(OPUS_X86_MAY_HAVE_SSE2) #include "x86/pitch_sse.c" @@ -60,6 +62,10 @@ || defined(OPUS_ARM_NEON_INTR)) #if defined(OPUS_ARM_NEON_INTR) #include "arm/celt_neon_intr.c" +#if defined(HAVE_ARM_NE10) +#include "arm/celt_ne10_fft.c" +#include "arm/celt_ne10_mdct.c" +#endif #endif #include "arm/arm_cel...