Displaying 9 results from an estimated 9 matches for "_neon".

2018 May 24
2
NEON detection under iOs
Hi, Thank you for the great work. The Opus codec works well on many platforms. I have found a small performance issue on iOS. If the macro OPUS_HAVE_RTCD is not set, the encoder does not use some _neon functions at low bitrates (up to 64k). If the macro is set, the compiler hits the error at opus/celt/arm/armcpu.c:153 (the detection function is missing). Compared to the Android version, the performance degradation is ~30%. Detecting NEON under iOS is a bit tricky, because there is no API for it....
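To illustrate what run-time CPU detection (RTCD) means in the message above, here is a minimal sketch of a function-pointer table indexed by a detected architecture. All names (ARCH_C, ARCH_NEON, DOT_IMPL, detect_arch, dot) are hypothetical and are not libopus identifiers, and the "NEON" variant is written in plain C so the sketch builds on any platform.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical architecture indices, for illustration only. */
enum { ARCH_C = 0, ARCH_NEON = 1, ARCH_COUNT = 2 };

typedef long (*dot_fn)(const short *x, const short *y, size_t n);

/* Portable reference implementation. */
static long dot_c(const short *x, const short *y, size_t n)
{
    long sum = 0;
    for (size_t i = 0; i < n; i++)
        sum += (long)x[i] * y[i];
    return sum;
}

/* Stand-in for a _neon variant. It is plain C that accumulates in four
 * "lanes" to mimic a vector unit; a real variant would use NEON code. */
static long dot_neon_standin(const short *x, const short *y, size_t n)
{
    long acc[4] = {0, 0, 0, 0};
    size_t i = 0;
    for (; i + 4 <= n; i += 4)
        for (int lane = 0; lane < 4; lane++)
            acc[lane] += (long)x[i + lane] * y[i + lane];
    long sum = acc[0] + acc[1] + acc[2] + acc[3];
    for (; i < n; i++)          /* leftover elements */
        sum += (long)x[i] * y[i];
    return sum;
}

/* One entry per architecture; the index is chosen once at run time. */
static const dot_fn DOT_IMPL[ARCH_COUNT] = { dot_c, dot_neon_standin };

/* Hypothetical detection hook. The caller supplies the answer here,
 * because the message notes that iOS offers no API for the query. */
static int detect_arch(int cpu_has_neon)
{
    return cpu_has_neon ? ARCH_NEON : ARCH_C;
}

/* Dispatch through the table using the detected architecture. */
static long dot(const short *x, const short *y, size_t n, int arch)
{
    assert(arch >= 0 && arch < ARCH_COUNT);
    return DOT_IMPL[arch](x, y, n);
}
```

With a table like this, a build that has no working detection function either fails to compile or always lands on the plain C entry, which matches the two outcomes the message describes.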
2018 Aug 31
1
NEON detection under iOs
...ll optimized functions are called directly under iOS. Regards, Victor -----Original Message----- From: Jonathan Lennox Sent: Thursday, August 30, 2018 11:59 AM To: Victor Cherepanov Cc: opus at xiph.org Subject: Re: [opus] NEON detection under iOs On iOS, configure should define OPUS_ARM_PRESUME_NEON and OPUS_ARM_PRESUME_NEON_INTR if you’re building for armv7, and OPUS_ARM_PRESUME_NEON_INTR for arm64 (since every Apple armv7 CPU has supported Neon, and of course arm64 supports it though the assembler syntax is different). This should be slightly faster than using RTCD, since the Neon funct...
2017 Jun 01
2
Opus floating-point NEON jump table question
Thanks, Jean-Marc and Jonathan! I tested the current Opus encoder in floating-point mode at complexity 8. Hacking it with the attached patch (which generates "#define OPUS_ARM_MAY_HAVE_NEON 1" in config.h) speeds it up by about 14.7% on my Chromebook. This is probably because many NEON intrinsics optimizations benefit both the fixed-point and floating-point encoders. So if it's safe enough to enable MAY_HAVE_NEON in floating-point by default, it could speed up floating-point N...
2018 Aug 30
0
NEON detection under iOs
On iOS, configure should define OPUS_ARM_PRESUME_NEON and OPUS_ARM_PRESUME_NEON_INTR if you’re building for armv7, and OPUS_ARM_PRESUME_NEON_INTR for arm64 (since every Apple armv7 CPU has supported Neon, and of course arm64 supports it though the assembler syntax is different). This should be slightly faster than using RTCD, since the Neon functions...
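The rule stated in this reply can be written down as a small sketch. The enum, the flag constants and the function names below are hypothetical stand-ins (the real macros are set by configure, not by code like this), and the mapping from compiler-predefined macros assumes that any 32-bit Apple ARM target is armv7.

```c
#include <assert.h>

/* Hypothetical target identifiers, for illustration only. */
enum target { TARGET_APPLE_ARMV7, TARGET_APPLE_ARM64, TARGET_OTHER };

#define PRESUME_NEON      0x1u  /* stands for OPUS_ARM_PRESUME_NEON */
#define PRESUME_NEON_INTR 0x2u  /* stands for OPUS_ARM_PRESUME_NEON_INTR */

/* Encode the rule from the message: armv7 presumes both the assembly
 * and the intrinsics variants, arm64 presumes intrinsics only. */
static unsigned presumed_flags(enum target t)
{
    switch (t) {
    case TARGET_APPLE_ARMV7: return PRESUME_NEON | PRESUME_NEON_INTR;
    case TARGET_APPLE_ARM64: return PRESUME_NEON_INTR;
    default:                 return 0u;
    }
}

/* Map compiler-predefined macros to a target.
 * Assumption: every 32-bit Apple ARM build is treated as armv7. */
static enum target build_target(void)
{
#if defined(__APPLE__) && defined(__aarch64__)
    return TARGET_APPLE_ARM64;
#elif defined(__APPLE__) && defined(__arm__)
    return TARGET_APPLE_ARMV7;
#else
    return TARGET_OTHER;
#endif
}

/* Flags for the current build, with a sanity check on the bit set. */
static unsigned build_flags(void)
{
    unsigned flags = presumed_flags(build_target());
    assert((flags & ~(PRESUME_NEON | PRESUME_NEON_INTR)) == 0u);
    return flags;
}
```

Because the choice is fixed at build time, the optimized functions can be called directly and no run-time lookup is needed.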
2014 Nov 14
0
[RFC PATCH v1] arm: kf_bfly4: Introduce ARM neon intrinsics
...m/projectNe10/Ne10 Regards, Vish On 9 November 2014 15:34, Viswanath Puttagunta <viswanath.puttagunta at linaro.org> wrote: Hello, This patch introduces ARM NEON intrinsics to optimize the kf_bfly4 routine in the celt part of libopus. Using the NEON-optimized kf_bfly4 (_neon) routine improved the performance of the opus_fft_impl function by about 21.4%. The end use case was decoding a music opus ogg file, which saw a performance improvement of about 4.47%. This patch has 2 components: i. Actual neon code to improve kf_bfly4 ii....
2017 Jun 01
0
Opus floating-point NEON jump table question
Semantically, OPUS_ARM_MAY_HAVE_NEON is supposed to mean the compiler supports, and the CPU may support, Neon assembly code, which isn’t necessarily the same thing as the compiler supporting Neon intrinsics. (The Visual Studio ARM compiler, for instance, supports intrinsics but not assembly.) So I don’t think this patch is the right...
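The distinction drawn here, between a compiler that supports Neon assembly and one that supports Neon intrinsics, can be sketched as two separate capability flags. Everything below is hypothetical (the struct, the enum, and in particular the choice to prefer assembly when both are available); it only illustrates why one flag cannot stand in for the other.

```c
#include <assert.h>
#include <stdbool.h>

/* Which kind of implementation ends up being called. */
enum impl { IMPL_C, IMPL_NEON_ASM, IMPL_NEON_INTR };

/* What the compiler can build; two independent capabilities. */
struct toolchain {
    bool neon_asm;   /* can assemble Neon assembly sources */
    bool neon_intr;  /* can compile Neon intrinsics */
};

/* Pick an implementation. Preferring assembly over intrinsics when both
 * are available is an assumption made for this sketch only. */
static enum impl choose_impl(struct toolchain tc, bool cpu_has_neon)
{
    if (!cpu_has_neon)
        return IMPL_C;
    if (tc.neon_asm)
        return IMPL_NEON_ASM;
    if (tc.neon_intr)
        return IMPL_NEON_INTR;
    return IMPL_C;
}

/* Worked cases, including an intrinsics-only toolchain like the
 * Visual Studio ARM compiler mentioned in the message. */
static void check_examples(void)
{
    const struct toolchain intr_only = { false, true };
    const struct toolchain both = { true, true };
    assert(choose_impl(intr_only, true) == IMPL_NEON_INTR);
    assert(choose_impl(both, true) == IMPL_NEON_ASM);
    assert(choose_impl(both, false) == IMPL_C);
}
```

If a single flag covered both capabilities, the intrinsics-only toolchain would either lose its optimized path or be asked to build assembly it cannot handle.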
2017 Jun 02
2
Opus floating-point NEON jump table question
Thanks, Jonathan! I'll fix the MAY_HAVE_NEON() in silk/arm/arm_silk_map.c. Linfeng On Thu, Jun 1, 2017 at 3:34 PM, Jonathan Lennox <jonathan at vidyo.com> wrote: Semantically, OPUS_ARM_MAY_HAVE_NEON is supposed to mean the compiler supports, and the CPU may support, Neon assembly code, which isn’t necessarily the same t...
2014 Nov 09
3
[RFC PATCH v1] arm: kf_bfly4: Introduce ARM neon intrinsics
Hello, This patch introduces ARM NEON intrinsics to optimize the kf_bfly4 routine in the celt part of libopus. Using the NEON-optimized kf_bfly4 (_neon) routine improved the performance of the opus_fft_impl function by about 21.4%. The end use case was decoding a music opus ogg file, which saw a performance improvement of about 4.47%. This patch has 2 components: i. Actual neon code to improve kf_bfly4 ii. Infrastructure to include neon int...
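For readers unfamiliar with what a radix-4 butterfly computes, here is a scalar sketch of a 4-point forward DFT on complex floats. It is not the libopus kf_bfly4 code: it omits the twiddle-factor multiplications a full FFT stage needs, and the cpx type and bfly4_scalar name are hypothetical. An intrinsics version would perform the same additions and subtractions on several values at once.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical complex type for this sketch. */
typedef struct { float r, i; } cpx;

/* In-place 4-point forward DFT of f[0..3], without twiddle factors:
 *   X0 = x0 + x1 + x2 + x3
 *   X1 = (x0 - x2) - j(x1 - x3)
 *   X2 = x0 - x1 + x2 - x3
 *   X3 = (x0 - x2) + j(x1 - x3)
 */
static void bfly4_scalar(cpx *f)
{
    assert(f != NULL);

    cpx s02 = { f[0].r + f[2].r, f[0].i + f[2].i };  /* x0 + x2 */
    cpx d02 = { f[0].r - f[2].r, f[0].i - f[2].i };  /* x0 - x2 */
    cpx s13 = { f[1].r + f[3].r, f[1].i + f[3].i };  /* x1 + x3 */
    cpx d13 = { f[1].r - f[3].r, f[1].i - f[3].i };  /* x1 - x3 */

    f[0].r = s02.r + s13.r;  f[0].i = s02.i + s13.i;
    f[2].r = s02.r - s13.r;  f[2].i = s02.i - s13.i;

    /* -j * (a + jb) = b - ja */
    f[1].r = d02.r + d13.i;  f[1].i = d02.i - d13.r;
    /*  j * (a + jb) = -b + ja */
    f[3].r = d02.r - d13.i;  f[3].i = d02.i + d13.r;
}
```

The butterfly is only additions, subtractions and a swap of real and imaginary parts, which is the kind of regular arithmetic that maps well onto vector lanes.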
2015 Dec 23
6
[AArch64 neon intrinsics v4 0/5] Rework Neon intrinsic code for Aarch64 patchset
...Neon intrinsics for Silk noise shape feedback loop. Add Neon fixed-point implementation of xcorr_kernel.
Makefile.am               |   5 +-
celt/arm/arm_celt_map.c   |  17 ++++++
celt/arm/armcpu.c         |  35 +++++++----
celt/arm/armcpu.h         |   6 ++
celt/arm/celt_neon_intr.c |  61 ++++++++++++++++++-
celt/arm/pitch_arm.h      |  31 +++++++++-
silk/NSQ.c                |  57 ++++++-----------
silk/NSQ.h                |  97 +++++++++++++++++++++++++++++
silk/NSQ_del_dec.c        |  40 +++++------
silk/arm/NSQ_neon.c       | 135...