Displaying 20 results from an estimated 200 matches similar to: "[RFC PATCHv3] Encode optimize using libNe10"
2015 Feb 26
0
[RFC PATCH v2] Encode optimize using libNe10
On 25 February 2015 at 19:54, Timothy B. Terriberry <tterribe at xiph.org> wrote:
> Viswanath Puttagunta wrote:
>>
>> Can we please have review on RFCv2? We have quite a few optimizations
>> (Eg: ifft/mdct_backwards, fixed point fft/ifft mdct_forward/backward
>> etc) that are in my pipeline that depend on this patch series being
>> accepted.
>
>
>
2015 Mar 06
0
[RFC PATCH v2] Encode optimize using libNe10
Hi Timothy,
Do you prefer configuration you mentioned before:
$ configure --host=arm-linux-gnueabihf --enable-intrinsics \
--with-NE10-libraries=<path>/armv7_root/binary/usr/local/lib \
--with-NE10-includes=<path>//armv7_root/binary/usr/local/include/Ne10
Instead of
$ configure --enable-intrinsics \
--with-NE10-libraries=$HOME/src/xiph/git/arm/Ne10/build/modules \
2015 Mar 03
2
[RFC PATCHv3] Encode optimize using libNe10
Changes from RFC PATCH v2
- fixed compile issue when just compiling for --enable-intrinsics
for ARMv7 without NE10
- Notes for NE10:
- All compile/link warnings are now in upstream NE10
- Only patch pending upstream in NE10 is the one that
needs to add -funsafe-math-optimizations for ARMv7
targets.
- Phil Wang @ ARM is working on getting this fixed.
- Note that even without
2015 Feb 16
0
[RFC PATCH v2] Encode optimize using libNe10
Hello opus-dev,
Can we please have review on RFCv2? We have quite a few optimizations
(Eg: ifft/mdct_backwards, fixed point fft/ifft mdct_forward/backward
etc) that are in my pipeline that depend on this patch series being
accepted.
Regards,
Vish
On 4 February 2015 at 12:21, Viswanath Puttagunta
<viswanath.puttagunta at linaro.org> wrote:
> Changes from RFC PATCH v1:
> - passing
2015 Feb 05
0
[RFC PATCH v2] Encode optimize using libNe10
Hi Viswanath,
Great to see it coming.
> Phil,
>
> As you mentioned earlier, could you please address all
> compile and linker errors/warnings coming out of Ne10 library?
[Phil Wang] OK, I will deliver it. But I will try to add -funsafe-math-optimisations flag to our build system first.
Also I will have a look into our build system on Linux. From your previous response, I guess there
2015 Feb 26
3
[RFC PATCH v2] Encode optimize using libNe10
Viswanath Puttagunta wrote:
> Can we please have review on RFCv2? We have quite a few optimizations
> (Eg: ifft/mdct_backwards, fixed point fft/ifft mdct_forward/backward
> etc) that are in my pipeline that depend on this patch series being
> accepted.
So, trying to make progress on this...
On an armv7l board running Ubuntu, you've broken the build with just
--enable-intrinsics
2015 Feb 27
0
[RFC PATCH v2] Encode optimize using libNe10
Hi Timothy,
> The instructions for building libNE10 in doc/BuildingNe10.txt are also
> incorrect. I don't know if -DGNULINUX_PLATFORM=ON does anything, but I
> needed to specify -DNE10_LINUX_TARGET_ARCH=armv7 to avoid cmake
> failing with a 'syntax error' at line 83.
>
>
> In short, this needs some work.
Yes, -DNE10_LINUX_TARGET_ARCH=armv7 is needed for native
2015 Mar 04
1
[RFC PATCH v1] Decode(float) optimize using libNe10
Hello All,
I extended the libNE10 optimizations for float towards
mdct_backwards/opus_ifft.
I am able to get about 14.26% improvement for Decode use
case now on my Beaglebone Black. Please see [1] for measurements.
Questions
1. Since this patch needs to go in after Encode [2] patch)
should I submit this as patch series?
2. Since Jonathan Lennox posted intrinsics cleanup [3]
patch, should
2015 Mar 03
1
[RFC PATCH v4] Enable optimize using libNe10
Changes from RFC PATCH v3
- Just rebased on tip
- For all else, please see notes from RFC PATCH v3 at
http://lists.xiph.org/pipermail/opus/2015-March/002902.html
- latest wip opus tree/branch
https://git.linaro.org/people/viswanath.puttagunta/Ne10.git
branch: rfcv4_final_fft_ne10
Viswanath Puttagunta (1):
armv7(float): Optimize encode usecase using NE10 library
Makefile.am
2015 Feb 04
4
[RFC PATCH v2] Encode optimize using libNe10
Changes from RFC PATCH v1:
- passing arch parameter explicitly
- reduced stack usage by ~3.5K by using scaled NE10 fft version
- moved all optimization array functions to arm_celt_map.c
- Other cleanups pointed out by Timothy
Phil,
As you mentioned earlier, could you please address all
compile and linker errors/warnings coming out of Ne10 library?
You can find my working Ne10 repo at [1]
You
2015 Jan 20
6
[RFC PATCH v1 0/2] Encode optimize using libNE10
Hello opus-dev,
I've been cooking up this patchset to integrate NE10 library into opus.
Current patchset focuses on encode use case mainly effecting performance of
clt_mdct_forward() and opus_fft() (for float only)
Glad to report the following on Encode use case:
(Measured on my Beaglebone Black Cortex-A8 board)
- Performance improvement for encode use case ~= 12.34% (Based on time -p
2014 Dec 11
2
[ARM][FFT][NEON] Integrate Ne10 into Opus?
Hi everyone,
I am working on Ne10 project. Ne10 provides NEON optimized FFT routines that
are much faster (compared to those without NEON), on most ARMv7-A and all
ARMv8-A devices. How about integrate it into Opus?
I am not familiar with configure script, but I find "Optinal Packages" in
it. If we provides --with-ne10-fft option, the one extra thing that users
need to do is to
2015 Apr 02
2
Testing ARMv8 Ne10 and intrinsics branch
Using GCC 4.9.2, decoding the opus test vector set 10 times. All tests
pass. I will do longer tests later with a larger test set, but it looks
good so far.
I am having a lot of trouble with the Ne10 detection. Using the
precompiled Ne10 binaries at
http://people.linaro.org/~viswanath.puttagunta/opus/NE10_root/
Both NE10 and Ne10 capitalizations seem to be in use. Also, where should
the NE10
2015 Apr 02
0
Testing ARMv8 Ne10 and intrinsics branch
Hello Thomas,
I use the following configure command to link against Ne10
Eg:
configure --host=arm-linux-gnueabihf --enable-intrinsics
--with-NE10-libraries=${BUILD_NE10_LIB}
--with-NE10-includes=${BUILD_NE10_INC}"
So, in my normal testing, I explicitly specify where the NE10 header
files are installed and where the NE10 libraries are installed.
Looking back at configure.ac
2015 Jan 29
0
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
Hi Timothy,
Appreciate the comprehensive code review.
The biggest issue I see is the peak stack usage.... rest looks like
fairly straight forward cleanup.
Is the peak stack usage a complete blocker in current form?
If it is indeed a blocker, would it be acceptable if we can reduce
additional buffer requirement from 2 buffers (current) to 1, possibly
by moving scaling inside
2015 Jan 29
2
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
Viswanath Puttagunta wrote:
> if OPUS_ARM_NEON_INTR
> CELT_ARM_NEON_INTR_OBJ = $(CELT_SOURCES_ARM_NEON_INTR:.c=.lo) \
> - %test_unit_rotation.o %test_unit_mathops.o
> -$(CELT_ARM_NEON_INTR_OBJ): CFLAGS += $(OPUS_ARM_NEON_INTR_CPPFLAGS)
> + $(CELT_SOURCES_ARM_NE10:.c=.lo) \
> + %test_unit_rotation.o %test_unit_mathops.o \
> +
2014 Dec 10
0
[ARM][FFT][NEON] Integrate Ne10 into Opus?
Hi everyone,
I am working on Ne10 project. Ne10 provides NEON optimized FFT routines that
are much faster (compared to those without NEON), on most ARMv7-A and all
ARMv8-A devices. How about integrate it into Opus?
I am not familiar with configure script, but I find "Optinal Packages" in
it. If we provides --with-ne10-fft option, the one extra thing that users
need to do is to
2015 Jan 20
0
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
Optimize opus encode (float only) usecase using ARM NE10
library. Mainly effects opus_fft and ctl_mdct_forward
and related functions.
This optimization can be used for ARM CPUs that have NEON
VFP unit. This patch only enables optimizations for ARMv7.
Official ARM NE10 library page available at
http://projectne10.github.io/Ne10/
To enable this optimization, use
--enable-intrinsics
2015 Feb 04
0
[RFC PATCH v2] armv7(float): Optimize encode usecase using NE10 library
Optimize opus encode (float only) usecase using ARM NE10
library. Mainly effects opus_fft and ctl_mdct_forward
and related functions.
This optimization can be used for ARM CPUs that have NEON
VFP unit. This patch only enables optimizations for ARMv7.
Official ARM NE10 library page available at
http://projectne10.github.io/Ne10/
To enable this optimization, use
--enable-intrinsics
2015 Mar 03
0
[RFC PATCHv3] armv7(float): Optimize encode usecase using NE10 library
Optimize opus encode (float only) usecase using ARM NE10
library. Mainly effects opus_fft and ctl_mdct_forward
and related functions.
This optimization can be used for ARM CPUs that have NEON
VFP unit. This patch only enables optimizations for ARMv7.
Official ARM NE10 library page available at
http://projectne10.github.io/Ne10/
To enable this optimization, use
--enable-intrinsics