Displaying 20 results from an estimated 5000 matches similar to: "[RFC PATCH v2] Encode optimize using libNe10"
2015 Feb 26
0
[RFC PATCH v2] Encode optimize using libNe10
On 25 February 2015 at 19:54, Timothy B. Terriberry <tterribe at xiph.org> wrote:
> Viswanath Puttagunta wrote:
>>
>> Can we please have review on RFCv2? We have quite a few optimizations
>> (Eg: ifft/mdct_backwards, fixed point fft/ifft mdct_forward/backward
>> etc) that are in my pipeline that depend on this patch series being
>> accepted.
>
>
>
2015 Feb 26
3
[RFC PATCH v2] Encode optimize using libNe10
Viswanath Puttagunta wrote:
> Can we please have review on RFCv2? We have quite a few optimizations
> (Eg: ifft/mdct_backwards, fixed point fft/ifft mdct_forward/backward
> etc) that are in my pipeline that depend on this patch series being
> accepted.
So, trying to make progress on this...
On an armv7l board running Ubuntu, you've broken the build with just
--enable-intrinsics
2015 Feb 16
0
[RFC PATCH v2] Encode optimize using libNe10
Hello opus-dev,
Can we please have review on RFCv2? We have quite a few optimizations
(Eg: ifft/mdct_backwards, fixed point fft/ifft mdct_forward/backward
etc) that are in my pipeline that depend on this patch series being
accepted.
Regards,
Vish
On 4 February 2015 at 12:21, Viswanath Puttagunta
<viswanath.puttagunta at linaro.org> wrote:
> Changes from RFC PATCH v1:
> - passing
2015 Feb 05
0
[RFC PATCH v2] Encode optimize using libNe10
Hi Viswanath,
Great to see it coming.
> Phil,
>
> As you mentioned earlier, could you please address all
> compile and linker errors/warnings coming out of Ne10 library?
[Phil Wang] OK, I will deliver it. But I will try to add -funsafe-math-optimisations flag to our build system first.
Also I will have a look into our build system on Linux. From your previous response, I guess there
2015 Mar 04
0
[RFC PATCHv3] Encode optimize using libNe10
Hi Timothy and Viswanath,
> FYI, I got Phil @ ARM to independently verify for any compile/link
> warning/errors and he said he did not find any... And since I haven't
> heard from you for a week, I went ahead and pushed RFCv3.
Yes, I do get it built without compile/link warning/errors.
To save some time, please turn off other modules in Ne10
Open $NE10_DIR/CMakeLists.txt and find
2015 Feb 27
0
[RFC PATCH v2] Encode optimize using libNe10
Hi Timothy,
> The instructions for building libNE10 in doc/BuildingNe10.txt are also
> incorrect. I don't know if -DGNULINUX_PLATFORM=ON does anything, but I
> needed to specify -DNE10_LINUX_TARGET_ARCH=armv7 to avoid cmake
> failing with a 'syntax error' at line 83.
>
>
> In short, this needs some work.
Yes, -DNE10_LINUX_TARGET_ARCH=armv7 is needed for native
2015 Jan 29
0
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
Hi Timothy,
Appreciate the comprehensive code review.
The biggest issue I see is the peak stack usage.... rest looks like
fairly straight forward cleanup.
Is the peak stack usage a complete blocker in current form?
If it is indeed a blocker, would it be acceptable if we can reduce
additional buffer requirement from 2 buffers (current) to 1, possibly
by moving scaling inside
2015 Apr 02
0
Testing ARMv8 Ne10 and intrinsics branch
Hello Thomas,
I use the following configure command to link against Ne10
Eg:
configure --host=arm-linux-gnueabihf --enable-intrinsics
--with-NE10-libraries=${BUILD_NE10_LIB}
--with-NE10-includes=${BUILD_NE10_INC}"
So, in my normal testing, I explicitly specify where the NE10 header
files are installed and where the NE10 libraries are installed.
Looking back at configure.ac
2015 Jan 29
2
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
Viswanath Puttagunta wrote:
> if OPUS_ARM_NEON_INTR
> CELT_ARM_NEON_INTR_OBJ = $(CELT_SOURCES_ARM_NEON_INTR:.c=.lo) \
> - %test_unit_rotation.o %test_unit_mathops.o
> -$(CELT_ARM_NEON_INTR_OBJ): CFLAGS += $(OPUS_ARM_NEON_INTR_CPPFLAGS)
> + $(CELT_SOURCES_ARM_NE10:.c=.lo) \
> + %test_unit_rotation.o %test_unit_mathops.o \
> +
2015 Mar 04
1
[RFC PATCH v1] Decode(float) optimize using libNe10
Hello All,
I extended the libNE10 optimizations for float towards
mdct_backwards/opus_ifft.
I am able to get about 14.26% improvement for Decode use
case now on my Beaglebone Black. Please see [1] for measurements.
Questions
1. Since this patch needs to go in after Encode [2] patch)
should I submit this as patch series?
2. Since Jonathan Lennox posted intrinsics cleanup [3]
patch, should
2015 Jan 20
0
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
Optimize opus encode (float only) usecase using ARM NE10
library. Mainly effects opus_fft and ctl_mdct_forward
and related functions.
This optimization can be used for ARM CPUs that have NEON
VFP unit. This patch only enables optimizations for ARMv7.
Official ARM NE10 library page available at
http://projectne10.github.io/Ne10/
To enable this optimization, use
--enable-intrinsics
2015 Feb 04
4
[RFC PATCH v2] Encode optimize using libNe10
Changes from RFC PATCH v1:
- passing arch parameter explicitly
- reduced stack usage by ~3.5K by using scaled NE10 fft version
- moved all optimization array functions to arm_celt_map.c
- Other cleanups pointed out by Timothy
Phil,
As you mentioned earlier, could you please address all
compile and linker errors/warnings coming out of Ne10 library?
You can find my working Ne10 repo at [1]
You
2015 Apr 02
2
Testing ARMv8 Ne10 and intrinsics branch
Using GCC 4.9.2, decoding the opus test vector set 10 times. All tests
pass. I will do longer tests later with a larger test set, but it looks
good so far.
I am having a lot of trouble with the Ne10 detection. Using the
precompiled Ne10 binaries at
http://people.linaro.org/~viswanath.puttagunta/opus/NE10_root/
Both NE10 and Ne10 capitalizations seem to be in use. Also, where should
the NE10
2015 May 08
0
[[RFC PATCH v2]: Ne10 fft fixed and previous 1/8] armv7(float): Optimize encode usecase using NE10 library
Optimize opus encode (float only) usecase using ARM NE10
library. Mainly effects opus_fft and ctl_mdct_forward
and related functions.
This optimization can be used for ARM CPUs that have NEON
VFP unit. This patch only enables optimizations for ARMv7.
Official ARM NE10 library page available at
http://projectne10.github.io/Ne10/
To enable this optimization, use
--enable-intrinsics
2015 Feb 04
0
[RFC PATCH v2] armv7(float): Optimize encode usecase using NE10 library
Optimize opus encode (float only) usecase using ARM NE10
library. Mainly effects opus_fft and ctl_mdct_forward
and related functions.
This optimization can be used for ARM CPUs that have NEON
VFP unit. This patch only enables optimizations for ARMv7.
Official ARM NE10 library page available at
http://projectne10.github.io/Ne10/
To enable this optimization, use
--enable-intrinsics
2015 Mar 03
0
[RFC PATCHv3] armv7(float): Optimize encode usecase using NE10 library
Optimize opus encode (float only) usecase using ARM NE10
library. Mainly effects opus_fft and ctl_mdct_forward
and related functions.
This optimization can be used for ARM CPUs that have NEON
VFP unit. This patch only enables optimizations for ARMv7.
Official ARM NE10 library page available at
http://projectne10.github.io/Ne10/
To enable this optimization, use
--enable-intrinsics
2014 Dec 11
2
[ARM][FFT][NEON] Integrate Ne10 into Opus?
Hi everyone,
I am working on Ne10 project. Ne10 provides NEON optimized FFT routines that
are much faster (compared to those without NEON), on most ARMv7-A and all
ARMv8-A devices. How about integrate it into Opus?
I am not familiar with configure script, but I find "Optinal Packages" in
it. If we provides --with-ne10-fft option, the one extra thing that users
need to do is to
2015 Mar 03
2
[RFC PATCHv3] Encode optimize using libNe10
Changes from RFC PATCH v2
- fixed compile issue when just compiling for --enable-intrinsics
for ARMv7 without NE10
- Notes for NE10:
- All compile/link warnings are now in upstream NE10
- Only patch pending upstream in NE10 is the one that
needs to add -funsafe-math-optimizations for ARMv7
targets.
- Phil Wang @ ARM is working on getting this fixed.
- Note that even without
2015 Jan 30
1
[RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
Viswanath Puttagunta wrote:
> Is the peak stack usage a complete blocker in current form?
Since this only affects people who enable NE10, I don't think this is a
blocker.
2014 Dec 10
0
[ARM][FFT][NEON] Integrate Ne10 into Opus?
Hi everyone,
I am working on Ne10 project. Ne10 provides NEON optimized FFT routines that
are much faster (compared to those without NEON), on most ARMv7-A and all
ARMv8-A devices. How about integrate it into Opus?
I am not familiar with configure script, but I find "Optinal Packages" in
it. If we provides --with-ne10-fft option, the one extra thing that users
need to do is to