search for: silk_warped_autocorrelation_fix_c

Displaying 15 results from an estimated 15 matches for "silk_warped_autocorrelation_fix_c".

2017 Jun 02
0
[PATCH] Don't use MAY_HAVE_NEON in arm_silk_map.c.
.../* Media */ - MAY_HAVE_NEON(silk_NSQ_del_dec), /* Neon */ + silk_NSQ_del_dec_neon, /* Neon */ }; /*There is no table for silk_noise_shape_quantizer_short_prediction because the @@ -115,7 +115,7 @@ void (*const SILK_WARPED_AUTOCORRELATION_FIX_IMPL[OPUS_ARCHMASK + 1])( silk_warped_autocorrelation_FIX_c, /* ARMv4 */ silk_warped_autocorrelation_FIX_c, /* EDSP */ silk_warped_autocorrelation_FIX_c, /* Media */ - MAY_HAVE_NEON(silk_warped_autocorrelation_FIX), /* Neon */ + silk_warped_autocorrelation_FIX_neon, /* Neon */ }; # endif...
2017 Jun 02
2
Opus floating-point NEON jump table question
Thank Jonathan! I'll fix the MAY_HAVE_NEON() in silk/arm/arm_silk_map.c Linfeng On Thu, Jun 1, 2017 at 3:34 PM, Jonathan Lennox <jonathan at vidyo.com> wrote: > Semantically, OPUS_ARM_MAY_HAVE_NEON is supposed to mean the compiler > supports, and the CPU may support, Neon assembly code, which isn’t > necessarily the same thing as the compiler supporting Neon intrinsics. >
2017 Feb 06
2
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
Hi Jean-Marc, Thanks a lot for reviewing this huge assembly function! silk_warped_autocorrelation_FIX_c()'s kernel part is for( n = 0; n < length; n++ ) { tmp1_QS = silk_LSHIFT32( (opus_int32)input[ n ], QS ); /* Loop over allpass sections */ for( i = 0; i < order; i++ ) { /* Output of allpass section */ tmp2_QS = silk_SMLAWB( state_QS[ i...
2016 Jul 01
1
silk_warped_autocorrelation_FIX() NEON optimization
Hi all, I'm sending patch "Optimize silk_warped_autocorrelation_FIX() for ARM NEON" in an separate email. It is based on Tim’s aarch64v8 branch https://git.xiph.org/?p=users/tterribe/opus.git;a=shortlog;h=refs/heads/aarch64v8 Thanks for your comments. Linfeng
2017 Jan 31
6
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
Hi, Attached is a patch with arm neon optimizations for silk_warped_autocorrelation_FIX(). Please review. Thanks, Felicia -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xiph.org/pipermail/opus/attachments/20170131/9a912bb4/attachment-0001.html> -------------- next part -------------- A non-text attachment was scrubbed... Name:
2017 Feb 07
2
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...That's why the prolog and epilog cannot be saved to the best of my > > knowledge. > > > > The assembly size of silk_warped_autocorrelation_FIX_neon() is about > > 2,744 bytes. Compared with the C code size (about 452 bytes), it's 2.3 > > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the second > > place CPU heavy function in fixed-point, and our testing shows up to 7% > > CPU run time saving of the total encoder with this optimization (at > > Complexity 8), maybe we can take the I-cache burden even if finally we > > still cannot remove the big chun...
2017 Feb 07
3
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...o the best of my > > > knowledge. > > > > > > The assembly size of silk_warped_autocorrelation_FIX_neon() is > about > > > 2,744 bytes. Compared with the C code size (about 452 bytes), it's > 2.3 > > > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the > > second > > > place CPU heavy function in fixed-point, and our testing shows up > > to 7% > > > CPU run time saving of the total encoder with this optimization (at > > > Complexity 8), maybe we can take the I-cache burden even i...
2017 Feb 06
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...ill be wrong. > > That's why the prolog and epilog cannot be saved to the best of my > knowledge. > > The assembly size of silk_warped_autocorrelation_FIX_neon() is about > 2,744 bytes. Compared with the C code size (about 452 bytes), it's 2.3 > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the second > place CPU heavy function in fixed-point, and our testing shows up to 7% > CPU run time saving of the total encoder with this optimization (at > Complexity 8), maybe we can take the I-cache burden even if finally we > still cannot remove the big chunk of prolog and epil...
2017 Apr 05
2
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...gt;>> > > >>> > > The assembly size of silk_warped_autocorrelation_FIX_neon() is >>> about >>> > > 2,744 bytes. Compared with the C code size (about 452 bytes), >>> it's 2.3 >>> > > KB larger. Considering silk_warped_autocorrelation_FIX_c() is >>> the >>> > second >>> > > place CPU heavy function in fixed-point, and our testing shows up >>> > to 7% >>> > > CPU run time saving of the total encoder with this optimization >>> (at >>> >...
2017 Feb 07
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...prolog and epilog cannot be saved to the best of my > > knowledge. > > > > The assembly size of silk_warped_autocorrelation_FIX_neon() is about > > 2,744 bytes. Compared with the C code size (about 452 bytes), it's 2.3 > > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the > second > > place CPU heavy function in fixed-point, and our testing shows up > to 7% > > CPU run time saving of the total encoder with this optimization (at > > Complexity 8), maybe we can take the I-cache burden even if finally we > &gt...
2017 Apr 03
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...t; > knowledge. >> > > >> > > The assembly size of silk_warped_autocorrelation_FIX_neon() is >> about >> > > 2,744 bytes. Compared with the C code size (about 452 bytes), >> it's 2.3 >> > > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the >> > second >> > > place CPU heavy function in fixed-point, and our testing shows up >> > to 7% >> > > CPU run time saving of the total encoder with this optimization >> (at >> > > Complexity 8), maybe we can t...
2016 Jul 14
6
Several patches of ARM NEON optimization
I rebased my previous 3 patches to the current master with minor changes. Patches 1 to 3 replace all my previous submitted patches. Patches 4 and 5 are new. Thanks, Linfeng Zhang
2017 Apr 05
4
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...ssembly size of > > silk_warped_autocorrelation_FIX_neon() is about > > > > 2,744 bytes. Compared with the C code size (about > > 452 bytes), it's 2.3 > > > > KB larger. Considering > > silk_warped_autocorrelation_FIX_c() is the > > > second > > > > place CPU heavy function in fixed-point, and our > > testing shows up > > > to 7% > > > > CPU run time saving of the total encoder with this >...
2017 Apr 05
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...> > The assembly size of > silk_warped_autocorrelation_FIX_neon() is about > > > 2,744 bytes. Compared with the C code size (about > 452 bytes), it's 2.3 > > > KB larger. Considering > silk_warped_autocorrelation_FIX_c() is the > > second > > > place CPU heavy function in fixed-point, and our > testing shows up > > to 7% > > > CPU run time saving of the total encoder with this > optimization...
2017 Apr 06
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...> silk_warped_autocorrelation_FIX_neon() is about > > > > 2,744 bytes. Compared with the C code size (about > > 452 bytes), it's 2.3 > > > > KB larger. Considering > > silk_warped_autocorrelation_FIX_c() is the > > > second > > > > place CPU heavy function in fixed-point, and our > > testing shows up > > > to 7% > > > > CPU run time saving of the total en...