Displaying 15 results from an estimated 15 matches for "silk_warped_autocorrelation_fix_c".
2017 Jun 02
0
[PATCH] Don't use MAY_HAVE_NEON in arm_silk_map.c.
.../* Media */
- MAY_HAVE_NEON(silk_NSQ_del_dec), /* Neon */
+ silk_NSQ_del_dec_neon, /* Neon */
};
/*There is no table for silk_noise_shape_quantizer_short_prediction because the
@@ -115,7 +115,7 @@ void (*const SILK_WARPED_AUTOCORRELATION_FIX_IMPL[OPUS_ARCHMASK + 1])(
silk_warped_autocorrelation_FIX_c, /* ARMv4 */
silk_warped_autocorrelation_FIX_c, /* EDSP */
silk_warped_autocorrelation_FIX_c, /* Media */
- MAY_HAVE_NEON(silk_warped_autocorrelation_FIX), /* Neon */
+ silk_warped_autocorrelation_FIX_neon, /* Neon */
};
# endif...
2017 Jun 02
2
Opus floating-point NEON jump table question
Thanks, Jonathan!
I'll fix the MAY_HAVE_NEON() in silk/arm/arm_silk_map.c.
Linfeng
On Thu, Jun 1, 2017 at 3:34 PM, Jonathan Lennox <jonathan at vidyo.com> wrote:
> Semantically, OPUS_ARM_MAY_HAVE_NEON is supposed to mean the compiler
> supports, and the CPU may support, Neon assembly code, which isn’t
> necessarily the same thing as the compiler supporting Neon intrinsics.
>
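The distinction above (assembly support versus intrinsics support) matters for a per-architecture jump table, because every slot has to name a function that actually exists in the build. Below is a minimal sketch of that idea, not the Opus code itself: DEMO_ARCHMASK, DEMO_HAVE_NEON_INTR, DEMO_NEON_SLOT, kernel_c, kernel_neon and demo_kernel are all hypothetical names invented for illustration, and kernel_neon is plain C standing in for an intrinsics version.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins; none of these are real Opus macros or functions. */
#define DEMO_ARCHMASK 3        /* four slots: ARMv4, EDSP, Media, Neon */
#define DEMO_HAVE_NEON_INTR 1  /* set to 0 if the compiler lacks Neon intrinsics */

typedef int (*kernel_fn)(const short *x, int len);

/* Portable reference implementation: sum of the samples. */
static int kernel_c(const short *x, int len)
{
    int i, sum = 0;
    for (i = 0; i < len; i++) sum += x[i];
    return sum;
}

#if DEMO_HAVE_NEON_INTR
/* Placeholder for an intrinsics version; plain C here so the sketch builds anywhere. */
static int kernel_neon(const short *x, int len)
{
    int i, sum = 0;
    for (i = len - 1; i >= 0; i--) sum += x[i];
    return sum;
}
#define DEMO_NEON_SLOT kernel_neon
#else
/* Without intrinsics, the Neon slot falls back to the C version. */
#define DEMO_NEON_SLOT kernel_c
#endif

/* One entry per architecture level, indexed by the run-time arch value. */
static kernel_fn const DEMO_KERNEL_IMPL[DEMO_ARCHMASK + 1] = {
    kernel_c,       /* ARMv4 */
    kernel_c,       /* EDSP */
    kernel_c,       /* Media */
    DEMO_NEON_SLOT  /* Neon */
};

/* Dispatch through the table; masking keeps the index in range. */
static int demo_kernel(const short *x, int len, int arch)
{
    assert(x != NULL && len >= 0);
    return DEMO_KERNEL_IMPL[arch & DEMO_ARCHMASK](x, len);
}
```

Calling demo_kernel(x, len, arch) with the detected arch value picks the slot at run time, while the compile-time switch decides which function the Neon slot refers to.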
2017 Feb 06
2
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
Hi Jean-Marc,
Thanks a lot for reviewing this huge assembly function!
silk_warped_autocorrelation_FIX_c()'s kernel part is
    for( n = 0; n < length; n++ ) {
        tmp1_QS = silk_LSHIFT32( (opus_int32)input[ n ], QS );
        /* Loop over allpass sections */
        for( i = 0; i < order; i++ ) {
            /* Output of allpass section */
            tmp2_QS = silk_SMLAWB( state_QS[ i...
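Since the kernel quoted above is cut off, here is a floating-point sketch of the general warped-autocorrelation recurrence for readers who want to see the whole loop shape. It is an illustration under stated assumptions, not the fixed-point Opus function: warped_autocorr_sketch and DEMO_MAX_ORDER are hypothetical names, doubles replace the Q-format arithmetic, and everything after the allpass output line is assumed from the standard recurrence rather than taken from the truncated snippet.

```c
#include <assert.h>
#include <stddef.h>

#define DEMO_MAX_ORDER 24  /* hypothetical upper bound, chosen for this sketch */

/* Floating-point sketch; corr must have room for order + 1 values. */
static void warped_autocorr_sketch(double *corr, const short *input,
                                   double warping, int length, int order)
{
    double state[DEMO_MAX_ORDER + 1] = { 0.0 };
    double tmp1, tmp2;
    int n, i;

    assert(corr != NULL && input != NULL);
    assert(length >= 0 && order >= 0 && order <= DEMO_MAX_ORDER);

    for (i = 0; i <= order; i++) corr[i] = 0.0;

    for (n = 0; n < length; n++) {
        tmp1 = (double)input[n];
        /* Loop over allpass sections */
        for (i = 0; i < order; i++) {
            /* Output of allpass section i */
            tmp2 = state[i] + warping * (state[i + 1] - tmp1);
            state[i] = tmp1;
            /* Correlate the current sample with the signal entering section i */
            corr[i] += state[0] * tmp1;
            tmp1 = tmp2;
        }
        state[order] = tmp1;
        corr[order] += state[0] * tmp1;
    }
}
```

With warping set to 0 each allpass section degenerates to a one-sample delay, so the routine reduces to an ordinary autocorrelation; for the input {1, 2, 3} and order 2 it yields 14, 8 and 3.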
2016 Jul 01
1
silk_warped_autocorrelation_FIX() NEON optimization
Hi all,
I'm sending the patch "Optimize silk_warped_autocorrelation_FIX() for ARM NEON" in a separate email.
It is based on Tim’s aarch64v8 branch https://git.xiph.org/?p=users/tterribe/opus.git;a=shortlog;h=refs/heads/aarch64v8
Thanks for your comments.
Linfeng
2017 Jan 31
6
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
Hi,
Attached is a patch with arm neon optimizations for
silk_warped_autocorrelation_FIX(). Please review.
Thanks,
Felicia
2017 Feb 07
2
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...That's why the prolog and epilog cannot be saved to the best of my
> > knowledge.
> >
> > The assembly size of silk_warped_autocorrelation_FIX_neon() is about
> > 2,744 bytes. Compared with the C code size (about 452 bytes), it's 2.3
> > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the second
> > place CPU heavy function in fixed-point, and our testing shows up to 7%
> > CPU run time saving of the total encoder with this optimization (at
> > Complexity 8), maybe we can take the I-cache burden even if finally we
> > still cannot remove the big chun...
2017 Feb 07
3
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...o the best of my
> > > knowledge.
> > >
> > > The assembly size of silk_warped_autocorrelation_FIX_neon() is
> about
> > > 2,744 bytes. Compared with the C code size (about 452 bytes), it's
> 2.3
> > > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the
> > second
> > > place CPU heavy function in fixed-point, and our testing shows up
> > to 7%
> > > CPU run time saving of the total encoder with this optimization (at
> > > Complexity 8), maybe we can take the I-cache burden even i...
2017 Feb 06
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...ill be wrong.
>
> That's why the prolog and epilog cannot be saved to the best of my
> knowledge.
>
> The assembly size of silk_warped_autocorrelation_FIX_neon() is about
> 2,744 bytes. Compared with the C code size (about 452 bytes), it's 2.3
> KB larger. Considering silk_warped_autocorrelation_FIX_c() is the second
> place CPU heavy function in fixed-point, and our testing shows up to 7%
> CPU run time saving of the total encoder with this optimization (at
> Complexity 8), maybe we can take the I-cache burden even if finally we
> still cannot remove the big chunk of prolog and epil...
2017 Apr 05
2
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...>>> > >
>>> > > The assembly size of silk_warped_autocorrelation_FIX_neon() is
>>> about
>>> > > 2,744 bytes. Compared with the C code size (about 452 bytes),
>>> it's 2.3
>>> > > KB larger. Considering silk_warped_autocorrelation_FIX_c() is
>>> the
>>> > second
>>> > > place CPU heavy function in fixed-point, and our testing shows up
>>> > to 7%
>>> > > CPU run time saving of the total encoder with this optimization
>>> (at
>>> >...
2017 Feb 07
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...prolog and epilog cannot be saved to the best of my
> > knowledge.
> >
> > The assembly size of silk_warped_autocorrelation_FIX_neon() is about
> > 2,744 bytes. Compared with the C code size (about 452 bytes), it's 2.3
> > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the
> second
> > place CPU heavy function in fixed-point, and our testing shows up
> to 7%
> > CPU run time saving of the total encoder with this optimization (at
> > Complexity 8), maybe we can take the I-cache burden even if finally we
> >...
2017 Apr 03
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...> > knowledge.
>> > >
>> > > The assembly size of silk_warped_autocorrelation_FIX_neon() is
>> about
>> > > 2,744 bytes. Compared with the C code size (about 452 bytes),
>> it's 2.3
>> > > KB larger. Considering silk_warped_autocorrelation_FIX_c() is the
>> > second
>> > > place CPU heavy function in fixed-point, and our testing shows up
>> > to 7%
>> > > CPU run time saving of the total encoder with this optimization
>> (at
>> > > Complexity 8), maybe we can t...
2016 Jul 14
6
Several patches of ARM NEON optimization
I rebased my previous 3 patches to the current master with minor changes.
Patches 1 to 3 replace all my previous submitted patches.
Patches 4 and 5 are new.
Thanks,
Linfeng Zhang
2017 Apr 05
4
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...ssembly size of
> > silk_warped_autocorrelation_FIX_neon() is about
> > > > 2,744 bytes. Compared with the C code size (about
> > 452 bytes), it's 2.3
> > > > KB larger. Considering
> > silk_warped_autocorrelation_FIX_c() is the
> > > second
> > > > place CPU heavy function in fixed-point, and our
> > testing shows up
> > > to 7%
> > > > CPU run time saving of the total encoder with this
>...
2017 Apr 05
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...> > The assembly size of
> silk_warped_autocorrelation_FIX_neon() is about
> > > 2,744 bytes. Compared with the C code size (about
> 452 bytes), it's 2.3
> > > KB larger. Considering
> silk_warped_autocorrelation_FIX_c() is the
> > second
> > > place CPU heavy function in fixed-point, and our
> testing shows up
> > to 7%
> > > CPU run time saving of the total encoder with this
> optimization...
2017 Apr 06
0
[PATCH] Optimize silk_warped_autocorrelation_FIX() for ARM NEON
...> silk_warped_autocorrelation_FIX_neon() is about
> > > > 2,744 bytes. Compared with the C code size (about
> > 452 bytes), it's 2.3
> > > > KB larger. Considering
> > silk_warped_autocorrelation_FIX_c() is the
> > > second
> > > > place CPU heavy function in fixed-point, and our
> > testing shows up
> > > to 7%
> > > > CPU run time saving of the total en...