Displaying 7 results from an estimated 7 matches for "elt_inner_prod".
2017 Jun 06
3
celt_inner_prod() and dual_inner_prod() NEON intrinsics
...linfengz at google.com
> <mailto:linfengz at google.com>> wrote:
>
> Hi Jean-Marc,
>
> I attached the new version in inner_prod_5patches_v2.zip, which is
> synced to the current master.
>
> For fixed-point ARM, only
> 0003-Optimize-fixed-point-celt_inner_prod-and-dual_inner_.patch
> changes the performance.
> For floating-point ARM, only
> 0004-Optimize-floating-point-celt_inner_prod-and-dual_inn.patch
> changes the performance.
> Patches 1 and 2 are code clean-up and can onl...
2017 Jun 06
4
Antw: Re: celt_inner_prod() and dual_inner_prod() NEON intrinsics
...wrote:
>> >
>> > Hi Jean-Marc,
>> >
>> > I attached the new version in inner_prod_5patches_v2.zip, which is
>> > synced to the current master.
>> >
>> > For fixed-point ARM, only
>> > 0003-Optimize-fixed-point-celt_inner_prod-and-dual_inner_.patch
>> > changes the performance.
>> > For floating-point ARM, only
>> > 0004-Optimize-floating-point-celt_inner_prod-and-dual_inn.patch
>> > changes the performance.
>> >...
2017 Jun 05
4
celt_inner_prod() and dual_inner_prod() NEON intrinsics
Hi Jean-Marc,
I attached the new version in inner_prod_5patches_v2.zip, which is synced to
the current master.
For fixed-point ARM, only
0003-Optimize-fixed-point-celt_inner_prod-and-dual_inner_.patch
changes the performance.
For floating-point ARM, only
0004-Optimize-floating-point-celt_inner_prod-and-dual_inn.patch
changes the performance.
Patches 1 and 2 are code clean-up and can only affect x86 performance.
Patch 5 has a negligible effect on floating-point ARM performance.
Thanks,
Linfeng
On Fri, Jun 2, 2017 at 11:26 AM, Jean-Marc Valin <jmvalin at jmvalin.ca> wrote:
> Hi Linfeng...
2017 Jun 06
0
celt_inner_prod() and dual_inner_prod() NEON intrinsics
...o:linfengz at google.com>> wrote:
> >
> > Hi Jean-Marc,
> >
> > I attached the new version in inner_prod_5patches_v2.zip, which is
> > synced to the current master.
> >
> > For fixed-point ARM, only
> > 0003-Optimize-fixed-point-celt_inner_prod-and-dual_inner_.patch
> > changes the performance.
> > For floating-point ARM, only
> > 0004-Optimize-floating-point-celt_inner_prod-and-dual_inn.patch
> > changes the performance.
> > Patches 1 and 2 are...
2017 Jun 06
0
celt_inner_prod() and dual_inner_prod() NEON intrinsics
Thanks, Ulrich!
Yes, using
celt_assert(1.0 + celt_inner_prod_neon_float_c_simulation(x, y, N) == 1.0 + xy);
celt_assert(1.0 + xy1_c == 1.0 + *xy1);
celt_assert(1.0 + xy2_c == 1.0 + *xy2);
can avoid the use of VERY_SMALL.
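A minimal sketch of the comparison idiom quoted above, assuming the intent is that adding 1.0 to both sides absorbs differences far below the rounding step near 1.0, so that no explicit tolerance constant is needed. The helper names equal_after_adding_one() and check_matches_reference() are hypothetical and not part of Opus; plain assert() stands in for celt_assert().

```c
#include <assert.h>

/* Hypothetical helper (not an Opus function): compares two values after
   adding 1.0 to each. Because 1.0 is a double literal, both sums are
   evaluated in at least double precision, and any part of a or b that is
   far below the rounding step near 1.0 is rounded away. */
static int equal_after_adding_one(float a, float b)
{
    const double lhs = 1.0 + a;
    const double rhs = 1.0 + b;
    return lhs == rhs;
}

/* Stand-in for the celt_assert() checks in the message, using assert(). */
static void check_matches_reference(float optimized, float reference)
{
    assert(equal_after_adding_one(optimized, reference));
}
```

With this sketch, two tiny values such as 1e-30f and 2e-30f compare as equal, while 0.5f and 0.25f do not. Note that the comparison remains exact for values of ordinary magnitude, so it only tolerates differences near zero.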
Hi Jean-Marc,
I added
{
const opus_val32 xy_c = celt_inner_prod_neon_float_c_simulation(x,
y, N);...
2017 Jun 05
0
celt_inner_prod() and dual_inner_prod() NEON intrinsics
...I attached the new version in inner_prod_5patches_v2.zip, which is synced to
> the current master.
>
> For fixed-point ARM, only
> 0003-Optimize-fixed-point-celt_inner_prod-and-dual_inner_.patch
> changes the performance.
> For floating-point ARM, only
> 0004-Optimize-floating-point-celt_inner_prod-and-dual_inn.patch
> changes the performance.
> Patches 1 and 2 are code clean-up and can only affect x86 performance.
> Patch 5 has a negligible effect on floating-point ARM performance.
>
> Thanks,
> Linfeng
>
> On Fri, Jun 2, 2017 at 11:26 AM, Jean-Marc Valin <jmvalin at jmval...
2017 Jun 01
4
celt_inner_prod() and dual_inner_prod() NEON intrinsics
Hi,
Attached are 5 patches related to celt_inner_prod() and dual_inner_prod()
NEON intrinsics optimization.
In 0004-Optimize-floating-point-celt_inner_prod-and-dual_inn.patch, the
optimization changed the order in which the floating-point products are
summed, which will change the results. I created
celt_inner_prod_neon_float_c_simulation()
and dual_inner_prod_neon...
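To illustrate why reordering floating-point additions changes an inner product, here is a small sketch in plain C. It is not the code from the patches: inner_prod_sequential() and inner_prod_four_lanes() are hypothetical names, and the four-accumulator layout and final reduction order are assumptions about how a 4-wide SIMD loop might sum its lanes.

```c
#include <stddef.h>

/* Sequential reference: one accumulator, products added in index order. */
static float inner_prod_sequential(const float *x, const float *y, size_t n)
{
    float sum = 0.0f;
    size_t i;
    for (i = 0; i < n; i++) {
        sum += x[i] * y[i];
    }
    return sum;
}

/* Hypothetical lane-wise version: four partial sums, as a 4-wide SIMD loop
   would keep, reduced at the end. The reduction order used here is an
   assumption; a real NEON implementation may reduce differently. */
static float inner_prod_four_lanes(const float *x, const float *y, size_t n)
{
    float lane[4] = { 0.0f, 0.0f, 0.0f, 0.0f };
    float sum;
    size_t i;
    for (i = 0; i + 4 <= n; i += 4) {
        lane[0] += x[i + 0] * y[i + 0];
        lane[1] += x[i + 1] * y[i + 1];
        lane[2] += x[i + 2] * y[i + 2];
        lane[3] += x[i + 3] * y[i + 3];
    }
    /* Add the low and high halves first, then the two remaining sums. */
    sum = (lane[0] + lane[2]) + (lane[1] + lane[3]);
    /* Leftover elements when n is not a multiple of 4. */
    for (; i < n; i++) {
        sum += x[i] * y[i];
    }
    return sum;
}
```

For x = {1e8f, 1.0f, -1e8f, 1.0f} and y = {1.0f, 1.0f, 1.0f, 1.0f}, with IEEE 754 single-precision evaluation the sequential loop returns 1 (the first 1.0f is lost when added to 1e8f), while the lane-wise version returns 2. This is why a C routine that mirrors the optimized summation order is useful as a bit-exact reference for testing.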