Displaying 5 results from an estimated 5 matches for "vec_st".
2005 Jan 29
4
A couple of points about flac 1.1.1 on ppc/linux/altivec
On Thu, 27 Jan 2005, John Steele Scott wrote:
> That looks fine to me as well. However, the best solution is something which
> Luca suggested a few months ago, which is to use the functions defined in
> altivec.h. These are C functions which map directly to Altivec machine
> instructions. I am willing to help out, but I don't find the current lpc_asm.s
> very easy to follow, and
2004 Aug 06
0
[PATCH] Make SSE Run Time option.
...= vec_ld( 15, b );
a0 = vec_perm( a0, a1, vec_lvsl( 0, a ) );
b0 = vec_perm( b0, b1, vec_lvsl( 0, b ) );
a0 = vec_madd( a0, b0, (vector float) vec_splat_u32(0) ) ;
a0 = vec_add( a0, vec_sld( a0, a0, 8 ) );
a0 = vec_add( a0, vec_sld( a0, a0, 4 ) );
vec_ste( a0, 0, &sum );
return sum;
Please note that dot products of simple vector floats are usually faster
in the scalar units. The add across and transfer to scalar is just too
expensive. It's generally only worthwhile if the data starts and ends in
the vector units, and it is inlined so th...
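As a rough illustration of the trade-off described in this post, here is a portable C sketch of the scalar baseline next to an emulation of the "add across" step. The helper names (dot_scalar, rotate_add, sum_across) are hypothetical and do not come from the patch. rotate_add only mimics, in plain C, what v + vec_sld(v, v, n) does to a 4-float vector, assuming that vec_sld with both operands equal acts as a rotate by n bytes; it is not AltiVec code and says nothing about actual timings.

```c
#include <stddef.h>

/* Scalar baseline: the form the post says is usually faster for
   simple float vectors. */
static float dot_scalar(const float *a, const float *b, size_t n)
{
    float sum = 0.0f;
    for (size_t i = 0; i < n; i++)
        sum += a[i] * b[i];
    return sum;
}

/* Plain-C emulation (an assumption for illustration) of
   v = vec_add(v, vec_sld(v, v, 4 * lanes)) on a 4-float vector:
   each lane is added to the lane `lanes` positions further on,
   wrapping around. */
static void rotate_add(float v[4], int lanes)
{
    float r[4];
    for (int i = 0; i < 4; i++)
        r[i] = v[i] + v[(i + lanes) % 4];
    for (int i = 0; i < 4; i++)
        v[i] = r[i];
}

/* The "add across": two rotate-and-add steps leave the full sum in
   every lane, so any single lane can then be stored to a scalar. */
static float sum_across(const float in[4])
{
    float v[4] = { in[0], in[1], in[2], in[3] };
    rotate_add(v, 2);   /* corresponds to vec_sld(v, v, 8) */
    rotate_add(v, 1);   /* corresponds to vec_sld(v, v, 4) */
    return v[0];
}
```

The two extra rotate-and-add steps, plus the store to memory and reload as a scalar, are the overhead the post refers to; the scalar loop avoids them entirely.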
2004 Aug 06
6
[PATCH] Make SSE Run Time option.
...vec_ld(0, b);
b += 4;
vec_result = vec_madd(vec_a, vec_b, vec_result);
}
}
vec_result = vec_add(vec_result, vec_sld(vec_result, vec_result, 8));
vec_result = vec_add(vec_result, vec_sld(vec_result, vec_result, 4));
vec_ste(vec_result, 0, &sum);
return sum;
2005 Dec 02
0
run time assembler patch for altivec, sse + bug fixes
...aska);
vec_b = vec_ld(0, b);
b += 4;
vec_result = vec_madd(vec_a, vec_b, vec_result);
}
}
vec_result = vec_add(vec_result, vec_sld(vec_result, vec_result, 8));
vec_result = vec_add(vec_result, vec_sld(vec_result, vec_result, 4));
vec_ste(vec_result, 0, &sum);
return sum;
}
#endif
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ltp_altivec.h
Type: application/octet-stream
Size: 1786 bytes
Desc: not available
Url : http://lists.xiph.org/pipermail/speex-dev/attachments/20051202/46c05f55/l...
2004 Aug 06
2
[PATCH] Make SSE Run Time option. Add Win32 SSE code
Jean-Marc,
>I'm still not sure I get it. On an Athlon XP, I can do something like
>"mulps xmm0, xmm1", which means that the xmm registers are indeed
>supported. Besides, without the xmm registers, you can't use much of
>SSE.
On the Athlon XP 2400+ that we have in our QA lab (Win2000), if you run
that code it generates an Illegal Instruction error. In addition,