Displaying 5 results from an estimated 5 matches for "_sparse_".
2011 Mar 08
3
[LLVMdev] Vector select/compare support in LLVM
...general purpose register. Luckily, SSE has the MOVMSKPS instruction, which converts sparse masks to packed masks. I am not sure which representation is better, but both are reasonable. The former may cause register pressure in some cases, while the latter may add the packing-unpacking overhead.
_Sparse_
After my discussion with Duncan, last week, I started working on the promotion of type <4 x i1> to <4 x i32>, and I ran into a problem. It looks like the codegen term ‘promote’ is overloaded. For scalars, the ‘promote’ operation converts scalars to larger bit-width scalars. For vect...
2011 Mar 09
0
[LLVMdev] Vector select/compare support in LLVM
...> I can think of two ways to represent masks in x86: sparse and
> packed. In the sparse method, the masks are kept in <4 x 32bit>
> registers, which are mapped to xmm registers. This is the ‘native’ way
> of using masks.
This argues for the sparse representation, I think.
> _Sparse_ After my discussion with Duncan, last week, I started working
> on the promotion of type <4 x i1> to <4 x i32>, and I ran into a
> problem. It looks like the codegen term ‘promote’ is overloaded.
Heavily. :-/
> For scalars, the ‘promote’ operation converts scalars to large...
2011 Mar 10
2
[LLVMdev] Vector select/compare support in LLVM
...> I can think of two ways to represent masks in x86: sparse and
> packed. In the sparse method, the masks are kept in <4 x 32bit>
> registers, which are mapped to xmm registers. This is the ‘native’ way
> of using masks.
This argues for the sparse representation, I think.
> _Sparse_ After my discussion with Duncan, last week, I started working
> on the promotion of type <4 x i1> to <4 x i32>, and I ran into a
> problem. It looks like the codegen term ‘promote’ is overloaded.
Heavily. :-/
> For scalars, the ‘promote’ operation converts scalars to large...
2011 Mar 10
0
[LLVMdev] Vector select/compare support in LLVM
...ys to represent masks in x86: sparse and
>> packed. In the sparse method, the masks are kept in<4 x 32bit>
>> registers, which are mapped to xmm registers. This is the ‘native’ way
>> of using masks.
>
> This argues for the sparse representation, I think.
>
>> _Sparse_ After my discussion with Duncan, last week, I started working
>> on the promotion of type<4 x i1> to<4 x i32>, and I ran into a
>> problem. It looks like the codegen term ‘promote’ is overloaded.
>
> Heavily. :-/
>
>> For scalars, the ‘promote’ operation...
2011 Mar 10
2
[LLVMdev] Vector select/compare support in LLVM
...> I can think of two ways to represent masks in x86: sparse and
> packed. In the sparse method, the masks are kept in <4 x 32bit>
> registers, which are mapped to xmm registers. This is the ‘native’ way
> of using masks.
This argues for the sparse representation, I think.
> _Sparse_ After my discussion with Duncan, last week, I started working
> on the promotion of type <4 x i1> to <4 x i32>, and I ran into a
> problem. It looks like the codegen term ‘promote’ is overloaded.
Heavily. :-/
> For scalars, the ‘promote’ operation converts scalars to large...