search for: _sparse_

Displaying 5 results from an estimated 5 matches for "_sparse_".

2011 Mar 08
3
[LLVMdev] Vector select/compare support in LLVM
...general purpose register. Luckily, SSE has the MOVMSKPS instruction, which converts sparse masks to packed masks. I am not sure which representation is better, but both are reasonable. The former may cause register pressure in some cases, while the latter may add the packing-unpacking overhead. _Sparse_ After my discussion with Duncan, last week, I started working on the promotion of type <4 x i1> to <4 x i32>, and I ran into a problem.  It looks like the codegen term ‘promote’ is overloaded.  For scalars, the ‘promote’ operation converts scalars to larger bit-width scalars.  For vect...
2011 Mar 09
0
[LLVMdev] Vector select/compare support in LLVM
...> I can think of two ways to represent masks in x86: sparse and > packed. In the sparse method, the masks are kept in <4 x 32bit> > registers, which are mapped to xmm registers. This is the ‘native’ way > of using masks. This argues for the sparse representation, I think. > _Sparse_ After my discussion with Duncan, last week, I started working > on the promotion of type <4 x i1> to <4 x i32>, and I ran into a > problem.  It looks like the codegen term ‘promote’ is overloaded. Heavily. :-/ >  For scalars, the ‘promote’ operation converts scalars to large...
2011 Mar 10
2
[LLVMdev] Vector select/compare support in LLVM
...> I can think of two ways to represent masks in x86: sparse and > packed. In the sparse method, the masks are kept in <4 x 32bit> > registers, which are mapped to xmm registers. This is the ‘native’ way > of using masks. This argues for the sparse representation, I think. > _Sparse_ After my discussion with Duncan, last week, I started working > on the promotion of type <4 x i1> to <4 x i32>, and I ran into a > problem.  It looks like the codegen term ‘promote’ is overloaded. Heavily. :-/ >  For scalars, the ‘promote’ operation converts scalars to large...
2011 Mar 10
0
[LLVMdev] Vector select/compare support in LLVM
...ys to represent masks in x86: sparse and >> packed. In the sparse method, the masks are kept in<4 x 32bit> >> registers, which are mapped to xmm registers. This is the ‘native’ way >> of using masks. > > This argues for the sparse representation, I think. > >> _Sparse_ After my discussion with Duncan, last week, I started working >> on the promotion of type<4 x i1> to<4 x i32>, and I ran into a >> problem. It looks like the codegen term ‘promote’ is overloaded. > > Heavily. :-/ > >> For scalars, the ‘promote’ operation...
2011 Mar 10
2
[LLVMdev] Vector select/compare support in LLVM
...> I can think of two ways to represent masks in x86: sparse and > packed. In the sparse method, the masks are kept in <4 x 32bit> > registers, which are mapped to xmm registers. This is the ‘native’ way > of using masks. This argues for the sparse representation, I think. > _Sparse_ After my discussion with Duncan, last week, I started working > on the promotion of type <4 x i1> to <4 x i32>, and I ran into a > problem.  It looks like the codegen term ‘promote’ is overloaded. Heavily. :-/ >  For scalars, the ‘promote’ operation converts scalars to large...