thr3ads.net - search: "_sparse

[LLVMdev] Vector select/compare support in LLVM

2011 Mar 08

3

[LLVMdev] Vector select/compare support in LLVM

...general purpose register. Luckily, SSE has the MOVMSKPS instruction, which converts sparse masks to packed masks. I am not sure which representation is better, but both are reasonable. The former may cause register pressure in some cases, while the latter may add the packing-unpacking overhead. _Sparse_ After my discussion with Duncan, last week, I started working on the promotion of type <4 x i1> to <4 x i32>, and I ran into a problem. It looks like the codegen term ‘promote’ is overloaded. For scalars, the ‘promote’ operation converts scalars to larger bit-width scalars. For vect...

[LLVMdev] Vector select/compare support in LLVM

2011 Mar 09

0

[LLVMdev] Vector select/compare support in LLVM

...> I can think of two ways to represent masks in x86: sparse and > packed. In the sparse method, the masks are kept in <4 x 32bit> > registers, which are mapped to xmm registers. This is the ‘native’ way > of using masks. This argues for the sparse representation, I think. > _Sparse_ After my discussion with Duncan, last week, I started working > on the promotion of type <4 x i1> to <4 x i32>, and I ran into a > problem. It looks like the codegen term ‘promote’ is overloaded. Heavily. :-/ > For scalars, the ‘promote’ operation converts scalars to large...

[LLVMdev] Vector select/compare support in LLVM

2011 Mar 10

2

[LLVMdev] Vector select/compare support in LLVM

...> I can think of two ways to represent masks in x86: sparse and > packed. In the sparse method, the masks are kept in <4 x 32bit> > registers, which are mapped to xmm registers. This is the ‘native’ way > of using masks. This argues for the sparse representation, I think. > _Sparse_ After my discussion with Duncan, last week, I started working > on the promotion of type <4 x i1> to <4 x i32>, and I ran into a > problem. It looks like the codegen term ‘promote’ is overloaded. Heavily. :-/ > For scalars, the ‘promote’ operation converts scalars to large...

[LLVMdev] Vector select/compare support in LLVM

2011 Mar 10

0

[LLVMdev] Vector select/compare support in LLVM

...ys to represent masks in x86: sparse and >> packed. In the sparse method, the masks are kept in<4 x 32bit> >> registers, which are mapped to xmm registers. This is the ‘native’ way >> of using masks. > > This argues for the sparse representation, I think. > >> _Sparse_ After my discussion with Duncan, last week, I started working >> on the promotion of type<4 x i1> to<4 x i32>, and I ran into a >> problem. It looks like the codegen term ‘promote’ is overloaded. > > Heavily. :-/ > >> For scalars, the ‘promote’ operation...

[LLVMdev] Vector select/compare support in LLVM

2011 Mar 10

2

[LLVMdev] Vector select/compare support in LLVM

...> I can think of two ways to represent masks in x86: sparse and > packed. In the sparse method, the masks are kept in <4 x 32bit> > registers, which are mapped to xmm registers. This is the ‘native’ way > of using masks. This argues for the sparse representation, I think. > _Sparse_ After my discussion with Duncan, last week, I started working > on the promotion of type <4 x i1> to <4 x i32>, and I ran into a > problem. It looks like the codegen term ‘promote’ is overloaded. Heavily. :-/ > For scalars, the ‘promote’ operation converts scalars to large...

search for: _sparse_