search for: input_array_ptr0

Displaying 5 results from an estimated 5 matches for "input_array_ptr0".

Did you mean: input_array_ptr1
2010 May 29
3
[LLVMdev] Vectorized LLVM IR
...be computed in separated loops that can be auto-vectorized by some compilers like Intel ICC, this works quite well...) The scalar LLVM code is : define void @llvm_compute(%struct.llvm_dsp* %obj, i32 %count, float** noalias %inputs, float** noalias %outputs) nounwind readnone ssp { entry: %input_array_ptr0 = getelementptr inbounds float** %inputs, i64 0 %input0 = load float** %input_array_ptr0, align 8 %input_array_ptr1 = getelementptr inbounds float** %inputs, i64 1 %input1 = load float** %input_array_ptr1, align 8 %input_array_ptr2 = getelementptr inbounds float** %inputs, i64 2...
2010 May 29
0
[LLVMdev] Vectorized LLVM IR
...be auto-vectorized by some compilers like Intel ICC, this works quite well...) > > The scalar LLVM code is : > > define void @llvm_compute(%struct.llvm_dsp*  %obj, i32 %count, float** noalias %inputs, float** noalias %outputs) nounwind readnone ssp { >        entry: >            %input_array_ptr0 = getelementptr inbounds float** %inputs, i64 0 >            %input0 = load float** %input_array_ptr0, align 8 >            %input_array_ptr1 = getelementptr inbounds float** %inputs, i64 1 >            %input1 = load float** %input_array_ptr1, align 8 >            %input_array_ptr2 = g...
2010 May 29
1
[LLVMdev] Vectorized LLVM IR
...ome compilers like Intel ICC, this works quite well...) >> >> The scalar LLVM code is : >> >> define void @llvm_compute(%struct.llvm_dsp*  %obj, i32 %count, float** noalias %inputs, float** noalias %outputs) nounwind readnone ssp { >>        entry: >>            %input_array_ptr0 = getelementptr inbounds float** %inputs, i64 0 >>            %input0 = load float** %input_array_ptr0, align 8 >>            %input_array_ptr1 = getelementptr inbounds float** %inputs, i64 1 >>            %input1 = load float** %input_array_ptr1, align 8 >>            %inpu...
2010 May 28
0
[LLVMdev] Vectorized LLVM IR
Hi Stéphane, The SSE support is the LLVM backend is fine. What is the code that's generated? Do you have some short examples of where LLVM doesn't do as well as the equivalent scalar code? -bw On May 28, 2010, at 12:13 PM, Stéphane Letz wrote: > Hi, > > We are experimenting directly generating vectorized LLVM IR (using <8 x float> kind of types), then compiling the code
2010 May 28
3
[LLVMdev] Vectorized LLVM IR
Hi, We are experimenting directly generating vectorized LLVM IR (using <8 x float> kind of types), then compiling the code to SSE on a 64 bits machine. Right now the equivalent code in scalar mode sill outperform the SSE one. What is the quality of the SSE support in X86 LLVL backend? Are they any specific things to be aware of to improve the speed? Thanks Stéphane Letz