search for: vecinit5

Displaying 7 results from an estimated 7 matches for "vecinit5".

2012 Feb 28
1
[LLVMdev] How to vectorize a vector type cast?
...= insertelement <4 x float> undef, float %conv, i32 0 %2 = extractelement <4 x i8> %0, i32 1 %conv2 = uitofp i8 %2 to float %vecinit3 = insertelement <4 x float> %vecinit, float %conv2, i32 1 %3 = extractelement <4 x i8> %0, i32 2 %conv4 = uitofp i8 %3 to float %vecinit5 = insertelement <4 x float> %vecinit3, float %conv4, i32 2 %4 = extractelement <4 x i8> %0, i32 3 %conv6 = uitofp i8 %4 to float %vecinit7 = insertelement <4 x float> %vecinit5, float %conv6, i32 3 ret <4 x float> %vecinit7 Which does the cast as a sequence of scala...
2017 Sep 13
2
RFC phantom memory intrinsic
...s double, double* %ptr, i64 %add %1 = load double, double* %arrayidx1, align 8 %vecinit2 = insertelement <4 x double> %vecinit, double %1, i32 1 %add3 = add i64 %i, 2 %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3 %2 = load double, double* %arrayidx4, align 8 %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2 %add6 = add i64 %i, 3 %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6 %3 = load double, double* %arrayidx7, align 8 %vecinit8 = insertelement <4 x double> %vecinit5, double %3, i32 3 %shuffle = shuffl...
2017 Sep 13
2
RFC phantom memory intrinsic
...e, double* %arrayidx1, align 8 >> %vecinit2 = insertelement <4 x double> %vecinit, double %1, i32 1 >> %add3 = add i64 %i, 2 >> %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3 >> %2 = load double, double* %arrayidx4, align 8 >> %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2 >> %add6 = add i64 %i, 3 >> %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6 >> %3 = load double, double* %arrayidx7, align 8 >> %vecinit8 = insertelement <4 x double> %vecini...
2017 Sep 26
0
RFC phantom memory intrinsic
...ign 8 >>> %vecinit2 = insertelement <4 x double> %vecinit, double %1, i32 1 >>> %add3 = add i64 %i, 2 >>> %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3 >>> %2 = load double, double* %arrayidx4, align 8 >>> %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2 >>> %add6 = add i64 %i, 3 >>> %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6 >>> %3 = load double, double* %arrayidx7, align 8 >>> %vecinit8 = insertelement <4...
2017 Sep 26
2
RFC phantom memory intrinsic
...> %vecinit2 = insertelement <4 x double> %vecinit, double %1, i32 1 >>>> %add3 = add i64 %i, 2 >>>> %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3 >>>> %2 = load double, double* %arrayidx4, align 8 >>>> %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2 >>>> %add6 = add i64 %i, 3 >>>> %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6 >>>> %3 = load double, double* %arrayidx7, align 8 >>>> %vecinit8 = inse...
2017 Sep 26
0
RFC phantom memory intrinsic
...nsertelement <4 x double> %vecinit, double %1, i32 1 >>>>> %add3 = add i64 %i, 2 >>>>> %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3 >>>>> %2 = load double, double* %arrayidx4, align 8 >>>>> %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2 >>>>> %add6 = add i64 %i, 3 >>>>> %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6 >>>>> %3 = load double, double* %arrayidx7, align 8 >>>>>...
2017 Sep 12
3
RFC phantom memory intrinsic
Hi, For the PR21780 solution, I plan to add new functionality to restore memory operations that were once deleted. In this particular case it is the load operations that were deleted by InstCombine. Please note that once a load is removed there is no way to restore it, and that prevents us from vectorizing the shuffle operation. There are probably more similar issues where this approach could