Displaying 5 results from an estimated 5 matches for "gopalasubramanian".

2018 May 11
Query on unswitching + vectorization
On 5/10/2018 10:44 PM, Gopalasubramanian, Ganesh via llvm-dev wrote:
> Hi,
>
> I am going through analysis on unswitching + vectorization.
>
> For the below test, llvm unswitches successfully but fails to
> vectorize the loop after unswitching.
>
> Llvm bails out saying “Found an outside user” apparently whic...
2018 May 14
Query on unswitching + vectorization
...y not be able to get the indvars sorted out because of the integer type limitation in SCEV! Is there a way out when we have a scalar float computation that is dependent on induction variable?

-Ganesh

From: Friedman, Eli [mailto:efriedma at codeaurora.org]
Sent: Saturday, May 12, 2018 12:36 AM
To: Gopalasubramanian, Ganesh <Ganesh.Gopalasubramanian at amd.com>; llvm-dev at lists.llvm.org
Subject: Re: [llvm-dev] Query on unswitching + vectorization

On 5/10/2018 10:44 PM, Gopalasubramanian, Ganesh via llvm-dev wrote:
Hi, I am going through analysis on unswitching + vectorization. For the below test, llv...
2018 May 11
Query on unswitching + vectorization
Hi,

I am going through analysis on unswitching + vectorization. For the below test, llvm unswitches successfully but fails to vectorize the loop after unswitching. Llvm bails out saying "Found an outside user" apparently which is the value of 'tmp'.

    int i, w, x[1000], y[1000],tmp;
    void fn() {
      for (i = 0; i < 1000; i++) {
        if (w==1) {
          y[i] = 1;
          tmp = i*2;
        }
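To make the transformation in question concrete, here is a minimal hand-written sketch of what loop unswitching does to a loop of this shape. It is not the compiler's output and not the poster's full test: the snippet above is cut off, so the `else` branch below is a hypothetical stand-in, the loop counter is a local rather than the global `i`, and `fn_original`, `fn_unswitched`, and `check_equivalence` are names invented for illustration. The point it tries to show is that `tmp` is written inside the loop and read after it, which is the kind of "outside user" the message mentions.

```c
#include <assert.h>
#include <string.h>

#define N 1000

int w, y[N], tmp;

/* Loop as written: the loop-invariant test (w == 1) runs on every iteration. */
void fn_original(void) {
    for (int i = 0; i < N; i++) {
        if (w == 1) {
            y[i] = 1;
            tmp = i * 2;   /* tmp is live after the loop: an "outside user" */
        } else {
            y[i] = 0;      /* hypothetical branch, not from the original post */
        }
    }
}

/* Hand-unswitched form: the invariant test is hoisted out and the loop is
 * duplicated, one copy per outcome of the test. */
void fn_unswitched(void) {
    if (w == 1) {
        for (int i = 0; i < N; i++) {
            y[i] = 1;
            tmp = i * 2;
        }
    } else {
        for (int i = 0; i < N; i++) {
            y[i] = 0;
        }
    }
}

/* Runs both versions from the same starting state and checks that the
 * observable results (y and tmp) agree. Returns 1 on success. */
int check_equivalence(int w_value) {
    int y_ref[N];
    int tmp_ref;

    w = w_value;
    tmp = -1;
    memset(y, 0xff, sizeof y);
    fn_original();
    memcpy(y_ref, y, sizeof y);
    tmp_ref = tmp;

    tmp = -1;
    memset(y, 0xff, sizeof y);
    fn_unswitched();

    assert(tmp == tmp_ref);
    assert(memcmp(y, y_ref, sizeof y) == 0);
    return 1;
}
```

In the `w == 1` copy of the loop, only the value of `tmp` from the last iteration is observable afterwards, so the loop body itself has no cross-iteration dependence on it.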
2018 Apr 04
SCEV and LoopStrengthReduction Formulae
> cmpq %rbx, %r14
> jne .LBB0_1
>
> LLVM can perform compare-jump fusion, it already does in certain cases, but
> not in the case above. We can remove the cmp above if we were to perform
> the following transformation:

Do you mean branch-fusion (https://en.wikichip.org/wiki/macro-operation_fusion)? Is there any further limitation that explains why these two are not fused?

> -----Original
2018 Apr 03
SCEV and LoopStrengthReduction Formulae
I am attempting to implement a minor loop strength reduction optimization for targets that support compare and jump fusion, specifically TTI::canMacroFuseCmp(). My approach might be wrong; however, I am soliciting feedback on the idea so that I can implement this correctly. My plan is to add a supplemental LSR formula to LoopStrengthReduce.cpp that optimizes the following case, but perhaps
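The snippet is cut off before the case it refers to, so the following is only a generic source-level sketch of one well-known way a loop's explicit compare against a bound can become unnecessary: rebase the index so that it counts up to zero, letting the exit test use the result of the increment itself. Whether this matches the poster's proposed formula is an assumption; `sum_count_up` and `sum_count_to_zero` are hypothetical names, and the actual instruction selection depends on the target and compiler.

```c
#include <assert.h>
#include <stddef.h>

/* Baseline: the index counts up from 0 and is compared against the bound n
 * on every iteration. */
long sum_count_up(const int *p, ptrdiff_t n) {
    long total = 0;
    for (ptrdiff_t i = 0; i < n; ++i) {
        total += p[i];
    }
    return total;
}

/* Rebased: the index runs from -n up to 0 and addresses are formed relative
 * to the end of the array, so the exit test is "did the increment reach
 * zero" rather than a compare against a separate bound. */
long sum_count_to_zero(const int *p, ptrdiff_t n) {
    long total = 0;
    assert(n >= 0);            /* precondition assumed by this sketch */
    if (n == 0) {
        return 0;
    }
    const int *end = p + n;    /* one past the last element */
    ptrdiff_t i = -n;
    do {
        total += end[i];       /* end[-n] is p[0], end[-1] is p[n - 1] */
    } while (++i != 0);
    return total;
}
```

Both functions visit the same elements in the same order; only the form of the induction variable and the exit test differs.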