search for: kernel_2mm

Displaying 3 results from an estimated 3 matches for "kernel_2mm".

2018 Mar 14
0
LLVM opt unable to vectorize PolyBench code
It would help if you sent the IR you're giving to opt or at least a complete C function and your clang command line. ~Craig On Wed, Mar 14, 2018 at 3:05 PM, hameeza ahmed <hahmed2305 at gmail.com> wrote: > Hello, > > I m unable to vectorize following kernel by opt tool; > > for (i = 0; i < _PB_NI; i++) > for (j = 0; j < _PB_NJ; j++) > { >
2018 Mar 14
2
LLVM opt unable to vectorize PolyBench code
Hello, I m unable to vectorize following kernel by opt tool; for (i = 0; i < _PB_NI; i++) for (j = 0; j < _PB_NJ; j++) { tmp[i][j] = 0; for (k = 0; k < _PB_NK; ++k) tmp[i][j] += alpha * A[i][k] * B[k][j]; } for (i = 0; i < _PB_NI; i++) for (j = 0; j < _PB_NL; j++) { D[i][j] *= beta; for (k = 0; k < _PB_NJ; ++k) D[i][j] +=
2019 Apr 15
2
Loop Strength Reduction Pass Does Not Work for Some Varialbles Related to Induction Variables
...oopStrengthReduction is unable to remove all the multiplications for the element offset calculation. My test code is shown below and thanks a lot in advance for your time and suggestion! ----------------------------------------------- #define NI 16 #define NJ 18 #define NK 22 #define NL 24 void kernel_2mm(int ni, int nj, int nk, int nl, int alpha, int beta, int tmp[NI][NJ], int A[NI][NK], int B[NK][NJ], int C[NJ][NL], int D[NI][NL]) { int i, j, k; /* D := alpha*A*B*C + beta*D */ for (i = 0; i < NI; i++) for (j = 0; j < NJ; j++) { tmp[i][j] = 0...