search for: i0

Displaying 20 results from an estimated 273 matches for "i0".

2013 Aug 08
2
[LLVMdev] [Polly] Summary of some expensive compiler passes, especially PollyDependence
...{ x[ 4] ^= R(x[ 0]+x[12], 7); x[ 8] ^= R(x[ 4]+x[ 0], 9); x[12] ^= R(x[ 8]+x[ 4],13); x[ 0] ^= R(x[12]+x[ 8],18); .... x[14] ^= R(x[13]+x[12],13); x[15] ^= R(x[14]+x[13],18); } which would be translated into a serial of affine functions: ReadAccess := { Stmt_for_body5[i0] -> MemRef_x[12] }; ReadAccess := { Stmt_for_body5[i0] -> MemRef_x[4] }; MustWriteAccess := { Stmt_for_body5[i0] -> MemRef_x[4] }; ReadAccess := { Stmt_for_body5[i0] -> MemRef_x[0] }; ... Consequently, the PollyDependence pass would produce very complex...
2013 Aug 08
0
[LLVMdev] [Polly] Summary of some expensive compiler passes, especially PollyDependence
...x[ 8] ^= R(x[ 4]+x[ 0], 9); > x[12] ^= R(x[ 8]+x[ 4],13); x[ 0] ^= R(x[12]+x[ 8],18); > .... > x[14] ^= R(x[13]+x[12],13); x[15] ^= R(x[14]+x[13],18); > } > which would be translated into a serial of affine functions: > ReadAccess := { Stmt_for_body5[i0] -> MemRef_x[12] }; > ReadAccess := { Stmt_for_body5[i0] -> MemRef_x[4] }; > MustWriteAccess := { Stmt_for_body5[i0] -> MemRef_x[4] }; > ReadAccess := { Stmt_for_body5[i0] -> MemRef_x[0] }; > ... > Consequently, the PollyDependence pas...
2013 Aug 09
2
[LLVMdev] [Polly] Summary of some expensive compiler passes, especially PollyDependence
...9); >>      x[12] ^= R(x[ 8]+x[ 4],13);  x[ 0] ^= R(x[12]+x[ 8],18); >>      .... >>      x[14] ^= R(x[13]+x[12],13);  x[15] ^= R(x[14]+x[13],18); >>    } >> which would be translated into a serial of affine functions: >>        ReadAccess :=    { Stmt_for_body5[i0] -> MemRef_x[12] }; >>        ReadAccess :=    { Stmt_for_body5[i0] -> MemRef_x[4] }; >>        MustWriteAccess :=    { Stmt_for_body5[i0] -> MemRef_x[4] }; >>        ReadAccess :=    { Stmt_for_body5[i0] -> MemRef_x[0] }; >>        ... >> Consequently, the...
2014 Dec 31
2
[PATCH 1/2] nv50: regenerate rnndb headers
...E_NEVER 0x00000000 -#define NV50_2D_COND_MODE_ALWAYS 0x00000001 +#define NV50_2D_COND_MODE_ALWAYS 0x00000001 #define NV50_2D_COND_MODE_RES_NON_ZERO 0x00000002 #define NV50_2D_COND_MODE_EQUAL 0x00000003 #define NV50_2D_COND_MODE_NOT_EQUAL 0x00000004 +#define NVC0_2D_UNK0270(i0) (0x00000270 + 0x4*(i0)) +#define NVC0_2D_UNK0270__ESIZE 0x00000004 +#define NVC0_2D_UNK0270__LEN 0x00000004 + #define NV50_2D_CLIP_X 0x00000280 #define NV50_2D_CLIP_Y 0x00000284 @@ -158,7 +168,7 @@ WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. #...
2014 Dec 31
0
[PATCH 2/2] nvc0: regenerate rnndb headers
...e NVC0_2D_PATTERN_COLOR_FORMAT_UNK4 0x00000004 -#define NVC0_2D_PATTERN_COLOR_FORMAT_UNK5 0x00000005 - -#define NVC0_2D_PATTERN_MONO_FORMAT 0x000002ec -#define NVC0_2D_PATTERN_MONO_FORMAT_CGA6 0x00000000 -#define NVC0_2D_PATTERN_MONO_FORMAT_LE 0x00000001 - -#define NVC0_2D_PATTERN_COLOR(i0) (0x000002f0 + 0x4*(i0)) -#define NVC0_2D_PATTERN_COLOR__ESIZE 0x00000004 -#define NVC0_2D_PATTERN_COLOR__LEN 0x00000002 - -#define NVC0_2D_PATTERN_BITMAP(i0) (0x000002f8 + 0x4*(i0)) -#define NVC0_2D_PATTERN_BITMAP__ESIZE 0x00000004 -#define NVC0_2D_PATTERN_BITMAP__LEN...
2016 Oct 16
10
[PATCH 1/5] hwdefs: update nvc0_3d, add gm107_texture for new TIC format
...C0_3D_TESS_MODE_SPACING_EQUAL 0x00000000 #define NVC0_3D_TESS_MODE_SPACING_FRACTIONAL_ODD 0x00000010 #define NVC0_3D_TESS_MODE_SPACING_FRACTIONAL_EVEN 0x00000020 #define NVC0_3D_TESS_MODE_CW 0x00000100 #define NVC0_3D_TESS_MODE_CONNECTED 0x00000200 #define NVC0_3D_TESS_LEVEL_OUTER(i0) (0x00000324 + 0x4*(i0)) -#define NVC0_3D_TESS_LEVEL_OUTER__ESIZE 0x00000004 +#define NVC0_3D_TESS_LEVEL_OUTER__ESIZE 0x00000004 #define NVC0_3D_TESS_LEVEL_OUTER__LEN 0x00000004 #define NVC0_3D_TESS_LEVEL_INNER(i0) (0x00000334 + 0x4*(i0)) -#define NVC0_3D_TESS_LEVEL_IN...
2013 Aug 16
0
[LLVMdev] [Polly] Analysis of extra compile-time overhead for simple nested loops
On 08/16/2013 02:42 AM, Star Tan wrote: > At 2013-08-16 12:44:02,"Tobias Grosser" <tobias at grosser.es> wrote: >> Hi, >> >> I tried to reproduce your findings, but could not do so. > > > Sorry, I did not put all code in my previous email because the code seems a little too long and complicated. > You can refer to the detailed C code and LLVM IR
2013 Jul 28
0
[LLVMdev] [Polly] Analysis of the expensive compile-time overhead of Polly Dependence pass
...ways, we skip the "base" value when compare parameters. > With this patch file, the Scop becomes: > > Context: > p0: {0,+,128}<%for.cond2.preheader> > Statements { > Stmt_for_body6 > Domain := > { Stmt_for_body6[i0] : i0 >= 0 and i0 <= 7 }; > Scattering := > { Stmt_for_body6[i0] -> scattering[0, i0, 0] }; > ReadAccess := > [p_0] -> { Stmt_for_body6[i0] -> MemRef_input[o0] : 2o0 = p_0 + 16i0 }; > ReadAccess :...
2013 Jul 29
0
[LLVMdev] [Polly] Analysis of the expensive compile-time overhead of Polly Dependence pass
On 07/29/2013 09:15 AM, Sven Verdoolaege wrote: > On Mon, Jul 29, 2013 at 07:37:14AM -0700, Tobias Grosser wrote: >> On 07/29/2013 03:18 AM, Sven Verdoolaege wrote: >>> On Sun, Jul 28, 2013 at 04:42:25PM -0700, Tobias Grosser wrote: >>>> Sven: In terms of making the behaviour of isl easier to understand, >>>> it may make sense to fail/assert in case
2019 May 29
2
Basic block merging
Under certain circumstances, my compiler outputs basic blocks having the same function: bb_97: ; preds = %bb_1 %476 = getelementptr inbounds %LMtop.I0.ARType, %LMtop.I0.ARType* %0, i64 0, i32 6 %477 = bitcast i8** %476 to %LBstd.Cprocess.CRType** %478 = load %LBstd.Cprocess.CRType*, %LBstd.Cprocess.CRType** %477, align 8 %479 = getelementptr inbounds %LBstd.Cprocess.CRType, %LBstd.Cprocess.CRType* %478, i64 0, i32 3 %480 = bitcast i8** %4...
2013 Jul 31
1
[LLVMdev] [Polly] Analysis of the expensive compile-time overhead of Polly Dependence pass
...<= 9223372036854775807 and p_2 >= -9223372036854775808 and p_2 <= 9223372036854775807 } p0: {0,+,128}<%for.cond2.preheader> p1: {2,+,128}<%for.cond2.preheader> p2: {4,+,128}<%for.cond2.preheader> Statements { Stmt_for_body6 Domain := [p_0, p_1, p_2] -> { Stmt_for_body6[i0] : i0 >= 0 and i0 <= 7 }; Scattering := [p_0, p_1, p_2] -> { Stmt_for_body6[i0] -> scattering[0, i0, 0] }; ReadAccess := [p_0, p_1, p_2] -> { Stmt_for_body6[i0] -> MemRef_input[o0] : 2o0 = p_0 + 16i0 }; ReadAccess := [p_0, p_1, p_2] -> { Stmt_for_body6[i0] -> MemRef_input[o0...
2013 Jul 26
6
[LLVMdev] [Polly] Analysis of the expensive compile-time overhead of Polly Dependence pass
Hi Sebastian, Recently, I found the "Polly - Calculate dependences" pass would lead to significant compile-time overhead when compiling some loop-intensive source code. Tobias told me you found similar problem as follows: http://llvm.org/bugs/show_bug.cgi?id=14240 My evaluation shows that "Polly - Calculate dependences" pass consumes 96.4% of total compile-time overhead
2013 Aug 17
2
[LLVMdev] [Polly] Analysis of extra compile-time overhead for simple nested loops
...preheader => for.end32' in function 'main': Context: [cond.reload] -> { : cond.reload >= -2147483648 and cond.reload <= 2147483647 } p0: %cond.reload Statements { Stmt_for_body16 Domain := [cond.reload] -> { Stmt_for_body16[i0, i1, i2, i3, i4, i5] : i0 >= 0 and i0 <= -1 + cond.reload and i1 >= 0 and i1 <= -1 + cond.reload and i2 >= 0 and i2 <= -1 + cond.reload and i3 >= 0 and i3 <= -1 + cond.reload and i4 >= 0 and i4 <= -1 + cond.reload and i5 >= 0 and i5 <= -1 + cond.reload and cond.r...
2019 May 29
3
Basic block merging
...llvm-dev > <llvm-dev at lists.llvm.org> wrote: > > > > Under certain circumstances, my compiler outputs basic blocks having the same function: > > > > bb_97: ; preds = %bb_1 > > %476 = getelementptr inbounds %LMtop.I0.ARType, %LMtop.I0.ARType* %0, i64 0, i32 6 > > %477 = bitcast i8** %476 to %LBstd.Cprocess.CRType** > > %478 = load %LBstd.Cprocess.CRType*, %LBstd.Cprocess.CRType** %477, align 8 > > %479 = getelementptr inbounds %LBstd.Cprocess.CRType, %LBstd.Cprocess.CRType* %478, i64 0,...
2013 Aug 16
2
[LLVMdev] [Polly] Analysis of extra compile-time overhead for simple nested loops
At 2013-08-16 12:44:02,"Tobias Grosser" <tobias at grosser.es> wrote: >Hi, > >I tried to reproduce your findings, but could not do so. Sorry, I did not put all code in my previous email because the code seems a little too long and complicated. You can refer to the detailed C code and LLVM IR code on http://llvm.org/bugs/show_bug.cgi?id=16843 There are four attachments
2016 May 16
2
Determination of statements that contain only matrix multiplication
...j < Upper Bound2; j++) for (int k = 0; k < Upper Bound3; j++) C[i][j] += A[i][k] * B[k][j] We could probably check that memory access relations have the following form: "accesses" : [ { "kind" : "read", "relation" : "{ Stmt_14[i0, i1, i2] -> MemRef_1[i0, i2] }" }, { "kind" : "read", "relation" : "{ Stmt_14[i0, i1, i2] -> MemRef_2[i2, i1] }" }, { "kind" : "read", "relation" : "{ Stmt_14[i0, i1, i2] -> MemRef_3[i0,...
2013 Jan 02
0
[LLVMdev] [DragonEgg] [Polly] Should we expect DragonEgg to produce identical LLVM IR for identical GIMPLE?
...ptrValue = 47248855040 offset = 40, ptrValue = 47246749696 offset = 48, ptrValue = 47247802368 offset = 16, value = 64 offset = 20, value = 64 offset = 24, value = 64 MemoryAccess to pointer: float* inttoptr (i64 47246749696 to float*) { Stmt__12_cloned_[i0, i1, i2] -> MemRef_nttoptr (i64 47246749696 to float*)[4096i0 + 64i1 + i2] } allocSize: 4 storeSize: 4 replacedBy: { Stmt__12_cloned_[i0, i1, i2] -> NULL[o0] : o0 >= 47246749696 + 16384i0 + 256i1 + 4i2 and o0 <= 47246749699 + 16384i0 + 256i1 + 4i2 } MemoryAccess to pointer:...
2017 May 06
2
Build polly-amd64-linux Failure
...mp: [branch trunk] 302339 > Blamelist: meinersbur Hi Michael, the recent commit caused a failure: /home/grosser/buildslave/polly-amd64-linux/llvm.src/tools/polly/test/DeLICM/reduction_overapproximate.ll:102:16: error: expected string not found in input ; APPROX-NEXT: new: { Stmt_reduction_inc[i0, i1] -> MemRef_A[2] : i0 <= 3 and 0 <= i1 <= -2 + i0 }; ^ <stdin>:62:2: note: scanning from here new: { Stmt_reduction_inc[i0, i1] -> MemRef_A[2] : 2 <= i0 <= 3 and 0 <= i1 <= -2 + i0 }; I have no idea where it comes from. On my machine (and my ear...
2023 Dec 04
1
Fit NLE - was: computer algebra in R
...at.ethz.ch/pipermail/r-help/2023-November/478619.html Dear Kornad, I think I have started to understand what you try to achieve. The problem is to fit a NLE and compute the parameters of the NL-Eq. I have included the R Help-list back in the loop, as I am not an expert in optimization. Goal: y ~ I0 + IHD * hd + ID * d; where: y = given vector of measurements; x = given vector of values; I0, IHD, ID, kd = parameters to optimize; hd = satisfies a polynomial of order 3; As d = d0 - hd, the previous formula can be written: y ~ I0 + ID * d0 + (IHD - ID) * hd; f(x, hd, kd) = 0, where f = a poly...
2013 Jan 02
2
[LLVMdev] [DragonEgg] [Polly] Should we expect DragonEgg to produce identical LLVM IR for identical GIMPLE?
On 01/01/2013 02:45 PM, Duncan Sands wrote: > Hi Dmitry, > >> >> In our compiler we use a modified version LLVM Polly, which is very >> sensitive to >> proper code generation. Among the number of limitations, the loop region >> (enclosed by phi node on induction variable and branch) is required to >> be free >> of additional memory-dependent