Displaying 3 results from an estimated 3 matches for "r284044".
2016 Oct 13
2
Loop Unrolling Fail in Simple Vectorized loop
...here's a few different angles you could use to attack this: you could
> teach the unroller to unroll loops with an uncomputable trip count, or you
> can make the trip count of your loop computable somehow. Changing the
> unroller is probably straightforward (see the recently committed r284044).
> Making the trip count computable is more complicated... it's probably
> possible to teach SCEV to reason about the overflow in the pointer
> computation, or maybe you could version the loop.
>
> -Eli
>
> --
> Employee of Qualcomm Innovation Center, Inc.
> Qualcomm...
2016 Oct 12
2
Loop Unrolling Fail in Simple Vectorized loop
Hi all,
Attached herewith is a simple vectorized function with loops performing a
simple shuffle.
I want all loops (inner and outer) to be unrolled by 2 and as such used
-unroll-count=2
The inner loops(with k as the induction variable and having constant trip
counts) unroll fully, but the outer loop with (j) fails to unroll.
The llvm code is also attached with inner loops fully unrolled.
To
2016 Oct 13
2
Loop Unrolling Fail in Simple Vectorized loop
...nt angles you could use to attack this: you could
>>> teach the unroller to unroll loops with an uncomputable trip count, or you
>>> can make the trip count of your loop computable somehow. Changing the
>>> unroller is probably straightforward (see the recently committed r284044).
>>> Making the trip count computable is more complicated... it's probably
>>> possible to teach SCEV to reason about the overflow in the pointer
>>> computation, or maybe you could version the loop.
>>>
>>> -Eli
>>>
>>> --
>>...