Displaying 3 results from an estimated 3 matches for "bd2d9130".
2013 Oct 30
[LLVMdev] loop vectorizer
Hi Frank,
The access pattern to arrays a and b is non-linear. Unrolled loops are usually handled by the SLP-vectorizer. Are ir0 and ir1 consecutive for all values of i?
Thanks,
Nadav
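
One way to look at the question about consecutive indices is to tabulate ir0 directly. The sketch below copies the ir0 expression from the quoted loop (with inner = 4); the helper names ir0_index and ir0_is_consecutive are hypothetical and not part of the original post. ir1 is left out because its expression is cut off in the archived snippet.

```cpp
#include <cstdint>

// ir0 as written in the quoted loop body, with inner fixed at 4.
std::uint64_t ir0_index(std::uint64_t i) {
    const std::uint64_t inner = 4;
    return ((i / inner) * 2 + 0) * inner + i % 4;
}

// Hypothetical helper: true when ir0 advances by exactly 1 from i to i + 1.
bool ir0_is_consecutive(std::uint64_t i) {
    return ir0_index(i + 1) == ir0_index(i) + 1;
}
```

Under these assumptions, ir0 runs 0, 1, 2, 3 for i = 0..3 and then jumps to 8 at i = 4, so the indices are consecutive only within each block of four iterations, not across block boundaries.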
On Oct 30, 2013, at 9:05 AM, Frank Winter <fwinter at jlab.org> wrote:
> The loop vectorizer seems unable to vectorize the following code:
>
> void bar(std::uint64_t start,
2013 Oct 30
[LLVMdev] loop vectorizer
...the relationship becomes clear.
Maybe I'm wrong, but this looks like a missed optimization opportunity
rather than something technically hard to implement.
--renato
2013 Oct 30
[LLVMdev] loop vectorizer
The loop vectorizer seems unable to vectorize the following code:
void bar(std::uint64_t start, std::uint64_t end, float * __restrict__ c,
         float * __restrict__ a, float * __restrict__ b)
{
    const std::uint64_t inner = 4;
    for (std::uint64_t i = start; i < end; ++i) {
        const std::uint64_t ir0 = ((i / inner) * 2 + 0) * inner + i % 4;
        const std::uint64_t ir1 = ( (i/inner)