Displaying 1 result from an estimated 1 matches for "l_copy".
Did you mean:
tl_copy
2012 Sep 13
1
[LLVMdev] Question about optimizing mem in loop
Is there a strong reason why this simple code:
for(rnd = 0; rnd < Nrnd - 1; ++rnd)
{
// round(inv_rnd, b1, b0, kp);
for (iter = 0; iter < 4; ++iter) {
round_i(inv_rnd, b1, b0, kp, iter);
}
l_copy(b0, b1); kp -= nc;
}
Produces the complicated control flow logic in the attached CFG?
If I unroll the loop I no longer have the crazy control flow logic. It
seems that instead of calculating the GEPs one at a time inside the loop,
it's pulling all 4 out of the inner loop into the oute...