test.c :
-----------------------------------------------------------------------
int foo(int a)
{
int zero = 0;
for (int i = 0; i < 10000; i++)
zero *= a;
return zero;
}
-------------------------------------------------------------------------
run clang : clang -O2 -S test.c -o test.s
My clang version is 3.7.1.
We will get a horrible assembly output.
Why constant propagation and other optimization skills can not find out
that variable zero is initialized 0, and the only statement in for loop
(i.e. zero *= a) always get a 0?
I can read the clang/llvm source, Please tell me more details.
THX
--
业精于勤,荒于嬉..
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20160302/f15da086/attachment.html>
Hongbin Zheng via llvm-dev
2016-Mar-02 07:31 UTC
[llvm-dev] Why LLVM cannot optimize this?
if you replace "zero *= a;" by "zero += a;", you get:
; Function Attrs: norecurse nounwind readnone uwtable
define i32 @foo(i32 %a) #0 {
entry:
%0 = mul i32 %a, 10000
ret i32 %0
}
I think the problem is ScalarEvolution only have "SCEVAddRecExpr" for
zero
+= a, but no corresponding expression to represent and optimize zero *= a;
On Wed, Mar 2, 2016 at 3:24 PM, zet via llvm-dev <llvm-dev at
lists.llvm.org>
wrote:
> test.c :
> -----------------------------------------------------------------------
> int foo(int a)
> {
> int zero = 0;
> for (int i = 0; i < 10000; i++)
> zero *= a;
> return zero;
> }
> -------------------------------------------------------------------------
>
> run clang : clang -O2 -S test.c -o test.s
>
> My clang version is 3.7.1.
> We will get a horrible assembly output.
>
> Why constant propagation and other optimization skills can not find out
> that variable zero is initialized 0, and the only statement in for loop
> (i.e. zero *= a) always get a 0?
>
> I can read the clang/llvm source, Please tell me more details.
>
> THX
>
>
> --
> 业精于勤,荒于嬉..
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20160302/606c2416/attachment.html>
Hi, Yes SCEV is pretty limited on this aspect. This kind of code can trigger LLVM to explode in time/memory: https://llvm.org/bugs/show_bug.cgi?id=18606 <https://llvm.org/bugs/show_bug.cgi?id=18606> See also this llvm-dev thread: SCEV implementation and limitations, do we need "pow"? : http://lists.llvm.org/pipermail/llvm-dev/2014-February/070062.html CC: Sanjoy who may have an opinion on how to improve SCEV for this? -- Mehdi> On Mar 1, 2016, at 11:31 PM, Hongbin Zheng via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > if you replace "zero *= a;" by "zero += a;", you get: > > ; Function Attrs: norecurse nounwind readnone uwtable > define i32 @foo(i32 %a) #0 { > entry: > %0 = mul i32 %a, 10000 > ret i32 %0 > } > > I think the problem is ScalarEvolution only have "SCEVAddRecExpr" for zero += a, but no corresponding expression to represent and optimize zero *= a; > > > > On Wed, Mar 2, 2016 at 3:24 PM, zet via llvm-dev <llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>> wrote: > test.c : > ----------------------------------------------------------------------- > int foo(int a) > { > int zero = 0; > for (int i = 0; i < 10000; i++) > zero *= a; > return zero; > } > ------------------------------------------------------------------------- > > run clang : clang -O2 -S test.c -o test.s > > My clang version is 3.7.1. > We will get a horrible assembly output. > > Why constant propagation and other optimization skills can not find out that variable zero is initialized 0, and the only statement in for loop (i.e. zero *= a) always get a 0? > > I can read the clang/llvm source, Please tell me more details. > > THX > > > -- > 业精于勤,荒于嬉.. > > _______________________________________________ > LLVM Developers mailing list > llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org> > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev <http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev> > > > _______________________________________________ > LLVM Developers mailing list > llvm-dev at lists.llvm.org > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20160302/2fae2532/attachment.html>
Possibly Parallel Threads
- Why LLVM cannot optimize this?
- [LLVMdev] [PATCH] Emit rbit, clz on ARM for __builtin_ctz
- [LLVMdev] [PATCH] Emit rbit, clz on ARM for __builtin_ctz
- [LLVMdev] [PATCH] Emit rbit, clz on ARM for __builtin_ctz
- [LLVMdev] [PATCH] Emit rbit, clz on ARM for __builtin_ctz