thr3ads.net - llvm dev - [LLVMdev] A case where llvm created different cfg for same code [Aug 2008]

If this information is useful, please help other people find it:
Share via:

Sheng Zhou

2008-Aug-13 07:03 UTC

[LLVMdev] A case where llvm created different cfg for same code

>
> Message: 4
> Date: Tue, 12 Aug 2008 13:23:52 -0700
> From: "Bill Wendling" <isanbard at gmail.com>
> Subject: Re: [LLVMdev] A case where llvm created different cfg for
> 	same code
> To: "LLVM Developers Mailing List" <llvmdev at cs.uiuc.edu>
> Message-ID:
> 	<16e5fdf90808121323g1ae2a2e3lb6c5bd62521df621 at mail.gmail.com>
> Content-Type: text/plain; charset=ISO-8859-1
>
> On Tue, Aug 12, 2008 at 1:24 AM, Sheng Zhou <zhousheng at
autoesl.com> wrote:
>   
>> > Hi,
>> >
>> > The following two segments of code are actually the same,
>> > but llvm created different cfg for them.
>> >
>>     
> ...
>   
>> >
>> > The prime difference is that: cfg of form2 has additional basic
block
>> > which has a back edge to a non-header-block
>> > I think the loop in that cfg is not canonical.
>> >
>> > I tried -loopsimplify and -indvars , but no improvement.
>> >
>> > Any comments for this? Thanks in advance.
>> >
>>     
> The code is not the same. 7 is commented out in the first and 8 in the
> second. Why would you expect the CFGs to match? Is there something
> wrong (inefficient) in the resulting output of these two functions
> when compiled?
>   7 for(i=0; i<j && i+j+1<N; i++) {

8 for(i=0; i<j && i<N-j-1; i++) {


Line 7 and Line 8 actually have the same expression,
Line 8 just move the "j+1" to the right hand of the inequation.

I just wonder why the two equal inequation result in different cfg, the
additional basicblock, backedge...
Surely the two cfg are all correct, but just the additional backedge
makes it hard for loop pipeline.

Can we improve it?
> -bw

Bill Wendling

2008-Aug-13 08:01 UTC

head link

[LLVMdev] A case where llvm created different cfg for same code

On Aug 13, 2008, at 12:03 AM, Sheng Zhou wrote:
>>>>
>>>> The prime difference is that: cfg of form2 has additional basic
>>>> block
>>>> which has a back edge to a non-header-block
>>>> I think the loop in that cfg is not canonical.
>>>>
>>>> I tried -loopsimplify and -indvars , but no improvement.
>>>>
>>>> Any comments for this? Thanks in advance.
>>>>
>>>
>> The code is not the same. 7 is commented out in the first and 8 in  
>> the
>> second. Why would you expect the CFGs to match? Is there something
>> wrong (inefficient) in the resulting output of these two functions
>> when compiled?
>>
> 7 for(i=0; i<j && i+j+1<N; i++) {
>
> 8 for(i=0; i<j && i<N-j-1; i++) {
>
>
> Line 7 and Line 8 actually have the same expression,
The expressions are logically similar, but they aren't the same to the  
compiler unless some pass that recognizes these types of polynomial  
equations and can prove that they are the same is run (I'm not sure if  
such a pass exists in LLVM). For instance, because you use "j - 1" in
8, you now have a situation where you have a negative number when j ==  
0, etc.
> Line 8 just move the "j+1" to the right hand of the inequation.
>
> I just wonder why the two equal inequation result in different cfg,  
> the
> additional basicblock, backedge...
> Surely the two cfg are all correct, but just the additional backedge
> makes it hard for loop pipeline.
>
> Can we improve it?
>
>>
For what it's worth, I got the exact same code for both forms when  
compiling at -O3.

-bw

Duncan Sands

2008-Aug-13 08:24 UTC

head link

[LLVMdev] A case where llvm created different cfg for same code

Hi,
> 7 for(i=0; i<j && i+j+1<N; i++) {
> 
> 8 for(i=0; i<j && i<N-j-1; i++) {
the arithmetic might overflow in one of these
but not in the other.

Best wishes,

Duncan.

Daniel Berlin

2008-Aug-14 03:35 UTC

head link

[LLVMdev] A case where llvm created different cfg for same code

On Wed, Aug 13, 2008 at 3:03 AM, Sheng Zhou <zhousheng at autoesl.com>
wrote:
> 7 for(i=0; i<j && i+j+1<N; i++) {
>
> 8 for(i=0; i<j && i<N-j-1; i++) {
>
>
> Line 7 and Line 8 actually have the same expression,
> Line 8 just move the "j+1" to the right hand of the inequation.
>
>
These expressions overflow in different cases.
They are not equivalent.

Sheng Zhou

2008-Aug-14 09:46 UTC

head link

[LLVMdev] A case where llvm created different cfg for same code

Duncan Sands 写道:> Hi,
>
>   
>> 7 for(i=0; i<j && i+j+1<N; i++) {
>>
>> 8 for(i=0; i<j && i<N-j-1; i++) {
>>     
>
> the arithmetic might overflow in one of these
> but not in the other.
>   ok, finally I found the reason.

First, the llvm-gcc with -O0 will emit same cfg for the two forms of codes.
In the loop of Line 7/8, the "i<j" and "i+j+1<N or i <
N-j-1" each
occupies one basic block with a exit edge to the same successor basic block.
hence the loop has two exit edges.

Several passes will optimize the cfg and the ll code.

The final difference of the two cfg is due to pass licm and simplifycfg.
The licm will move the common expression out of the loop, like "j+1"
and
"N-j-1", thus the basic block for "i < N-j-1" will
finally has only two
instruction: the cmp and the branchInst. basic block for "i+j+1<N"
has
an addtional "adder" in it.

The pass simplifycfg has a process function :
static bool FoldBranchToCommonDest(BranchInst *BI)
Its comments :
"/// FoldBranchToCommonDest - If this basic block is ONLY a setcc and a 
branch,
/// and if a predecessor branches to us and one of our successors, fold the
/// setcc into the predecessor and use logical operations to pick the right
/// destination."
So, as the basicblock for "i < N -j-1" has just has cmp and
branchinst,
it will be folded, and hence get a better cfg.
The basicblock for "i+j+1<N", as having additional
"adder", leaves.

So my question is: can we loose the constraint "ONLY a setcc and a 
branch" ? why it's necessary?


> Best wishes,
>
> Duncan.
>

Possibly Parallel Threads

Search for more maybe matching threads

llvm dev - Aug 2008 - [LLVMdev] A case where llvm created different cfg for same code

[LLVMdev] A case where llvm created different cfg for same code

[LLVMdev] A case where llvm created different cfg for same code

[LLVMdev] A case where llvm created different cfg for same code

[LLVMdev] A case where llvm created different cfg for same code

[LLVMdev] A case where llvm created different cfg for same code

Possibly Parallel Threads