thr3ads.net - llvm dev - [LLVMdev] InstCombine "pessimizes" trunc i8 to i1? [Dec 2011]


Jochen Wilhelmy

2011-Dec-28 11:45 UTC

[LLVMdev] InstCombine "pessimizes" trunc i8 to i1?

>> Hi!
>>
>> before InstCombine (llvm::createInstructionCombiningPass()) I have
>> a trunc from i8 to i1 and then a select:
>>
>> %45 = load i8* @myGlobal, align 1
>> %tobool = trunc i8 %45 to i1
>> %cond = select i1 %tobool, float 1.000000e+00, float -1.000000e+00
>>
>> after instCombine I have:
>>
>> %29 = load i8* @myGlobal, align 1
>> %30 = and i8 %29, 1
>> %tobool = icmp ne i8 %30, 0
>> %cond = select i1 %tobool, float 1.000000e+00, float -1.000000e+00
>>
>> is this a bug or intended? My version is 3.0 release.
>> Please tell me where I can remove this rule even if it is intended for
>> mainline.
> This is intentional: an 'and' must be done in both cases, so this
> transformation is exposing it to the optimizer.
>
> Why do you consider this to be a pessimization?  Does one produce inferior
> machine code?

I consider it a pessimization because it adds one instruction, and I'm mainly
interested in target-independent optimizations because I regenerate high-level
code from the IR ("Exporting 3D scenes from Maya to WebGL using clang and
llvm"). For example, from the code before the transformation I can easily
regenerate
myGlobal ? 1.0f : -1.0f
while after the transformation I get
(myGlobal & 1) != 0 ? 1.0f : -1.0f
which is not good for shading languages.

I could remove the transformation in my local copy (I have found it by now),
but if it were possible to move it into the optimizer that needs it, that
would be beneficial for me.

-Jochen

Reid Kleckner

2011-Dec-29 18:52 UTC


[LLVMdev] InstCombine "pessimizes" trunc i8 to i1?

I think Chris is saying that the 'and' is necessary because with your i1
trunc you're ignoring all of the high bits.  The 'and' implements that.  If
you don't want this behavior, don't generate the trunc in the first place
and just compare the full width to zero.

Reid


Chris Lattner

2011-Dec-30 07:45 UTC


[LLVMdev] InstCombine "pessimizes" trunc i8 to i1?

On Dec 29, 2011, at 10:52 AM, Reid Kleckner wrote:
> I think Chris is saying that the 'and' is necessary because with your i1
> trunc you're ignoring all of the high bits.  The 'and' implements that.  If
> you don't want this behavior, don't generate the trunc in the first
> place and just compare the full width to zero.

Right.  Turning this into "myGlobal ? 1.0f : -1.0f" is not correct.

-Chris
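
To make the point above concrete, here is a minimal C++ sketch of the two predicates on an 8-bit value. The function names (trunc_to_i1, nonzero, count_disagreements) are invented for illustration and do not come from LLVM; the sketch only models the arithmetic, not the pass itself.

```cpp
#include <cassert>
#include <cstdint>

// Models `trunc i8 %x to i1`: only the lowest bit of the byte survives.
constexpr bool trunc_to_i1(std::uint8_t x) { return (x & 1u) != 0; }

// Models the source-level test in `x ? a : b`: any set bit makes it true.
constexpr bool nonzero(std::uint8_t x) { return x != 0; }

// Counts the 8-bit values for which the two predicates give different answers.
inline int count_disagreements() {
    int n = 0;
    for (int v = 0; v < 256; ++v) {
        const std::uint8_t x = static_cast<std::uint8_t>(v);
        if (trunc_to_i1(x) != nonzero(x)) {
            ++n;
        }
    }
    return n;
}
```

The two predicates differ on every nonzero even value (2, 4, ..., 254), which is why the 'and' with 1 cannot simply be dropped when regenerating high-level code from the trunc.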

Jochen Wilhelmy

2011-Dec-30 14:21 UTC


[LLVMdev] InstCombine "pessimizes" trunc i8 to i1?

On 29.12.2011 19:52, Reid Kleckner wrote:
> I think Chris is saying that the 'and' is necessary because with your i1
> trunc you're ignoring all of the high bits.  The 'and' implements that.
> If you don't want this behavior, don't generate the trunc in the
> first place and just compare the full width to zero.

But if a backend sees a trunc from i8 to i1, it should know about the 'and';
therefore I think replacing it with an 'and' is not a target-independent
transformation. I'm not saying the 'and' is wrong, I just think InstCombine
is the wrong place for it if InstCombine is supposed to be target independent
(which is my assumption, and possibly wrong).

By the way, the i8 and the trunc come from clang, as clang represents a bool
as i8 in memory. Of course it would be a nice feature if I could tell clang
to always use i1 for bool; this would also remove the problem.
Is this possible?

-Jochen
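
As a rough illustration of why the bool case feels different, the sketch below compares the two select forms when the stored byte is restricted to 0 or 1. That restriction is an assumption about whoever writes the byte; nothing in the IR shown in this thread states it. The function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Select as in the original IR: test only the low bit (trunc i8 to i1).
constexpr float select_low_bit(std::uint8_t stored) {
    return (stored & 1u) != 0 ? 1.0f : -1.0f;
}

// Select as a shading language would prefer it: test the whole byte for nonzero.
constexpr float select_nonzero(std::uint8_t stored) {
    return stored != 0 ? 1.0f : -1.0f;
}

// Assumption: a valid bool occupies its byte as exactly 0 or 1.
// Under that assumption both selects return the same value.
inline bool agree_on_bool_values() {
    return select_low_bit(0) == select_nonzero(0) &&
           select_low_bit(1) == select_nonzero(1);
}
```

For bytes limited to 0 and 1 the simpler form gives the same result, but a code regenerator can only rely on that if it knows the value really is a bool; for an arbitrary i8 such as 2 the two selects return different values.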
