thr3ads.net - llvm dev - [LLVMdev] RFC: Proposal for Poison Semantics [Feb 2015]

If this information is useful, please help other people find it:
Share via:

Bruce Hoult

2015-Feb-01 10:44 UTC

[LLVMdev] RFC: Proposal for Poison Semantics

I don't know how things work at the moment, but it seems to me that you can
do lots of sensible things, and avoid lots of silly things, if you keep
track of four possible values for each bit:

- undef (the default)
- poison
- known to be 0
- known to be 1

This makes both David's and Chandler's examples work nicely if you
assume:

- ZEXT makes all the new bits known 0
- SEXT makes all the new bits the same as the high bit
- AND clears unknown and poison bits to known 0 if the other input is known
0
- OR sets unknown and poison bits to known 1 if the other input is known 1

Also things such as ZEXTing a poison i32 to i64 and then right shifting by
32 will result in all known 0 bits.


On Sun, Feb 1, 2015 at 11:19 PM, Chandler Carruth <chandlerc at
google.com>
wrote:
>
> On Sun, Feb 1, 2015 at 1:57 AM, David Majnemer <david.majnemer at
gmail.com>
> wrote:
>
>>
>> On Tue, Jan 27, 2015 at 8:58 PM, Sanjoy Das <
>> sanjoy at playingwithpointers.com> wrote:
>>
>>> > Ah, yes.  You are right, we cannot always assume that %y would
be zero
>>> in
>>> > the second case.
>>> > This wouldn't be the first time we've lost information
that we could
>>> use to
>>> > optimize a program by transforming it.
>>> >
>>> > Do you think this result would be problematic?  It seems
consistent
>>> with the
>>> > RFC and LLVM's current behavior.
>>> >
>>>
>>> The problem is not that we're losing information, the problem
is that
>>> we're changing the behavior of a well-defined program.
>>>
>>> I'll try to put the whole argument in one place:
>>>
>>> We start with
>>>
>>>   %x = add nuw i32 %m, %n
>>>   %y = zext i32 %x to i64
>>>   %s = lshr i64 %y, 32
>>>   %addr = gep %some_global, %s
>>>   store i32 42, i32* %addr
>>>
>>> In the above program, for all values of %x, %s is 0.  This means
the
>>> program is well-defined when %x is poison (since you don't need
to
>>> look at %x to determine the value of %addr, in the same sense as
you
>>> don't need to look at X to determine the value of "and X,
0"); and it
>>> stores 42 to &(%some_global)[0].  Specifically, the above
program is
>>> well defined for "%m = %n = 2^32-1".
>>>
>>> Now if we do the usual transform of "zext (add nuw X Y)"
=> "add nuw
>>> (zext X) (zext Y)" then we get
>>>
>>>   %m.wide = zext i32 %m to i64
>>>   %n.wide = zext i32 %n to i64
>>>   %z = add nuw i64 %m.wide, %n.wide
>>>   %s = lshr i64 %y, 32
>>>   %addr = gep %some_global, %s
>>>   store i32 42, i32* %addr
>>>
>>> The new program does *not* have the same behavior as the old
program
>>> for "%m = %n = 2^32-1".  We have changed the behavior of
a
>>> well-defined program by doing the "zext (add nuw X Y)"
=> "add nuw
>>> (zext X) (zext Y)" transform.
>>>
>>
>> After some pondering and combing through LLVM's implementation, I
think
>> we must conclude that zexting a value with any poison bits creates
poison
>> in every new bit.
>>
>> Considering the following program:
>>
>> %zext = zext i32 %x to i64
>> %icmp = icmp i64 %zext, i64 1
>>
>> we'd like to transform this to:
>>
>> %icmp = icmp i32 %x, i32 1
>>
>> Is it reasonable to say that '%icmp' in the before case is not
poison but
>> '%icmp' in the after case is poison?  LLVM assumes it can
remove casts with
>> impunity, I think this is a useful property to maintain.
>>
>
> FWIW, I agree with your statement.
>
> Here is the line of reasoning that I find troubling.
>
> If we accept the above, we have a surprising result (using small bit-width
> integers to make it easier to read)
>
> %zext = zext i1 %x to i2
> %and = and i2 %zext, 1
>
> We cannot replace %and with %zext because the %and might be removing
> poison.
>
> Perhaps this restriction is OK though. I just find it somewhat troubling.
>
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20150201/323e5a03/attachment.html>

David Majnemer

2015-Feb-01 11:20 UTC

head link

[LLVMdev] RFC: Proposal for Poison Semantics

Let's consider Sanjoy's earlier example,

We start with the following program:

%x = add nuw i32 %m, %n
%y = zext i32 %x to i64
%s = lshr i64 %y, 32

If %m and %n are 2**32 - 1, then using your model: %x is entirely poison,
%y's top 32-bits would be zero and bottom 32-bits are poison and %s would
be enitrely zero.

This would mean that we couldn't transform the program into:

%m.wide = zext i32 %m to i64
%n.wide = zext i32 %n to i64
%z = add nuw i64 %m.wide, %n.wide
%s = lshr i64 %z, 32

Because we would have: %m.wide and %n.wide are 2**32 - 1, %z is 2**33 - 2,
%s is 1

That being said, we don't perform this transform today and I don't see
why
we would want to.  I'd happily give up this flexibility if it meant that we
could keep everything else.

On Sun, Feb 1, 2015 at 2:44 AM, Bruce Hoult <bruce at hoult.org> wrote:
> I don't know how things work at the moment, but it seems to me that you
> can do lots of sensible things, and avoid lots of silly things, if you keep
> track of four possible values for each bit:
>
> - undef (the default)
> - poison
> - known to be 0
> - known to be 1
>
> This makes both David's and Chandler's examples work nicely if you
assume:
>
> - ZEXT makes all the new bits known 0
> - SEXT makes all the new bits the same as the high bit
> - AND clears unknown and poison bits to known 0 if the other input is
> known 0
> - OR sets unknown and poison bits to known 1 if the other input is known 1
>
> Also things such as ZEXTing a poison i32 to i64 and then right shifting by
> 32 will result in all known 0 bits.
>
>
> On Sun, Feb 1, 2015 at 11:19 PM, Chandler Carruth <chandlerc at
google.com>
> wrote:
>
>>
>> On Sun, Feb 1, 2015 at 1:57 AM, David Majnemer <david.majnemer at
gmail.com>
>> wrote:
>>
>>>
>>> On Tue, Jan 27, 2015 at 8:58 PM, Sanjoy Das <
>>> sanjoy at playingwithpointers.com> wrote:
>>>
>>>> > Ah, yes.  You are right, we cannot always assume that %y
would be
>>>> zero in
>>>> > the second case.
>>>> > This wouldn't be the first time we've lost
information that we could
>>>> use to
>>>> > optimize a program by transforming it.
>>>> >
>>>> > Do you think this result would be problematic?  It seems
consistent
>>>> with the
>>>> > RFC and LLVM's current behavior.
>>>> >
>>>>
>>>> The problem is not that we're losing information, the
problem is that
>>>> we're changing the behavior of a well-defined program.
>>>>
>>>> I'll try to put the whole argument in one place:
>>>>
>>>> We start with
>>>>
>>>>   %x = add nuw i32 %m, %n
>>>>   %y = zext i32 %x to i64
>>>>   %s = lshr i64 %y, 32
>>>>   %addr = gep %some_global, %s
>>>>   store i32 42, i32* %addr
>>>>
>>>> In the above program, for all values of %x, %s is 0.  This
means the
>>>> program is well-defined when %x is poison (since you don't
need to
>>>> look at %x to determine the value of %addr, in the same sense
as you
>>>> don't need to look at X to determine the value of "and
X, 0"); and it
>>>> stores 42 to &(%some_global)[0].  Specifically, the above
program is
>>>> well defined for "%m = %n = 2^32-1".
>>>>
>>>> Now if we do the usual transform of "zext (add nuw X
Y)" => "add nuw
>>>> (zext X) (zext Y)" then we get
>>>>
>>>>   %m.wide = zext i32 %m to i64
>>>>   %n.wide = zext i32 %n to i64
>>>>   %z = add nuw i64 %m.wide, %n.wide
>>>>   %s = lshr i64 %y, 32
>>>>   %addr = gep %some_global, %s
>>>>   store i32 42, i32* %addr
>>>>
>>>> The new program does *not* have the same behavior as the old
program
>>>> for "%m = %n = 2^32-1".  We have changed the behavior
of a
>>>> well-defined program by doing the "zext (add nuw X
Y)" => "add nuw
>>>> (zext X) (zext Y)" transform.
>>>>
>>>
>>> After some pondering and combing through LLVM's implementation,
I think
>>> we must conclude that zexting a value with any poison bits creates
poison
>>> in every new bit.
>>>
>>> Considering the following program:
>>>
>>> %zext = zext i32 %x to i64
>>> %icmp = icmp i64 %zext, i64 1
>>>
>>> we'd like to transform this to:
>>>
>>> %icmp = icmp i32 %x, i32 1
>>>
>>> Is it reasonable to say that '%icmp' in the before case is
not poison
>>> but '%icmp' in the after case is poison?  LLVM assumes it
can remove casts
>>> with impunity, I think this is a useful property to maintain.
>>>
>>
>> FWIW, I agree with your statement.
>>
>> Here is the line of reasoning that I find troubling.
>>
>> If we accept the above, we have a surprising result (using small
>> bit-width integers to make it easier to read)
>>
>> %zext = zext i1 %x to i2
>> %and = and i2 %zext, 1
>>
>> We cannot replace %and with %zext because the %and might be removing
>> poison.
>>
>> Perhaps this restriction is OK though. I just find it somewhat
troubling.
>>
>> _______________________________________________
>> LLVM Developers mailing list
>> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
>> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>
>>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20150201/9200055a/attachment.html>

Sanjoy Das

2015-Feb-01 20:58 UTC

head link

[LLVMdev] RFC: Proposal for Poison Semantics

> That being said, we don't perform this transform today and I don't
see why
> we would want to.  I'd happily give up this flexibility if it meant
that we
> could keep everything else.
I was under the impression that "zext(add nuw X Y) == add nuw zext(X),
zext(Y)" is the very definition of nuw (i.e. all properties of nuw/nsw
are supposed to
follow from that).  If we wish to go ahead without that equivalence,
then I agree we probably don't need the higher bits to be poison when
zexting poison.

IOW, the higher bits of the zext of poison needs to be poison because
we want the "zext(add nuw X Y) == add nuw zext(X), zext(Y)" transform
to not change the meaning of a program.  Hence any program whose
meaning would change because of this transform needs to have UB to
start with.

This is also the root of the issue Chandler pointed out -- "(zext X)"
and "(and 0x7f (zext X))" are _not_ the same value because a
non-trivial use of the first value induces a dependence on the bits
that may change with the "zext(add nuw X Y) == add nuw zext(X),
zext(Y)" transform while a non-trivial use of the second value does
not.

-- Sanjoy

Sanjoy Das

2015-Feb-01 21:06 UTC

head link

[LLVMdev] RFC: Proposal for Poison Semantics

> I don't know how things work at the moment, but it seems to me that you
can
> do lots of sensible things, and avoid lots of silly things, if you keep
> track of four possible values for each bit:
>
> - undef (the default)
> - poison
> - known to be 0
> - known to be 1
I'm not clear what you mean by "known to be X", but assuming you
mean
"known" in the same sense of computeKnownBits, then I don't think
such
a distinction will lead to intuitive semantics.  If the semantics of a
program depended on what the compiler can prove about the program,
then the meaning of a program will change based on how smart the
compiler is (i.e. does the /compiler/ know that `add i32 10, 9` is
`i32 19`?).  As a concrete example, outlining a function to a separate
module will no longer be a meaning preserving transform because then a
bit could go from "known to be 0" to "nothing is known".

Sorry for the tangent, in case you meant something completely
different by "known". :)

-- Sanjoy

Bruce Hoult

2015-Feb-02 01:09 UTC

head link

[LLVMdev] RFC: Proposal for Poison Semantics

I don't see how it changes the meaning of the program whether the compiler
notices that two variables will always be 10 and 9 and does a compile time
evaluation of 10 + 9 = 19, or generates an add instruction that turns out
to  calculate the same result at runtime?

If you outline a function as you suggest, and then as a result don't know
the return value, it just means you generate actual instructions to use
that result as the programmer directed, rather than compile time
propagating a constant 19 to whatever follows.

On Mon, Feb 2, 2015 at 10:06 AM, Sanjoy Das <sanjoy at
playingwithpointers.com>
wrote:
> > I don't know how things work at the moment, but it seems to me
that you
> can
> > do lots of sensible things, and avoid lots of silly things, if you
keep
> > track of four possible values for each bit:
> >
> > - undef (the default)
> > - poison
> > - known to be 0
> > - known to be 1
>
> I'm not clear what you mean by "known to be X", but assuming
you mean
> "known" in the same sense of computeKnownBits, then I don't
think such
> a distinction will lead to intuitive semantics.  If the semantics of a
> program depended on what the compiler can prove about the program,
> then the meaning of a program will change based on how smart the
> compiler is (i.e. does the /compiler/ know that `add i32 10, 9` is
> `i32 19`?).  As a concrete example, outlining a function to a separate
> module will no longer be a meaning preserving transform because then a
> bit could go from "known to be 0" to "nothing is
known".
>
> Sorry for the tangent, in case you meant something completely
> different by "known". :)
>
> -- Sanjoy
>
> --
> This message has been scanned for viruses and
> dangerous content by MailScanner, and is
> believed to be clean.
>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20150202/fae940ba/attachment.html>

llvm dev - Feb 2015 - [LLVMdev] RFC: Proposal for Poison Semantics

[LLVMdev] RFC: Proposal for Poison Semantics

[LLVMdev] RFC: Proposal for Poison Semantics

[LLVMdev] RFC: Proposal for Poison Semantics

[LLVMdev] RFC: Proposal for Poison Semantics

[LLVMdev] RFC: Proposal for Poison Semantics