thr3ads.net - llvm dev - [LLVMdev] Dependence Analysis [was: Flow-Sensitive AA] [Aug 2008]

If this information is useful, please help other people find it:
Share via:

Chris Lattner

2008-Aug-22 16:34 UTC

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

On Aug 22, 2008, at 9:30 AM, Vikram S. Adve wrote:
> In the general case, I think you have to be conservative about this
> because programmers may deliberately want this kind of
"wraparound"
> behavior, e.g., with periodic boundary conditions.  But 99.9% of
> programs probably don't need that so it would be bad to penalize them
> for this corner case.  In such a situation, I think you just have to
> support both choices, but choose the default as sensibly as possible.
> I discussed this with Matthieu and this is what we came up with:
C has a way to express this: signed integers are defined to never  
overflow, unsigned integers are defined to wrap gracefully on  
overflow.  Other languages that target this may want some sort of  
command line option to control the behavior or the language may define  
it one way or the other.

We don't model this in LLVM IR, but this is a desired feature if  
anyone is interested.  Capturing this in the IR is a better way to go  
than having the optimizers have a big global knob.

-Chris
>
>
> 1. (Eventually) Support both choices: conservative or optimistic.
> Conservative means you assume that index arithmetic can wrap around;
> optimistic assumes it cannot.  What I think you've implemented is the
> optimistic case.
>
> 2. Have a default choice but give the programmer an option to force
> this on or off.
>
> 3. (Matthieu's idea) Default to optimistic if the index expressions
> are i32, but conservative if i8 or i16.  This assumes that there is no
> good reason to use i8 or i16 index variables except for wraparound
> behavior (and it is highly unlikely that programmers are using arrays
> of size 2^32 *and* requiring wraparound behavior for that).
>
> Of course, for now, we don't have to implement the conservative
> version: what you've done should be good enough.
>
> --Vikram
> Associate Professor, Computer Science
> University of Illinois at Urbana-Champaign
> http://llvm.org/~vadve
>
>
>
>
> On Aug 22, 2008, at 9:41 AM, matthieu at illinois.edu wrote:
>
>>
>>
>>
>>> However, there is one issue I have ignored - possibility of
>>> overflow in
>>> the index expression. Suppose, we have such a loop:
>>> for (i8 i = 0; i != 200; ++i) {
>>>  A[2 * i + 5] = ...
>>>  ... = A[2 * i + 3]
>>> }
>>> If both index expressions are evaluated in 8-bit arithmetic,
>>> then the dependence equation should be solved in modular
arithmetic:
>>> 2 * i + 5 == 2 * (i + delta) + 3 (mod 256)
>>> So the dependence distance is delta == 1 or delta == -127 what
gives
>>> two direction vectors: (<), (>). My version will produce only
the
>>> first
>>> one. Taking into account overflow issue during dependence analysis
>>> seems
>>> very difficult to me. Does anybody have some experience in this  
>>> area?
>>> Fortunately, programmers generally do not write programs in such a
>>> nasty way.
>>
>> I asked myself the same question. Without mod, how do you ensure
>> that for instance the expression 2*i+255 was not actually 2*i-1 ?
>>
>>
>
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev

Dale Johannesen

2008-Aug-22 16:53 UTC

head link

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

On Aug 22, 2008, at 9:34 AMPDT, Chris Lattner wrote:
>
> On Aug 22, 2008, at 9:30 AM, Vikram S. Adve wrote:
>
>> In the general case, I think you have to be conservative about this
>> because programmers may deliberately want this kind of
"wraparound"
>> behavior, e.g., with periodic boundary conditions.  But 99.9% of
>> programs probably don't need that so it would be bad to penalize
them
>> for this corner case.  In such a situation, I think you just have to
>> support both choices, but choose the default as sensibly as possible.
>> I discussed this with Matthieu and this is what we came up with:
>
> C has a way to express this: signed integers are defined to never
> overflow,
More precisely, signed integer overflow is undefined behavior, which  
gives the compiler great latitude.
Assuming this will never happen and doing optimizations on that basis  
is valid, but so are other things.
An easy implementation, which often matches user expectations because  
it is what most hardware does,
is to treat signed and unsigned overflow alike, with clean wraparound.
> unsigned integers are defined to wrap gracefully on
> overflow.  Other languages that target this may want some sort of
> command line option to control the behavior or the language may define
> it one way or the other.
>
> We don't model this in LLVM IR, but this is a desired feature if
> anyone is interested.  Capturing this in the IR is a better way to go
> than having the optimizers have a big global knob.
>
> -Chris
>
>>
>>
>> 1. (Eventually) Support both choices: conservative or optimistic.
>> Conservative means you assume that index arithmetic can wrap around;
>> optimistic assumes it cannot.  What I think you've implemented is
the
>> optimistic case.
>>
>> 2. Have a default choice but give the programmer an option to force
>> this on or off.
>>
>> 3. (Matthieu's idea) Default to optimistic if the index expressions
>> are i32, but conservative if i8 or i16.  This assumes that there is  
>> no
>> good reason to use i8 or i16 index variables except for wraparound
>> behavior (and it is highly unlikely that programmers are using arrays
>> of size 2^32 *and* requiring wraparound behavior for that).
>>
>> Of course, for now, we don't have to implement the conservative
>> version: what you've done should be good enough.
>>
>> --Vikram
>> Associate Professor, Computer Science
>> University of Illinois at Urbana-Champaign
>> http://llvm.org/~vadve
>>
>>
>>
>>
>> On Aug 22, 2008, at 9:41 AM, matthieu at illinois.edu wrote:
>>
>>>
>>>
>>>
>>>> However, there is one issue I have ignored - possibility of
>>>> overflow in
>>>> the index expression. Suppose, we have such a loop:
>>>> for (i8 i = 0; i != 200; ++i) {
>>>> A[2 * i + 5] = ...
>>>> ... = A[2 * i + 3]
>>>> }
>>>> If both index expressions are evaluated in 8-bit arithmetic,
>>>> then the dependence equation should be solved in modular  
>>>> arithmetic:
>>>> 2 * i + 5 == 2 * (i + delta) + 3 (mod 256)
>>>> So the dependence distance is delta == 1 or delta == -127 what
>>>> gives
>>>> two direction vectors: (<), (>). My version will produce
only the
>>>> first
>>>> one. Taking into account overflow issue during dependence
analysis
>>>> seems
>>>> very difficult to me. Does anybody have some experience in this
>>>> area?
>>>> Fortunately, programmers generally do not write programs in
such a
>>>> nasty way.
>>>
>>> I asked myself the same question. Without mod, how do you ensure
>>> that for instance the expression 2*i+255 was not actually 2*i-1 ?
>>>
>>>
>>
>> _______________________________________________
>> LLVM Developers mailing list
>> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
>> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev

Vikram S. Adve

2008-Aug-22 17:34 UTC

head link

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

On Aug 22, 2008, at 11:53 AM, Dale Johannesen wrote:
>
> On Aug 22, 2008, at 9:34 AMPDT, Chris Lattner wrote:
>
>>
>> On Aug 22, 2008, at 9:30 AM, Vikram S. Adve wrote:
>>
>>> In the general case, I think you have to be conservative about this
>>> because programmers may deliberately want this kind of
"wraparound"
>>> behavior, e.g., with periodic boundary conditions.  But 99.9% of
>>> programs probably don't need that so it would be bad to
penalize
>>> them
>>> for this corner case.  In such a situation, I think you just have
to
>>> support both choices, but choose the default as sensibly as  
>>> possible.
>>> I discussed this with Matthieu and this is what we came up with:
>>
>> C has a way to express this: signed integers are defined to never
>> overflow,
Ok, I thought they both had the same wraparound behavior: thanks for  
clarifying that.


>> More precisely, signed integer overflow is undefined behavior, which
> gives the compiler great latitude.
> Assuming this will never happen and doing optimizations on that basis
> is valid, but so are other things.
> An easy implementation, which often matches user expectations because
> it is what most hardware does,
> is to treat signed and unsigned overflow alike, with clean wraparound.

That makes sense.  But we would have to distinguish them during  
dependence analysis at least, if we want to use the solution Chris  
suggested.  Which brings me to a question:

>> unsigned integers are defined to wrap gracefully on
>> overflow.  Other languages that target this may want some sort of
>> command line option to control the behavior or the language may  
>> define
>> it one way or the other.
>>
>> We don't model this in LLVM IR, but this is a desired feature if
>> anyone is interested.  Capturing this in the IR is a better way to go
>> than having the optimizers have a big global knob.

But signed and unsigned values are not distinguished any more in the  
LLVM IR, and I don't think you're suggesting we restore unsigned  
types.  What do you have in mind here?

--Vikram

Mike Stump

2008-Aug-22 20:50 UTC

head link

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

On Aug 22, 2008, at 9:34 AM, Chris Lattner wrote:> C has a way to express this: signed integers are defined to never
> overflow, unsigned integers are defined to wrap gracefully on
> overflow.
And gcc has yet more fun in it:

-fstrict-overflow
Allow the compiler to assume strict signed overflow rules,
depending on the language
being compiled. For C (and C++) this means that overflow
when doing arithmetic with
signed numbers is undefined, which means that the compiler
may assume that it will
not happen. This permits various optimizations. For
example, the compiler will
assume that an expression like "i + 10 > i" will always
be
true for signed "i". This
assumption is only valid if signed overflow is undefined,
as the expression is false
if "i + 10" overflows when using twos complement
arithmetic. When this option is in
effect any attempt to determine whether an operation on
signed numbers will overflow
must be written carefully to not actually involve overflow.

See also the -fwrapv option. Using -fwrapv means that
signed overflow is fully
defined: it wraps. When -fwrapv is used, there is no
difference between
-fstrict-overflow and -fno-strict-overflow. With -fwrapv
certain types of overflow
are permitted. For example, if the compiler gets an
overflow when doing arithmetic
on constants, the overflowed value can still be used with -
fwrapv, but not otherwise.

The -fstrict-overflow option is enabled at levels -O2, -
O3, -Os.

-ftrapv
This option generates traps for signed overflow on
addition, subtraction,
multiplication operations.

-fwrapv
This option instructs the compiler to assume that signed
arithmetic overflow of
addition, subtraction and multiplication wraps around
using twos-complement
representation. This flag enables some optimizations and
disables others. This
option is enabled by default for the Java front-end, as
required by the Java language
specification.

Vikram S. Adve

2008-Aug-22 21:03 UTC

head link

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

Thanks!  This is all very interesting, and tells me that LLVM has a  
way to go to fully support all of these capabilities (if that is the  
right thing to do, which isn't clear).  OTOH, it looks like a lot of  
real-world software that is using LLVM already doesn't seem to be  
affected by the lack of them.

Does anyone know of any C/C++ programs that require integer overflow  
on signed arithmetic (even though it is not strictly allowed by the  
standard)?

Also, does anyone know how -ftrapv is implemented on processors that  
don't have hardware detection of integer overflow?  It sounds very  
expensive to do entirely in software.

Thanks,

--Vikram
Associate Professor, Computer Science
University of Illinois at Urbana-Champaign
http://llvm.org/~vadve




On Aug 22, 2008, at 3:50 PM, Mike Stump wrote:
> On Aug 22, 2008, at 9:34 AM, Chris Lattner wrote:
>> C has a way to express this: signed integers are defined to never
>> overflow, unsigned integers are defined to wrap gracefully on
>> overflow.
>
> And gcc has yet more fun in it:
>
>        -fstrict-overflow
>            Allow the compiler to assume strict signed overflow rules,
> depending on the language
>            being compiled.  For C (and C++) this means that overflow
> when doing arithmetic with
>            signed numbers is undefined, which means that the compiler
> may assume that it will
>            not happen.  This permits various optimizations.  For
> example, the compiler will
>            assume that an expression like "i + 10 > i" will
always be
> true for signed "i".  This
>            assumption is only valid if signed overflow is undefined,
> as the expression is false
>            if "i + 10" overflows when using twos complement
> arithmetic.  When this option is in
>            effect any attempt to determine whether an operation on
> signed numbers will overflow
>            must be written carefully to not actually involve overflow.
>
>            See also the -fwrapv option.  Using -fwrapv means that
> signed overflow is fully
>            defined: it wraps.  When -fwrapv is used, there is no
> difference between
>            -fstrict-overflow and -fno-strict-overflow.  With -fwrapv
> certain types of overflow
>            are permitted.  For example, if the compiler gets an
> overflow when doing arithmetic
>            on constants, the overflowed value can still be used with -
> fwrapv, but not otherwise.
>
>            The -fstrict-overflow option is enabled at levels -O2, -
> O3, -Os.
>
>        -ftrapv
>            This option generates traps for signed overflow on
> addition, subtraction,
>            multiplication operations.
>
>        -fwrapv
>            This option instructs the compiler to assume that signed
> arithmetic overflow of
>            addition, subtraction and multiplication wraps around
> using twos-complement
>            representation.  This flag enables some optimizations and
> disables others.  This
>            option is enabled by default for the Java front-end, as
> required by the Java language
>            specification.
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev

Chris Lattner

2008-Aug-22 21:12 UTC

head link

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

On Aug 22, 2008, at 9:53 AM, Dale Johannesen wrote:
>>
>> C has a way to express this: signed integers are defined to never
>> overflow,
>
> More precisely, signed integer overflow is undefined behavior, which
> gives the compiler great latitude.
> Assuming this will never happen and doing optimizations on that basis
> is valid, but so are other things.
> An easy implementation, which often matches user expectations because
> it is what most hardware does,
> is to treat signed and unsigned overflow alike, with clean wraparound.
In other words, our current implementation is correct.  However, we  
are missing the opportunity to optimize some things.  Trivial examples  
include some cases where you can't compute a simple loop count due to  
potential overflow.

-Chris

Apparently Analagous Threads

Search for more apparently analagous threads

llvm dev - Aug 2008 - [LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

[LLVMdev] Dependence Analysis [was: Flow-Sensitive AA]

Apparently Analagous Threads