thr3ads.net - llvm dev - [llvm-dev] Speculative execution of FP divide Instructions

If this information is useful, please help other people find it:
Share via:

Hal Finkel via llvm-dev

2017-Mar-15 15:22 UTC

[llvm-dev] Speculative execution of FP divide Instructions - Call-Graph Simplify

[+current llvm-dev address]


On 03/15/2017 09:23 AM, Hal Finkel wrote:> Hi Samuel,
>
> What are you taking about? ;)
>
> The only way to get a SIGFPE from a floating-point division by zero is 
> to modify the floating-point environment to enable those exceptions.  
> We don't support that (*). In the default (supported) environment, 
> floating point division is well defined for all inputs (dividing by 0 
> gives inf, by NaN gives, NaN, etc.).
>
> Regarding whether it makes sense to speculate, that's clearly a 
> target-dependent property. On some targets this makes sense (e.g. OOO 
> cores where divides have high, by hideable, latency) and on some 
> targets it really doesn't. If we're speculating these on targets
where
> we shouldn't, then we need to fix the cost model.
>
> (*) There's ongoing work to change that. Search the mailing list for 
> Andrew Kaylor.
>
>  -Hal
>
> On 03/15/2017 04:38 AM, Samuel Antão wrote:
>> Hi all,
>>
>> I came across an issue caused by the Call-Graph Simplify Pass. Here 
>> is a a small repro:
>>
>> ```
>> define double @foo(double %a1, double %a2, double %a3) #0 {
>> entry:
>>   %a_mul = fmul double %a1, %a2
>>   %a_cmp = fcmp ogt double %a3, %a_mul
>>   br i1 %a_cmp, label %a.then, label %a.end
>>
>> a.then:
>>   %a_div = fdiv double %a_mul, %a3
>>   br label %a.end
>>
>> a.end:
>>   %a_factor = phi double [ %a_div, %a.then ], [ 1.000000e+00, %entry ]
>>   ret double %a_factor
>> }
>> ```
>> Here, the conditional is guarding a possible division by zero. 
>> However if I run CGSimplify on this I get:
>> ```
>> define double @foo(double %a1, double %a2, double %a3) 
>> local_unnamed_addr #0 {
>> entry:
>>   %a_mul = fmul double %a1, %a2
>>   %a_cmp = fcmp olt double %a_mul, %a3
>>   %a_div = fdiv double %a_mul, %a3
>>   %a_factor = select i1 %a_cmp, double %a_div, double 1.000000e+00
>>   ret double %a_factor
>> }
>> ```
>> This will cause a FP arithmetic exception, and the application will 
>> get a SIGFPE signal. The code that drives the change in the optimizer 
>> relies on `llvm::isSafeToSpeculativelyExecute` to decide whether the 
>> division should be performed speculatively. Right now, this function 
>> returns true. One could do something similar to integer divisions and 
>> add a case there (this solved the issue for me):
>> ```
>> diff --git a/lib/Analysis/ValueTracking.cpp 
>> b/lib/Analysis/ValueTracking.cpp
>> index 1761dac..c61fefd 100644
>> --- a/lib/Analysis/ValueTracking.cpp
>> +++ b/lib/Analysis/ValueTracking.cpp
>> @@ -3352,6 +3352,21 @@ bool llvm::isSafeToSpeculativelyExecute(const 
>> Value *V,
>>      // The numerator *might* be MinSignedValue.
>>      return false;
>>    }
>> +  case Instruction::FDiv:
>> +  case Instruction::FRem:{
>> +    const Value *Denominator = Inst->getOperand(1);
>> +    // x / y is undefined if y == 0
>> +    // The denominator is not a constant, so there is nothing we can 
>> do to prove
>> +    // it is non-zero.
>> +    if (auto *VV = dyn_cast<ConstantVector>(Denominator))
>> +      Denominator = VV->getSplatValue();
>> +    if (!isa<ConstantFP>(Denominator))
>> +      return false;
>> +    // The denominator is a zero constant - we can't speculate
here.
>> +    if (m_AnyZero().match(Denominator))
>> +      return false;
>> +    return true;
>> +  }
>>    case Instruction::Load: {
>>      const LoadInst *LI = cast<LoadInst>(Inst);
>>      if (!LI->isUnordered() ||
>> ```
>> I did my tests using a powerpc64le target, but I couldn't find any 
>> target specific login involved in this transform. In any case, I 
>> wanted to drop the questions:
>>
>> - is there any target that would benefit from speculative fp divisions?
>> - is there any target for which fp division does not have side effects?
>>
>> If not, I can go ahead and post the patch above for review.
>>
>> Many thanks!
>> Samuel
>
-- 
Hal Finkel
Lead, Compiler Technology and Programming Languages
Leadership Computing Facility
Argonne National Laboratory

Samuel Antão via llvm-dev

2017-Mar-15 17:41 UTC

head link

[llvm-dev] Speculative execution of FP divide Instructions - Call-Graph Simplify

Thank you all for the responses!

I understand that FP exceptions are activated programatically and can be
disabled. I guess my point is that those divisions can have side effects
and the compiler can't prove they have not. I imagine that, from a user
viewpoint, if I write code like:

double a, b, c
...
if (a > b)
  c = a/b

thinking that my conditional also guards against divisions by zero, and
activate the exceptions exactly to assert that, transformations that
speculate the execution of the divisions won't serve me well. The whole
purpose of having FP exception support just vanishes.

I guess that at the end of the day this is about choosing what is a
"good"
default behaviour. I don't feel strongly one way or the other, just thought
that this deserved some discussion.

I don't think this is what Andrew is trying to solve, is it? Exceptions can
be activated anywhere, not necessarily in the same compilation unit.

Thanks again!
Samuel

On Wed, Mar 15, 2017 at 3:22 PM, Hal Finkel <hfinkel at anl.gov> wrote:
> [+current llvm-dev address]
>
>
>
> On 03/15/2017 09:23 AM, Hal Finkel wrote:
>
>> Hi Samuel,
>>
>> What are you taking about? ;)
>>
>> The only way to get a SIGFPE from a floating-point division by zero is
to
>> modify the floating-point environment to enable those exceptions.  We
don't
>> support that (*). In the default (supported) environment, floating
point
>> division is well defined for all inputs (dividing by 0 gives inf, by
NaN
>> gives, NaN, etc.).
>>
>> Regarding whether it makes sense to speculate, that's clearly a
>> target-dependent property. On some targets this makes sense (e.g. OOO
cores
>> where divides have high, by hideable, latency) and on some targets it
>> really doesn't. If we're speculating these on targets where we
shouldn't,
>> then we need to fix the cost model.
>>
>> (*) There's ongoing work to change that. Search the mailing list
for
>> Andrew Kaylor.
>>
>>  -Hal
>>
>> On 03/15/2017 04:38 AM, Samuel Antão wrote:
>>
>>> Hi all,
>>>
>>> I came across an issue caused by the Call-Graph Simplify Pass. Here
is a
>>> a small repro:
>>>
>>> ```
>>> define double @foo(double %a1, double %a2, double %a3) #0 {
>>> entry:
>>>   %a_mul = fmul double %a1, %a2
>>>   %a_cmp = fcmp ogt double %a3, %a_mul
>>>   br i1 %a_cmp, label %a.then, label %a.end
>>>
>>> a.then:
>>>   %a_div = fdiv double %a_mul, %a3
>>>   br label %a.end
>>>
>>> a.end:
>>>   %a_factor = phi double [ %a_div, %a.then ], [ 1.000000e+00,
%entry ]
>>>   ret double %a_factor
>>> }
>>> ```
>>> Here, the conditional is guarding a possible division by zero.
However
>>> if I run CGSimplify on this I get:
>>> ```
>>> define double @foo(double %a1, double %a2, double %a3)
>>> local_unnamed_addr #0 {
>>> entry:
>>>   %a_mul = fmul double %a1, %a2
>>>   %a_cmp = fcmp olt double %a_mul, %a3
>>>   %a_div = fdiv double %a_mul, %a3
>>>   %a_factor = select i1 %a_cmp, double %a_div, double 1.000000e+00
>>>   ret double %a_factor
>>> }
>>> ```
>>> This will cause a FP arithmetic exception, and the application will
get
>>> a SIGFPE signal. The code that drives the change in the optimizer
relies on
>>> `llvm::isSafeToSpeculativelyExecute` to decide whether the division
>>> should be performed speculatively. Right now, this function returns
true.
>>> One could do something similar to integer divisions and add a case
there
>>> (this solved the issue for me):
>>> ```
>>> diff --git a/lib/Analysis/ValueTracking.cpp
>>> b/lib/Analysis/ValueTracking.cpp
>>> index 1761dac..c61fefd 100644
>>> --- a/lib/Analysis/ValueTracking.cpp
>>> +++ b/lib/Analysis/ValueTracking.cpp
>>> @@ -3352,6 +3352,21 @@ bool
llvm::isSafeToSpeculativelyExecute(const
>>> Value *V,
>>>      // The numerator *might* be MinSignedValue.
>>>      return false;
>>>    }
>>> +  case Instruction::FDiv:
>>> +  case Instruction::FRem:{
>>> +    const Value *Denominator = Inst->getOperand(1);
>>> +    // x / y is undefined if y == 0
>>> +    // The denominator is not a constant, so there is nothing we
can do
>>> to prove
>>> +    // it is non-zero.
>>> +    if (auto *VV = dyn_cast<ConstantVector>(Denominator))
>>> +      Denominator = VV->getSplatValue();
>>> +    if (!isa<ConstantFP>(Denominator))
>>> +      return false;
>>> +    // The denominator is a zero constant - we can't speculate
here.
>>> +    if (m_AnyZero().match(Denominator))
>>> +      return false;
>>> +    return true;
>>> +  }
>>>    case Instruction::Load: {
>>>      const LoadInst *LI = cast<LoadInst>(Inst);
>>>      if (!LI->isUnordered() ||
>>> ```
>>> I did my tests using a powerpc64le target, but I couldn't find
any
>>> target specific login involved in this transform. In any case, I
wanted to
>>> drop the questions:
>>>
>>> - is there any target that would benefit from speculative fp
divisions?
>>> - is there any target for which fp division does not have side
effects?
>>>
>>> If not, I can go ahead and post the patch above for review.
>>>
>>> Many thanks!
>>> Samuel
>>>
>>
>>
> --
> Hal Finkel
> Lead, Compiler Technology and Programming Languages
> Leadership Computing Facility
> Argonne National Laboratory
>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20170315/91505b50/attachment.html>

Flamedoge via llvm-dev

2017-Mar-15 18:18 UTC

head link

[llvm-dev] Speculative execution of FP divide Instructions - Call-Graph Simplify

I'd imagine it's very difficult for compiler to track FP exception
control
(unless it actually inserts fegetenv calls to get control bits at runtime
:/).
I think Hal was pointing out that straight-lining fp division is very rare
thing given that it is very expensive in many target architectures. It
probably means that cost model is whacky somewhere.

Kevin

On Wed, Mar 15, 2017 at 10:41 AM, Samuel Antão via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> Thank you all for the responses!
>
> I understand that FP exceptions are activated programatically and can be
> disabled. I guess my point is that those divisions can have side effects
> and the compiler can't prove they have not. I imagine that, from a user
> viewpoint, if I write code like:
>
> double a, b, c
> ...
> if (a > b)
>   c = a/b
>
> thinking that my conditional also guards against divisions by zero, and
> activate the exceptions exactly to assert that, transformations that
> speculate the execution of the divisions won't serve me well. The whole
> purpose of having FP exception support just vanishes.
>
> I guess that at the end of the day this is about choosing what is a
"good"
> default behaviour. I don't feel strongly one way or the other, just
thought
> that this deserved some discussion.
>
> I don't think this is what Andrew is trying to solve, is it? Exceptions
> can be activated anywhere, not necessarily in the same compilation unit.
>
> Thanks again!
> Samuel
>
> On Wed, Mar 15, 2017 at 3:22 PM, Hal Finkel <hfinkel at anl.gov>
wrote:
>
>> [+current llvm-dev address]
>>
>>
>>
>> On 03/15/2017 09:23 AM, Hal Finkel wrote:
>>
>>> Hi Samuel,
>>>
>>> What are you taking about? ;)
>>>
>>> The only way to get a SIGFPE from a floating-point division by zero
is
>>> to modify the floating-point environment to enable those
exceptions.  We
>>> don't support that (*). In the default (supported) environment,
floating
>>> point division is well defined for all inputs (dividing by 0 gives
inf, by
>>> NaN gives, NaN, etc.).
>>>
>>> Regarding whether it makes sense to speculate, that's clearly a
>>> target-dependent property. On some targets this makes sense (e.g.
OOO cores
>>> where divides have high, by hideable, latency) and on some targets
it
>>> really doesn't. If we're speculating these on targets where
we shouldn't,
>>> then we need to fix the cost model.
>>>
>>> (*) There's ongoing work to change that. Search the mailing
list for
>>> Andrew Kaylor.
>>>
>>>  -Hal
>>>
>>> On 03/15/2017 04:38 AM, Samuel Antão wrote:
>>>
>>>> Hi all,
>>>>
>>>> I came across an issue caused by the Call-Graph Simplify Pass.
Here is
>>>> a a small repro:
>>>>
>>>> ```
>>>> define double @foo(double %a1, double %a2, double %a3) #0 {
>>>> entry:
>>>>   %a_mul = fmul double %a1, %a2
>>>>   %a_cmp = fcmp ogt double %a3, %a_mul
>>>>   br i1 %a_cmp, label %a.then, label %a.end
>>>>
>>>> a.then:
>>>>   %a_div = fdiv double %a_mul, %a3
>>>>   br label %a.end
>>>>
>>>> a.end:
>>>>   %a_factor = phi double [ %a_div, %a.then ], [ 1.000000e+00,
%entry ]
>>>>   ret double %a_factor
>>>> }
>>>> ```
>>>> Here, the conditional is guarding a possible division by zero.
However
>>>> if I run CGSimplify on this I get:
>>>> ```
>>>> define double @foo(double %a1, double %a2, double %a3)
>>>> local_unnamed_addr #0 {
>>>> entry:
>>>>   %a_mul = fmul double %a1, %a2
>>>>   %a_cmp = fcmp olt double %a_mul, %a3
>>>>   %a_div = fdiv double %a_mul, %a3
>>>>   %a_factor = select i1 %a_cmp, double %a_div, double
1.000000e+00
>>>>   ret double %a_factor
>>>> }
>>>> ```
>>>> This will cause a FP arithmetic exception, and the application
will get
>>>> a SIGFPE signal. The code that drives the change in the
optimizer relies on
>>>> `llvm::isSafeToSpeculativelyExecute` to decide whether the
division
>>>> should be performed speculatively. Right now, this function
returns true.
>>>> One could do something similar to integer divisions and add a
case there
>>>> (this solved the issue for me):
>>>> ```
>>>> diff --git a/lib/Analysis/ValueTracking.cpp
>>>> b/lib/Analysis/ValueTracking.cpp
>>>> index 1761dac..c61fefd 100644
>>>> --- a/lib/Analysis/ValueTracking.cpp
>>>> +++ b/lib/Analysis/ValueTracking.cpp
>>>> @@ -3352,6 +3352,21 @@ bool
llvm::isSafeToSpeculativelyExecute(const
>>>> Value *V,
>>>>      // The numerator *might* be MinSignedValue.
>>>>      return false;
>>>>    }
>>>> +  case Instruction::FDiv:
>>>> +  case Instruction::FRem:{
>>>> +    const Value *Denominator = Inst->getOperand(1);
>>>> +    // x / y is undefined if y == 0
>>>> +    // The denominator is not a constant, so there is nothing
we can
>>>> do to prove
>>>> +    // it is non-zero.
>>>> +    if (auto *VV =
dyn_cast<ConstantVector>(Denominator))
>>>> +      Denominator = VV->getSplatValue();
>>>> +    if (!isa<ConstantFP>(Denominator))
>>>> +      return false;
>>>> +    // The denominator is a zero constant - we can't
speculate here.
>>>> +    if (m_AnyZero().match(Denominator))
>>>> +      return false;
>>>> +    return true;
>>>> +  }
>>>>    case Instruction::Load: {
>>>>      const LoadInst *LI = cast<LoadInst>(Inst);
>>>>      if (!LI->isUnordered() ||
>>>> ```
>>>> I did my tests using a powerpc64le target, but I couldn't
find any
>>>> target specific login involved in this transform. In any case,
I wanted to
>>>> drop the questions:
>>>>
>>>> - is there any target that would benefit from speculative fp
divisions?
>>>> - is there any target for which fp division does not have side
effects?
>>>>
>>>> If not, I can go ahead and post the patch above for review.
>>>>
>>>> Many thanks!
>>>> Samuel
>>>>
>>>
>>>
>> --
>> Hal Finkel
>> Lead, Compiler Technology and Programming Languages
>> Leadership Computing Facility
>> Argonne National Laboratory
>>
>>
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20170315/feef59f7/attachment.html>

Hal Finkel via llvm-dev

2017-Mar-15 18:52 UTC

head link

[llvm-dev] Speculative execution of FP divide Instructions - Call-Graph Simplify

On 03/15/2017 12:41 PM, Samuel Antão wrote:> Thank you all for the responses!
>
> I understand that FP exceptions are activated programatically and can 
> be disabled.
The point is that they should be disabled by default. If you enable 
them, you need to tell the compiler you've done so (using a command-line 
option or #pragma STC FENV_ACCESS ON). Otherwise, the compiler gets to 
assume that they're off. We currently don't support anything else.
> I guess my point is that those divisions can have side effects and the 
> compiler can't prove they have not. I imagine that, from a user 
> viewpoint, if I write code like:
>
> double a, b, c
> ...
> if (a > b)
>   c = a/b
>
> thinking that my conditional also guards against divisions by zero, 
> and activate the exceptions exactly to assert that, transformations 
> that speculate the execution of the divisions won't serve me well. The 
> whole purpose of having FP exception support just vanishes.
When we support a mode where FP exceptions are enabled, we'll disable 
such transformations (as we're currently working on doing this, with 
special intrinsics for the FP operations in these modes, we get a lack 
of this kind of speculation naturally).
>
> I guess that at the end of the day this is about choosing what is a 
> "good" default behaviour. I don't feel strongly one way or
the other,
> just thought that this deserved some discussion.
>
> I don't think this is what Andrew is trying to solve, is it? 
> Exceptions can be activated anywhere, not necessarily in the same 
> compilation unit.
You should be able to use the FENV_ACCESS pragma to turn these on/off 
even within the same translation unit.

  -Hal
>
> Thanks again!
> Samuel
>
> On Wed, Mar 15, 2017 at 3:22 PM, Hal Finkel <hfinkel at anl.gov 
> <mailto:hfinkel at anl.gov>> wrote:
>
>     [+current llvm-dev address]
>
>
>
>     On 03/15/2017 09:23 AM, Hal Finkel wrote:
>
>         Hi Samuel,
>
>         What are you taking about? ;)
>
>         The only way to get a SIGFPE from a floating-point division by
>         zero is to modify the floating-point environment to enable
>         those exceptions.  We don't support that (*). In the default
>         (supported) environment, floating point division is well
>         defined for all inputs (dividing by 0 gives inf, by NaN gives,
>         NaN, etc.).
>
>         Regarding whether it makes sense to speculate, that's clearly
>         a target-dependent property. On some targets this makes sense
>         (e.g. OOO cores where divides have high, by hideable, latency)
>         and on some targets it really doesn't. If we're speculating
>         these on targets where we shouldn't, then we need to fix the
>         cost model.
>
>         (*) There's ongoing work to change that. Search the mailing
>         list for Andrew Kaylor.
>
>          -Hal
>
>         On 03/15/2017 04:38 AM, Samuel Antão wrote:
>
>             Hi all,
>
>             I came across an issue caused by the Call-Graph Simplify
>             Pass. Here is a a small repro:
>
>             ```
>             define double @foo(double %a1, double %a2, double %a3) #0 {
>             entry:
>               %a_mul = fmul double %a1, %a2
>               %a_cmp = fcmp ogt double %a3, %a_mul
>               br i1 %a_cmp, label %a.then, label %a.end
>
>             a.then:
>               %a_div = fdiv double %a_mul, %a3
>               br label %a.end
>
>             a.end:
>               %a_factor = phi double [ %a_div, %a.then ], [
>             1.000000e+00, %entry ]
>               ret double %a_factor
>             }
>             ```
>             Here, the conditional is guarding a possible division by
>             zero. However if I run CGSimplify on this I get:
>             ```
>             define double @foo(double %a1, double %a2, double %a3)
>             local_unnamed_addr #0 {
>             entry:
>               %a_mul = fmul double %a1, %a2
>               %a_cmp = fcmp olt double %a_mul, %a3
>               %a_div = fdiv double %a_mul, %a3
>               %a_factor = select i1 %a_cmp, double %a_div, double
>             1.000000e+00
>               ret double %a_factor
>             }
>             ```
>             This will cause a FP arithmetic exception, and the
>             application will get a SIGFPE signal. The code that drives
>             the change in the optimizer relies on
>             `llvm::isSafeToSpeculativelyExecute` to decide whether the
>             division should be performed speculatively. Right now,
>             this function returns true. One could do something similar
>             to integer divisions and add a case there (this solved the
>             issue for me):
>             ```
>             diff --git a/lib/Analysis/ValueTracking.cpp
>             b/lib/Analysis/ValueTracking.cpp
>             index 1761dac..c61fefd 100644
>             --- a/lib/Analysis/ValueTracking.cpp
>             +++ b/lib/Analysis/ValueTracking.cpp
>             @@ -3352,6 +3352,21 @@ bool
>             llvm::isSafeToSpeculativelyExecute(const Value *V,
>                  // The numerator *might* be MinSignedValue.
>                  return false;
>                }
>             +  case Instruction::FDiv:
>             +  case Instruction::FRem:{
>             +    const Value *Denominator = Inst->getOperand(1);
>             +    // x / y is undefined if y == 0
>             +    // The denominator is not a constant, so there is
>             nothing we can do to prove
>             +    // it is non-zero.
>             +    if (auto *VV =
dyn_cast<ConstantVector>(Denominator))
>             +      Denominator = VV->getSplatValue();
>             +    if (!isa<ConstantFP>(Denominator))
>             +      return false;
>             +    // The denominator is a zero constant - we can't
>             speculate here.
>             +    if (m_AnyZero().match(Denominator))
>             +      return false;
>             +    return true;
>             +  }
>                case Instruction::Load: {
>                  const LoadInst *LI = cast<LoadInst>(Inst);
>                  if (!LI->isUnordered() ||
>             ```
>             I did my tests using a powerpc64le target, but I couldn't
>             find any target specific login involved in this transform.
>             In any case, I wanted to drop the questions:
>
>             - is there any target that would benefit from speculative
>             fp divisions?
>             - is there any target for which fp division does not have
>             side effects?
>
>             If not, I can go ahead and post the patch above for review.
>
>             Many thanks!
>             Samuel
>
>
>
>     -- 
>     Hal Finkel
>     Lead, Compiler Technology and Programming Languages
>     Leadership Computing Facility
>     Argonne National Laboratory
>
>
-- 
Hal Finkel
Lead, Compiler Technology and Programming Languages
Leadership Computing Facility
Argonne National Laboratory

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20170315/b0442dea/attachment.html>

Seemingly Similar Threads

Search for more maybe matching threads

llvm dev - Mar 2017 - Speculative execution of FP divide Instructions - Call-Graph Simplify

[llvm-dev] Speculative execution of FP divide Instructions - Call-Graph Simplify

[llvm-dev] Speculative execution of FP divide Instructions - Call-Graph Simplify

[llvm-dev] Speculative execution of FP divide Instructions - Call-Graph Simplify

[llvm-dev] Speculative execution of FP divide Instructions - Call-Graph Simplify

Seemingly Similar Threads