thr3ads.net - llvm dev - [llvm-dev] Change in optimisation with UB in mind [Oct 2017]

If this information is useful, please help other people find it:
Share via:

ORiordan, Martin via llvm-dev

2017-Oct-03 10:37 UTC

[llvm-dev] Change in optimisation with UB in mind

Hi Sanjoy,

Yes these are C tests (from 'gcc.c-torture/execute'), and as I indicated
in my original message, the tests are not valid because the behaviour is
undefined.  However, it was while investigating these new failures in these
tests that I realised that this optimisation existed.

The optimisation itself is perfectly valid, but it does mean that integer
underflow will no longer be detected on systems which have hardware for this
purpose.

My question is not about the validity of the optimisation - it's perfectly
legitimate - but rather about whether a target can configure their TTI so as to
not perform this optimisation and fall-back to using ADD instead so as to
trigger integer underflow detection.

Thanks,

	MartinO

-----Original Message-----
From: Sanjoy Das [mailto:sanjoy at playingwithpointers.com] 
Sent: Tuesday, October 3, 2017 5:59 AM
To: ORiordan, Martin <martin.oriordan at intel.com>
Cc: llvm-dev <llvm-dev at lists.llvm.org>
Subject: Re: [llvm-dev] Change in optimisation with UB in mind

It isn't just about execution on the target machine -- transforming the Add
to an Or makes it easier to infer that the highest bit of the result is always
set later in the optimization pipeline.  Are the failing tests unit tests for a
C compiler?  If so, I'd say the test cases are broken, unless they're
specifically test that e.g. the compiler does not crash when compiling code like
this.

-- Sanjoy

On Fri, Sep 29, 2017 at 4:34 AM, ORiordan, Martin via llvm-dev <llvm-dev at
lists.llvm.org> wrote:> With LLVM v5.0, I found failures in some of the ‘gcc.c-torture/execute’
> tests due to a change in the optimisation where undefined behaviour is 
> involved.  The tests that fail are the ‘20040409-[123].c’ tests.
>
>
>
> The underlying failure is due to the optimisation of the following:
>
>
>
>    int test2(int x) { return x + INT_MIN; }
>
>
>
> from using an ADD instruction to using an OR instruction.  The 
> optimisation is entirely valid and will produce the correct result for 
> all correct values of ‘x’, but it introduces a change for 
> architectures that have support for detecting integer overflow/underflow
(signalling or quiet).
>
>
>
> For many systems the execution cost of ADD versus OR is the same, so 
> there is no real reason to choose one versus the other, and UB is UB either
way.
> However, it does impact detection of out-of-range integer arithmetic 
> for systems that have support for this.
>
>
>
> Is there a new trait in the TTI, or a call-back elsewhere that allows 
> the target to choose whether it wants this optimisation to use an ADD 
> or an OR (possibly in the cost-models)?  I would generally prefer to 
> use ADD versus OR if the cost of execution is the same.
>
>
>
> Thanks,
>
>
>
>     MartinO
>
>
>
> --------------------------------------------------------------
> Intel Research and Development Ireland Limited Registered in Ireland 
> Registered Office: Collinstown Industrial Park, Leixlip, County 
> Kildare Registered Number: 308263
>
> This e-mail and any attachments may contain confidential material for 
> the sole use of the intended recipient(s). Any review or distribution 
> by others is strictly prohibited. If you are not the intended 
> recipient, please contact the sender and delete all copies.
>
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>--------------------------------------------------------------
Intel Research and Development Ireland Limited
Registered in Ireland
Registered Office: Collinstown Industrial Park, Leixlip, County Kildare
Registered Number: 308263

This e-mail and any attachments may contain confidential material for the sole
use of the intended recipient(s). Any review or distribution by others is
strictly prohibited. If you are not the intended recipient, please contact the
sender and delete all copies.

David Chisnall via llvm-dev

2017-Oct-03 11:20 UTC

head link

[llvm-dev] Change in optimisation with UB in mind

On 3 Oct 2017, at 11:37, ORiordan, Martin via llvm-dev <llvm-dev at
lists.llvm.org> wrote:> 
> My question is not about the validity of the optimisation - it's
perfectly legitimate - but rather about whether a target can configure their TTI
so as to not perform this optimisation and fall-back to using ADD instead so as
to trigger integer underflow detection.
I don’t believe that this is the correct approach.  You are asking for behaviour
of integer overflow in C to be well-defined as trapping (which is a valid
implementation of undefined behaviour).  In LLVM, this would be represented as
using the overflow-checking add intrinsic followed by a branch to a trap in the
overflow case.  The back end should then lower this to a trapping add
instruction.  Clang already has -ftrapv that implements almost this behaviour.

David

ORiordan, Martin via llvm-dev

2017-Oct-03 13:43 UTC

head link

[llvm-dev] Change in optimisation with UB in mind

Yes, the hairy-edges of undefined behaviour - UB is UB.

It does mean that given:

  __attribute__((noinline)) int foo(int a, int b) { return a + b; }

  int bar1(int x) { return foo(x, INT_MIN); }
  int bar2(int x) { return x + INT_MIN; }

'bar1' and 'bar2' have different outcomes.

However, I think that the new optimisation is neat and valid and I am not
suggesting that it should be changed, but it would be useful if a target could
override this if they choose to do so.

	MartinO

-----Original Message-----
From: Dr D. Chisnall [mailto:dc552 at hermes.cam.ac.uk] On Behalf Of David
Chisnall
Sent: Tuesday, October 3, 2017 12:20 PM
To: ORiordan, Martin <martin.oriordan at intel.com>
Cc: Sanjoy Das <sanjoy at playingwithpointers.com>; llvm-dev <llvm-dev
at lists.llvm.org>
Subject: Re: [llvm-dev] Change in optimisation with UB in mind

On 3 Oct 2017, at 11:37, ORiordan, Martin via llvm-dev <llvm-dev at
lists.llvm.org> wrote:> 
> My question is not about the validity of the optimisation - it's
perfectly legitimate - but rather about whether a target can configure their TTI
so as to not perform this optimisation and fall-back to using ADD instead so as
to trigger integer underflow detection.
I don’t believe that this is the correct approach.  You are asking for behaviour
of integer overflow in C to be well-defined as trapping (which is a valid
implementation of undefined behaviour).  In LLVM, this would be represented as
using the overflow-checking add intrinsic followed by a branch to a trap in the
overflow case.  The back end should then lower this to a trapping add
instruction.  Clang already has -ftrapv that implements almost this behaviour.

David

--------------------------------------------------------------
Intel Research and Development Ireland Limited
Registered in Ireland
Registered Office: Collinstown Industrial Park, Leixlip, County Kildare
Registered Number: 308263


This e-mail and any attachments may contain confidential material for the sole
use of the intended recipient(s). Any review or distribution by others is
strictly prohibited. If you are not the intended recipient, please contact the
sender and delete all copies.

llvm dev - Oct 2017 - Change in optimisation with UB in mind

[llvm-dev] Change in optimisation with UB in mind

[llvm-dev] Change in optimisation with UB in mind

[llvm-dev] Change in optimisation with UB in mind