thr3ads.net - llvm dev - [LLVMdev] [RFC] Invariants in LLVM [Jul 2014]

If this information is useful, please help other people find it:
Share via:

Hal Finkel

2014-Jul-18 03:19 UTC

[LLVMdev] [RFC] Invariants in LLVM

----- Original Message -----> From: "Philip Reames" <listmail at philipreames.com>
> To: "Hal Finkel" <hfinkel at anl.gov>, "LLVM Dev"
<llvmdev at cs.uiuc.edu>
> Cc: "Chandler Carruth" <chandlerc at gmail.com>,
"Andrew Trick" <atrick at apple.com>, "Richard Smith"
> <richard at metafoo.co.uk>, "Nick Lewycky" <nlewycky at
google.com>
> Sent: Thursday, July 17, 2014 4:31:10 PM
> Subject: Re: [RFC] Invariants in LLVM
> 
> 
> On 07/17/2014 01:39 PM, Hal Finkel wrote:
> > Hello everyone,
> >
> > I'm starting a new thread, as a follow-up to several older ones,
on
> > invariants in LLVM. Fundamentally, invariants are conditions that
> > the optimizer is allowed to assume will be valid during the
> > execution of the program. Many other compilers support a concept
> > like this, MSVC's __assume, for example. GCC has a builtin called
> > __builtin_assume_aligned which also neatly falls into this class.
> First of all, thank you for working on this!  I'm thrilled to see
> progress being made here.
> >
> > In my initial patches, I've named the associated IR-level
intrinsic
> > @llvm.invariant(i1) (maybe it should be @llvm.assume, etc. --
> > please feel free to express an opinion).
> My bikeshed preference would be "assume".  "invariant"
makes me think
> "loop invariant" - i.e. something which needs to be proven.
"assume"
> is
> a traditional name (in the verification world) for something which
> can
> be assumed true without verification and can result in invalid
> results
> if actually untrue.
That's two votes for 'assume' (you and Chandler from his code
review); anyone else?

 -Hal
> 
> >
> > I'll attempt to highlight the primary issue involved with a simple
> > example:
> >
> > int foo(int a, int b) {
> >    if (b < 0) {
> >      __builtin_assume(a > 5);
> >      return a > 3;
> >    } else {
> >      __builtin_assume((a & 3) == 0);
> >      return a & 1;
> >    }
> > }
> >
> > (with the current patches below, this simplifies as you'd expect)
> >
> > When simplifying the expression "a > 3" or "a &
1", how do you find
> > the applicable invariants? The invariants are not direct users of
> > the variables they affect, a in this case, but only use those
> > variables transitively (the invariant is a user of the (a > 5)
> > icmp, which is a user of 'a'). Also not all transitive
invariant
> > users of the variable apply at any particular point in the
> > execution. Specifically, when we call computeKnownBits on 'a'
in
> > the above example, I can't use the invariants without knowing
> > whether I want to know the answer at "return a > 3;" or
at "return
> > a & 1;".
> This path dependence property is important.  We need to make sure
> this
> is clearly spelled out in the documentation or it will lead to
> ongoing
> confusion.
> > Generally speaking, I need two things: The instruction specifying
> > the control-flow context, and a DominatorTree in order to
> > efficiently determine if a particular invariant applies at the
> > specified context instruction.
> >
> > As a result, my currently implementations contains:
> >
> >   1. Changes to several analysis passes, ValueTracking (and
> >   InstSimplify), InstCombine, LazyValueInfo, and ScalarEvolution
> >   in order to find and use invariants. Invariants are generally
> >   found by doing a depth-limited filtered recursive search of
> >   users of the Value being queried for a relevant invariant.
> >
> >   2. Changes to the APIs of those analysis passes to take optional
> >   context instructions and DominatorTree objects, and changes to
> >   almost all users of those analysis passed to provide that
> >   information when available (this obviously touches a fair amount
> >   of code).
> >
> >   3. Documentation changes, lowering, aliasing-analysis updates,
> >   etc.
> >
> > As with MSVC's __assume(0), which means 'unreachable',
I've done
> > the same with @llvm.invariant(false).
> >
> > Over the past several weeks, I've posted a number of patches
> > comprising a proposed implementation of an invariant intrinsic for
> > LLVM. I've listed them here in reverse order (they apply bottom
> > up).
> I'll be happy to help review once we settle on desired semantics. :)
> >
> > Patches:
> >
> > http://reviews.llvm.org/D149 - A patch for Clang that adds
> > __builtin_assume (and __assume in MS-compatibility mode) and GCC's
> > __builtin_assume_aligned.
> >
> > http://reviews.llvm.org/D4568 - Adds awareness of invariants to
> > ScalarEvolution (specifically when looking for loop guards)
> > http://reviews.llvm.org/D4567 - A small addition to InstCombine to
> > look for return values with all known bits
> > http://reviews.llvm.org/D4543 - Adds a bunch more patterns for
> > recognizing known bits from invariants in value tracking
> > http://reviews.llvm.org/D4511 - Adds invariant awareness to
> > LazyValueInfo (so invariants can be used by jump threading and
> > related optimizations)
> > http://reviews.llvm.org/D4491 - Adds basic canonicalization of the
> > invariant intrinsics themselves (splitting invariant(a && b),
> > etc.)
> > http://reviews.llvm.org/D181 - This adds a scalar-evolution-powered
> > pass, run late in the pipeline, to do more sophisticated updating
> > of pointer alignments based on invariants.
> > http://reviews.llvm.org/D4490 - An update to computeKnownBits and
> > other functions in ValueTracking, and almost all callers of these
> > functions, to use invariants to compute known bits of quantities
> > that applicable invariant conditions (this also handles updating
> > basic pointer alignments).
> > http://reviews.llvm.org/D180 - Adds "ephemeral" value
recognition
> > to the inliner and basic code metrics so that side-effect-free
> > instructions only contributing to invariants don't affect
> > inlining, loop unrolling, etc.
> > http://reviews.llvm.org/D178 - Adds the llvm.invariant intrinsic
> > and basic lowering properties.
> >
> > Some points for discussion [by Philip, edited slightly by me]:
> >
> >   1. The current patches remove llvm.invariant(undef) as dead.
> >   Would it be better to transform this into 'unreachable' as
is
> >   done for llvm.invariant(false).
> >
> >   2. Would adding a canonicalization of if(c) { unreachable } to
> >   llvm.invariant(c) would be worthwhile?
> A couple to add:
> 3. An "llvm.invariant" has zero code generation cost.  Given
that, a
> lot
> of pattern matching and profitability heuristics will need adjusted
> to
> ignore them.  I worry about the maintenance cost of this in the
> optimizer.  I can see a couple of possibilities here:
> - Canonicalize the placement of "llvm.invariants: at the end of each
> basic block.  This at least reduces the patterns to be matched.
> - Remove the invariant instructions entirely from the in flight IR.
> Use
> the "llvm.invariant" instruction only for serialization, but have
the
> conditions stored directly on the basic blocks when in memory.  This
> would reduce the pattern matching problem, but at cost of extra
> complexity.
> 
> 4. How will the presence of "llvm.invariants" effect things like
code
> placement?  I could see have lots of extra users (which are dropped
> late
> in the optimizer) causing somewhat surprising results. (i.e. we
> perform
> CSE, instruction gets placed differently, performance differs)
> 
> >
> > Finally, I'd like to mention that alternate schemes are certainly
> > possible:
> >
> >   1. We could 'hide' all invariants inside dominating blocks
> >   guarded by never-true conditional branches (this has the benefit
> >   of being able to express invariants using expressions that
> >   normally might have side effects, and could simplify the process
> >   of finding applicable invariants) [a suggestion by Richard and
> >   Nick].
> I'm not really clear on what this proposal would entail.  Could you
> spell out an example?  (If you think this is worth discussing
> further.)
> >
> >   2. We might want to have passes precompute the Value->(set of
> >   Invariants) map, and update it as necessary instead of doing
> >   transitive-user searches [a suggestion by Chandler].
> This seems like something to consider as a future enhancement.
> >
> >   3. I'm sure there is other design space :-)
> >
> > If you're interested in this functionality, please provide
> > feedback, use cases, etc. (and help review patches).
> Interested!
> >
> > Thanks in advance,
> > Hal
> Philip
> 
-- 
Hal Finkel
Assistant Computational Scientist
Leadership Computing Facility
Argonne National Laboratory

Andrew Trick

2014-Jul-18 03:45 UTC

head link

[LLVMdev] [RFC] Invariants in LLVM

> On Jul 17, 2014, at 8:19 PM, Hal Finkel <hfinkel at anl.gov> wrote:
> 
> ----- Original Message -----
>> From: "Philip Reames" <listmail at philipreames.com>
>> To: "Hal Finkel" <hfinkel at anl.gov>, "LLVM
Dev" <llvmdev at cs.uiuc.edu>
>> Cc: "Chandler Carruth" <chandlerc at gmail.com>,
"Andrew Trick" <atrick at apple.com>, "Richard Smith"
>> <richard at metafoo.co.uk>, "Nick Lewycky" <nlewycky
at google.com>
>> Sent: Thursday, July 17, 2014 4:31:10 PM
>> Subject: Re: [RFC] Invariants in LLVM
>> 
>> 
>> On 07/17/2014 01:39 PM, Hal Finkel wrote:
>>> Hello everyone,
>>> 
>>> I'm starting a new thread, as a follow-up to several older
ones, on
>>> invariants in LLVM. Fundamentally, invariants are conditions that
>>> the optimizer is allowed to assume will be valid during the
>>> execution of the program. Many other compilers support a concept
>>> like this, MSVC's __assume, for example. GCC has a builtin
called
>>> __builtin_assume_aligned which also neatly falls into this class.
>> First of all, thank you for working on this!  I'm thrilled to see
>> progress being made here.
>>> 
>>> In my initial patches, I've named the associated IR-level
intrinsic
>>> @llvm.invariant(i1) (maybe it should be @llvm.assume, etc. --
>>> please feel free to express an opinion).
>> My bikeshed preference would be "assume". 
"invariant" makes me think
>> "loop invariant" - i.e. something which needs to be proven.
"assume"
>> is
>> a traditional name (in the verification world) for something which
>> can
>> be assumed true without verification and can result in invalid
>> results
>> if actually untrue.
> 
> That's two votes for 'assume' (you and Chandler from his code
review); anyone else?
+1

-Andy
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20140717/6793746b/attachment.html>

Hal Finkel

2014-Jul-20 18:56 UTC

head link

[LLVMdev] [RFC] Invariants in LLVM

----- Original Message -----> From: "Andrew Trick" <atrick at apple.com>
> To: "Hal Finkel" <hfinkel at anl.gov>
> Cc: "Philip Reames" <listmail at philipreames.com>,
"Chandler Carruth" <chandlerc at gmail.com>, "Richard
Smith"
> <richard at metafoo.co.uk>, "Nick Lewycky" <nlewycky at
google.com>, "LLVM Dev" <llvmdev at cs.uiuc.edu>
> Sent: Thursday, July 17, 2014 10:45:42 PM
> Subject: Re: [RFC] Invariants in LLVM
> 
> 
> 
> 
> 
> 
> On Jul 17, 2014, at 8:19 PM, Hal Finkel < hfinkel at anl.gov > wrote:
> 
> ----- Original Message -----
> 
> 
> From: "Philip Reames" < listmail at philipreames.com >
> To: "Hal Finkel" < hfinkel at anl.gov >, "LLVM
Dev" <
> llvmdev at cs.uiuc.edu >
> Cc: "Chandler Carruth" < chandlerc at gmail.com >,
"Andrew Trick" <
> atrick at apple.com >, "Richard Smith"
> < richard at metafoo.co.uk >, "Nick Lewycky" < nlewycky
at google.com >
> Sent: Thursday, July 17, 2014 4:31:10 PM
> Subject: Re: [RFC] Invariants in LLVM
> 
> 
> On 07/17/2014 01:39 PM, Hal Finkel wrote:
> 
> 
> Hello everyone,
> 
> I'm starting a new thread, as a follow-up to several older ones, on
> invariants in LLVM. Fundamentally, invariants are conditions that
> the optimizer is allowed to assume will be valid during the
> execution of the program. Many other compilers support a concept
> like this, MSVC's __assume, for example. GCC has a builtin called
> __builtin_assume_aligned which also neatly falls into this class.
> First of all, thank you for working on this! I'm thrilled to see
> progress being made here.
> 
> 
> 
> In my initial patches, I've named the associated IR-level intrinsic
> @llvm.invariant(i1) (maybe it should be @llvm.assume, etc. --
> please feel free to express an opinion).
> My bikeshed preference would be "assume". "invariant"
makes me think
> "loop invariant" - i.e. something which needs to be proven.
"assume"
> is
> a traditional name (in the verification world) for something which
> can
> be assumed true without verification and can result in invalid
> results
> if actually untrue.
> 
> That's two votes for 'assume' (you and Chandler from his code
> review); anyone else?
> 
> +1
Alright, all votes have been for llvm.assume. Renaming in progress...

Thanks again,
Hal
> 
> 
> -Andy
-- 
Hal Finkel
Assistant Computational Scientist
Leadership Computing Facility
Argonne National Laboratory

llvm dev - Jul 2014 - [LLVMdev] [RFC] Invariants in LLVM

[LLVMdev] [RFC] Invariants in LLVM

[LLVMdev] [RFC] Invariants in LLVM

[LLVMdev] [RFC] Invariants in LLVM