On Sat, Feb 29, 2020 at 4:00 PM Nicholas Krause <xerofoify at gmail.com> wrote:
>
> On 2/29/20 6:17 PM, River Riddle via llvm-dev wrote:
>
> On Sat, Feb 29, 2020 at 2:25 PM David Blaikie via llvm-dev <llvm-dev at lists.llvm.org> wrote:
>
>> On Sat, Feb 29, 2020 at 2:19 PM Chris Lattner <clattner at nondot.org> wrote:
>>
>>> On Feb 29, 2020, at 2:08 PM, David Blaikie <dblaikie at gmail.com> wrote:
>>>
>>>> I'm curious as to how MLIR deals with IPO, as that's the problem I was running into.
>>>
>>> FWIW I believe LLVM's new pass manager (NPM) was designed with parallelism and the ability to support this situation (that MLIR doesn't? Or doesn't to the degree/way in which the NPM does). I'll leave it to folks (Chandler probably has the most context here) to provide some more detail there if they can/have time.
>>>
>>> Historically speaking, all of the LLVM pass managers have been designed to support multithreaded compilation (check out the ancient history of the WritingAnLLVMPass <http://llvm.org/docs/WritingAnLLVMPass.html> doc if curious).
>>
>> I think the specific thing that might've been a bit different in the NPM was to do with analysis invalidation in a way that's more parallelism-friendly than the previous one - but I may be misrepresenting/misunderstanding some of it.
>>
>>> The problem is that LLVM has global use-def chains on constants, functions and globals, etc, so it is impractical to do this. Every “inst->setOperand” would have to be able to take locks or use something like software transactional memory techniques in their implementation. This would be very complicated and very slow.
>>
>> Oh, yeah - I recall that particular limitation being discussed/not addressed as yet.
>>
>>> MLIR defines this away from the beginning. This is a result of the core IR design, not the pass manager design itself.
>>
>> What does MLIR do differently here/how does it define that issue away? (doesn't have use-lists built-in?)
>
> The major thing is that constants and global-like objects don't produce SSA values and thus don't have use-lists. https://mlir.llvm.org/docs/Rationale/#multithreading-the-compiler discusses this a bit.
>
> For constants, the data is stored as an Attribute (context-uniqued metadata that has no use-list and is not SSA). This attribute can either be placed in the attribute list (if the operand is always constant, like the value of a switch case), or it must be explicitly materialized via some operation. For example, the `std.constant <https://mlir.llvm.org/docs/Dialects/Standard/#constant-operation>` operation will materialize an SSA value from some attribute data.
>
> For references to functions and other global-like objects, we have a non-SSA mechanism built around `symbols`. This is essentially using a special attribute to reference the function by name, instead of by SSA value. You can find more information on MLIR symbols here <https://mlir.llvm.org/docs/SymbolsAndSymbolTables/>.
>
> Along with the above, there is a trait that can be attached to operations called `IsolatedFromAbove <https://mlir.llvm.org/docs/Traits/#isolatedfromabove>`. This essentially means that no SSA values defined above a region can be referenced from within that region. The pass manager only allows scheduling passes on operations that have this property, meaning that all pipelines are implicitly multi-threaded.
>
> The pass manager in MLIR was heavily inspired by the work on the new pass manager in LLVM, but with specific constraints/requirements that are unique to the design of MLIR. That being said, there are some usability features added that would also make great additions to LLVM: instance-specific pass options and statistics, pipeline crash reproducer generation, etc.
>
> Not sure if any of the above helps clarify, but happy to chat more if you are interested.
>
> -- River
>
>> - Dave
>
> River,
> The big thing from my reading of the Pass Manager in MLIR is that it allows us to iterate through a pass per function or module as a group, allowing it to run async. I've proposed this on the GCC side:
> https://gcc.gnu.org/ml/gcc/2020-02/msg00247.html
>
> It's to walk through the IPA passes, which are similar to analysis passes on the LLVM side.

Hi Nicholas,

I can't say anything about the GCC side, but this isn't a particularly novel aspect of the MLIR pass manager. In many ways, the pass manager is the easiest/simplest part of the multi-threading problem. The bigger problem is making sure that the rest of the compiler infrastructure is structured in a way that is thread-safe, or can be made thread-safe. This is why most of the discussion is based around how to model things like constants, global values, etc. When I made MLIR multi-threaded a year ago, a large majority of my time was spent outside of the pass manager. For a real example, I spent much more time just on multi-threaded pass timing <https://mlir.llvm.org/docs/WritingAPass/#multi-threaded-pass-timing> than making the pass manager itself multi-threaded.

-- River

> Nick
>
>> -Chris

_______________________________________________
LLVM Developers mailing list
llvm-dev at lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20200229/94ee82bd/attachment.html>
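River's description of constants as context-uniqued attributes can be illustrated with a toy sketch (Python, with invented names -- this is not the actual MLIR C++ API): because attribute data is immutable and uniqued in the context, referencing a constant never mutates shared state, unlike appending a use to an LLVM use-list.

```python
import threading

class ToyContext:
    """Toy analogue of context-uniqued attributes (hypothetical, not MLIR's API).

    Attributes are immutable and uniqued per context: every request for
    the same value returns the same canonical object. Readers never
    mutate shared state (there is no use-list to append to), so only
    attribute *creation* needs a lock.
    """

    def __init__(self):
        self._uniqued = {}
        self._lock = threading.Lock()

    def get_integer_attr(self, value: int) -> tuple:
        key = ("integer", value)
        with self._lock:
            # Return the single canonical instance for this value.
            return self._uniqued.setdefault(key, key)

ctx = ToyContext()
a = ctx.get_integer_attr(42)
b = ctx.get_integer_attr(42)
print(a is b)  # True: uniqued and shared by identity, no per-use bookkeeping
```

The point of the sketch is that a thousand threads can all "use" the constant 42 concurrently without any synchronization, since using it is a pure read.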
On 2/29/20 7:23 PM, River Riddle wrote:
> [snip]
> [snip]

Actually, in my experience the biggest problem is whether we can detect IPO and make async guarantees based on that. MLIR runs operations, but only for a module or set of functions, without this. One of my dreams would be to run passes in parallel, including IPO detection, and stop if it cannot continue past an IPO pass or set of passes due to changes. Maybe MLIR does do that, but it's the one bottleneck that is really hard to fix.

Nick
On Sat, Feb 29, 2020 at 5:14 PM Nicholas Krause via llvm-dev <llvm-dev at lists.llvm.org> wrote:
> [snip]
>
> Actually in my experience, the biggest problem is if we can detect IPO and run async guarantees on that. MLIR runs operations but only for a module or set of functions without this.
> One of my dreams would be to run passes in parallel including IPO detection, and stop if it cannot continue past an IPO pass or set of passes due to changes.
>
> Maybe MLIR does do that, but it's the one bottleneck that is really hard to fix.

What MLIR does (and what would require quite some work in LLVM) is make sure that you can process and transform functions in isolation, allowing *local* optimizations to run in parallel. This does not solve the IPO problem you're after. As I understand it, that is a difficult thing to design, and it requires rethinking how you structure the passes and the pass pipeline entirely.

Running function passes and "local" optimizations in parallel in LLVM isn't possible because the structures in the LLVMContext aren't thread-safe, and because the IR itself isn't thread-safe: even just DCE'ing or CSE'ing a function call requires modifying the callee (through its use-list).

-- Mehdi
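Mehdi's isolation point can be sketched with a toy model (Python, with an invented data layout -- nothing here corresponds to MLIR's real classes): when call sites refer to callees by name (a symbol) rather than through a shared use-list, a per-function pass only touches the function it was handed, so functions can be processed in parallel without locks.

```python
from concurrent.futures import ThreadPoolExecutor

# Toy "module": each function owns its own body and refers to callees
# by name only, so rewriting one function never touches another.
module = {
    "main":   {"body": ["call @helper", "nop", "ret"]},
    "helper": {"body": ["nop", "mul", "ret"]},
}

def local_cleanup_pass(func):
    # A trivial local pass: delete no-op instructions. Note that even
    # erasing "call @helper" would not require updating "helper" itself,
    # because the reference is by name, not via a use-list on the callee.
    func["body"] = [inst for inst in func["body"] if inst != "nop"]

# Each function is isolated, so the pass can run on all of them at once.
with ThreadPoolExecutor() as pool:
    list(pool.map(local_cleanup_pass, module.values()))

print(module["main"]["body"])    # ['call @helper', 'ret']
print(module["helper"]["body"])  # ['mul', 'ret']
```

In LLVM the equivalent rewrite is not lock-free, because deleting a call instruction updates the callee Function's use-list, which other threads may be reading or writing concurrently.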
> I can't say anything about the GCC side, but this isn't a particularly novel aspect of the MLIR pass manager. [snip] For a real example, I spent much more time just on multi-threaded pass timing <https://mlir.llvm.org/docs/WritingAPass/#multi-threaded-pass-timing> than making the pass manager itself multi-threaded.

Picking on this point, perhaps a bit off-topic for the whole discussion. I have recently realized that for the purpose of multi-threaded pass timing currently existing LLVM timers
Oops... sorry, hit send too soon.

> I can't say anything about the GCC side, but this isn't a particularly novel aspect of the MLIR pass manager. In many ways, the pass manager is the easiest/simplest part of the multi-threading problem. The bigger problem is making sure that the rest of the compiler infrastructure is structured in a way that is thread-safe, or can be made thread-safe. This is why most of the discussion is based around how to model things like constants, global values, etc. When I made MLIR multi-threaded a year ago, a large majority of my time was spent outside of the pass manager. For a real example, I spent much more time just on multi-threaded pass timing <https://mlir.llvm.org/docs/WritingAPass/#multi-threaded-pass-timing> than making the pass manager itself multi-threaded.

Picking on this point, perhaps a bit off-topic for the whole discussion.

I have recently realized that for the purpose of multi-threaded pass timing, the currently existing LLVM timers are hardly suitable, since they measure per-process time instead of per-thread time (and there seem to be no portable LLVM interfaces for per-thread time queries :( ).

From a first glance it seems that in your MLIR timing examples all the times are also per-process. How do you handle cases when half of your threads are doing something else? And if you handle it per-thread - can you point me to the code doing that, pls :)

regards,
  Fedor.

PS: as I read the MLIR docs linked above, I see quite a bunch of features that would be very welcome in LLVM core...
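Fedor's per-process vs per-thread distinction can be demonstrated with plain Python (stdlib only, nothing MLIR-specific): `time.process_time()` aggregates CPU time across all threads of the process, while `time.thread_time()` (Python 3.7+) charges only the calling thread.

```python
import threading
import time

def spin(seconds):
    """Burn roughly `seconds` of CPU time on the calling thread."""
    start = time.thread_time()
    while time.thread_time() - start < seconds:
        pass

worker = threading.Thread(target=spin, args=(0.2,))
p0, t0 = time.process_time(), time.thread_time()
worker.start()
spin(0.2)       # burn CPU on the main thread as well
worker.join()
p1, t1 = time.process_time(), time.thread_time()

# The process clock saw both threads' work (~0.4s total); the thread
# clock only saw the main thread's share (~0.2s).
print(f"process CPU: {p1 - p0:.2f}s  main-thread CPU: {t1 - t0:.2f}s")
```

A timer built on the process clock would charge a pass for CPU time that other threads spent doing unrelated work, which is exactly the accounting problem per-thread timing avoids.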
On Mar 3, 2020, at 4:12 AM, Fedor Sergeev via llvm-dev <llvm-dev at lists.llvm.org> wrote:
> Oops... sorry, hit send too soon.
>
>> I can't say anything about the GCC side, but this isn't a particularly novel aspect of the MLIR pass manager. [snip] For a real example, I spent much more time just on multi-threaded pass timing <https://mlir.llvm.org/docs/WritingAPass/#multi-threaded-pass-timing> than making the pass manager itself multi-threaded.
>
> Picking on this point, perhaps a bit off-topic for the whole discussion.
>
> I have recently realized that for the purpose of multi-threaded pass timing, the currently existing LLVM timers are hardly suitable, since they measure per-process time instead of per-thread time (and there seem to be no portable LLVM interfaces for per-thread time queries :( ).
>
> From a first glance it seems that in your MLIR timing examples all the times are also per-process. How do you handle cases when half of your threads are doing something else? And if you handle it per-thread - can you point me to the code doing that, pls :)

Indeed, how you account for things is important. Please take a look at "Multi-threaded Pass Timing" on this page: https://mlir.llvm.org/docs/WritingAPass/

-Chris