thr3ads.net - llvm dev - [LLVMdev] Available code-generation parallism [Nov 2008]

If this information is useful, please help other people find it:
Share via:

Jonathan Brandmeyer

2008-Nov-02 22:20 UTC

[LLVMdev] Available code-generation parallism

I am interested in making my LLVM front-end multi-threaded in a way
similar to the GCC compiler server proposal and was wondering about the
extent that the LLVM passes support it.

Expression-at-a-time parallel construction:
If function definitions are built purely depth-first, such that the
parent pointers are not provided as they are created, what will break?
I noted that the function and module verifiers aren't complaining, at
least not yet.  Is there a generic "fixup upward-pointing parent
pointers" pass that can be run afterwords?  If not, do I need to
implement and perform that pass?  I suspect that emitting code for
individual expressions in parallel will probably end up being too
fine-grained, which leads me to...

Function-at-a-time parallel construction:
Which (if any) LLVM objects support the object-level thread safety
guarantee?  If I construct two separate function pass managers in
separate threads and use them to optimize and emit object code for
separate llvm::Function definitions in the program, will this work?
Same question for llvm::Modules.

Thanks,
-Jonathan Brandmeyer

Evan Cheng

2008-Nov-03 06:55 UTC

head link

[LLVMdev] Available code-generation parallism

Unfortunately, llvm code generator is not *yet* thread safe. My  
understanding is there isn't a huge amount of work to do to make it so  
but I don't have the details.

Evan

On Nov 2, 2008, at 2:20 PM, Jonathan Brandmeyer wrote:
> I am interested in making my LLVM front-end multi-threaded in a way
> similar to the GCC compiler server proposal and was wondering about  
> the
> extent that the LLVM passes support it.
>
> Expression-at-a-time parallel construction:
> If function definitions are built purely depth-first, such that the
> parent pointers are not provided as they are created, what will break?
> I noted that the function and module verifiers aren't complaining, at
> least not yet.  Is there a generic "fixup upward-pointing parent
> pointers" pass that can be run afterwords?  If not, do I need to
> implement and perform that pass?  I suspect that emitting code for
> individual expressions in parallel will probably end up being too
> fine-grained, which leads me to...
>
> Function-at-a-time parallel construction:
> Which (if any) LLVM objects support the object-level thread safety
> guarantee?  If I construct two separate function pass managers in
> separate threads and use them to optimize and emit object code for
> separate llvm::Function definitions in the program, will this work?
> Same question for llvm::Modules.
>
> Thanks,
> -Jonathan Brandmeyer
>
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev

Chris Lattner

2008-Nov-03 09:06 UTC

head link

[LLVMdev] Available code-generation parallism

On Nov 2, 2008, at 2:20 PM, Jonathan Brandmeyer wrote:> I am interested in making my LLVM front-end multi-threaded in a way
> similar to the GCC compiler server proposal and was wondering about  
> the
> extent that the LLVM passes support it.
Do you have a link for this?  I'm not familiar with any parallelism  
proposed by that project.  My understanding was that it was mostly  
about sharing across invocations of the compiler.
> Expression-at-a-time parallel construction:
> If function definitions are built purely depth-first, such that the
> parent pointers are not provided as they are created, what will break?
> I noted that the function and module verifiers aren't complaining, at
> least not yet.  Is there a generic "fixup upward-pointing parent
> pointers" pass that can be run afterwords?  If not, do I need to
> implement and perform that pass?  I suspect that emitting code for
> individual expressions in parallel will probably end up being too
> fine-grained, which leads me to...
Are you talking about building your AST or about building LLVM IR.   
The rules for constructing your AST are pretty much defined by you.   
The rules for constructing LLVM IR are a bit more tricky.  The most  
significant issue right now is that certain objects in LLVM IR are  
uniqued (like constants) and these have use/def chains.  Since use/def  
chain updating is not atomic or locked, this means that you can't  
create llvm ir on multiple threads.  This is something that I'm very  
much interested in solving someday, but no one is working on it at  
this time (that I'm aware of).

> Function-at-a-time parallel construction:
> Which (if any) LLVM objects support the object-level thread safety
> guarantee?  If I construct two separate function pass managers in
> separate threads and use them to optimize and emit object code for
> separate llvm::Function definitions in the program, will this work?
> Same question for llvm::Modules.
Unfortunately, for the above reason... basically none.  The LLVM code  
generators are actually very close to being able to run in parallel.   
The major issue is that they run a few llvm IR level passes first (LSR  
and codegen prepare) that hack on LLVM IR before the code generators  
run.  Because of this, they inherit the limitations of LLVM IR  
passes.  Very long term, I'd really like to make the code generator  
not affect the LLVM IR being put into them, but this is not likely to  
happen anytime in the near future.

If you're interested in this, tackling the use/def atomicity issues  
would be a great place to start.

-Chris

heisenbug

2008-Nov-03 23:55 UTC

head link

[LLVMdev] Available code-generation parallism

On 3 Nov., 10:06, Chris Lattner <clatt... at apple.com>
wrote:> On Nov 2, 2008, at 2:20 PM, Jonathan Brandmeyer wrote:
>
> > I am interested in making my LLVM front-end multi-threaded in a way
> > similar to the GCC compiler server proposal and was wondering about  
> > the
> > extent that the LLVM passes support it.
>
> Do you have a link for this?  I'm not familiar with any parallelism  
> proposed by that project.  My understanding was that it was mostly  
> about sharing across invocations of the compiler.
>
> > Expression-at-a-time parallel construction:
> > If function definitions are built purely depth-first, such that the
> > parent pointers are not provided as they are created, what will break?
> > I noted that the function and module verifiers aren't complaining,
at
> > least not yet.  Is there a generic "fixup upward-pointing parent
> > pointers" pass that can be run afterwords?  If not, do I need to
> > implement and perform that pass?  I suspect that emitting code for
> > individual expressions in parallel will probably end up being too
> > fine-grained, which leads me to...
>
> Are you talking about building your AST or about building LLVM IR.  
> The rules for constructing your AST are pretty much defined by you.  
> The rules for constructing LLVM IR are a bit more tricky.  The most  
> significant issue right now is that certain objects in LLVM IR are  
> uniqued (like constants) and these have use/def chains.  Since use/def  
> chain updating is not atomic or locked, this means that you can't  
> create llvm ir on multiple threads.  This is something that I'm very  
> much interested in solving someday, but no one is working on it at  
> this time (that I'm aware of).
What about "inventing" pseudo-constants (which point to the right
thing) and build the piece of IR with them. When done, grab mutex and
RAUW it in. Alternatively, submit to a privileged thread that performs
the RAUW.
The trick is to prepare the def/use chain(s) to a degree that the
mutex is only held a minimal time. If only IR-builder threads are
running concurrently there is no danger that a real constant vanishes,
leaving behind a stale reference from a pseudo-constant.

Any major headaches I have ignored?

Cheers,

   Gabor

>
> > Function-at-a-time parallel construction:
> > Which (if any) LLVM objects support the object-level thread safety
> > guarantee?  If I construct two separate function pass managers in
> > separate threads and use them to optimize and emit object code for
> > separate llvm::Function definitions in the program, will this work?
> > Same question for llvm::Modules.
>
> Unfortunately, for the above reason... basically none.  The LLVM code  
> generators are actually very close to being able to run in parallel.  
> The major issue is that they run a few llvm IR level passes first (LSR  
> and codegen prepare) that hack on LLVM IR before the code generators  
> run.  Because of this, they inherit the limitations of LLVM IR  
> passes.  Very long term, I'd really like to make the code generator  
> not affect the LLVM IR being put into them, but this is not likely to  
> happen anytime in the near future.
>
> If you're interested in this, tackling the use/def atomicity issues  
> would be a great place to start.
>
> -Chris
>
> _______________________________________________
> LLVM Developers mailing list
> LLVM... at cs.uiuc.edu      
 http://llvm.cs.uiuc.eduhttp://lists.cs.uiuc.edu/mailman/listinfo/llvmdev

Jonathan Brandmeyer

2008-Nov-07 02:55 UTC

head link

[LLVMdev] Available code-generation parallism

On Mon, 2008-11-03 at 01:06 -0800, Chris Lattner wrote:> On Nov 2, 2008, at 2:20 PM, Jonathan Brandmeyer wrote:
> > I am interested in making my LLVM front-end multi-threaded in a way
> > similar to the GCC compiler server proposal and was wondering about  
> > the
> > extent that the LLVM passes support it.
> 
> Do you have a link for this?  I'm not familiar with any parallelism  
> proposed by that project.  My understanding was that it was mostly  
> about sharing across invocations of the compiler.
Nope, you're right.  I'm not sure where I got that idea, but I certainly
don't see it in their whitepaper.
> Are you talking about building your AST or about building LLVM IR.   
> The rules for constructing your AST are pretty much defined by you.   
> The rules for constructing LLVM IR are a bit more tricky.  The most  
> significant issue right now is that certain objects in LLVM IR are  
> uniqued (like constants) and these have use/def chains.  Since use/def  
> chain updating is not atomic or locked, this means that you can't  
> create llvm ir on multiple threads.  This is something that I'm very  
> much interested in solving someday, but no one is working on it at  
> this time (that I'm aware of).
I'm referring to implementing the construction, optimization, and object
code generation in parallel.
> > Function-at-a-time parallel construction:
> > Which (if any) LLVM objects support the object-level thread safety
> > guarantee?  If I construct two separate function pass managers in
> > separate threads and use them to optimize and emit object code for
> > separate llvm::Function definitions in the program, will this work?
> > Same question for llvm::Modules.
> 
> Unfortunately, for the above reason... basically none.  The LLVM code  
> generators are actually very close to being able to run in parallel.   
> The major issue is that they run a few llvm IR level passes first (LSR  
> and codegen prepare) that hack on LLVM IR before the code generators  
> run.  Because of this, they inherit the limitations of LLVM IR  
> passes.  Very long term, I'd really like to make the code generator  
> not affect the LLVM IR being put into them, but this is not likely to  
> happen anytime in the near future.
> If you're interested in this, tackling the use/def atomicity issues  
> would be a great place to start.
What about lazy unification of uniqued values after IR construction?  If
that pass is performed on a per-module basis, then all of the Modules
will be isolated in memory from each other.  The front-end can partition
its source into N modules in whatever way it sees fit.  Then it can
instantiate a PassManager and Module per thread and build the IR into
them.  That isn't quite as nice as taking advantage of per-function
parallelism where the individual passes allow it, but it would be a step
in the right direction.

Why are Constants uniqued?  Is it purely for the memory savings?

-Jonathan

Apparently Analagous Threads

Search for more seemingly similar threads

llvm dev - Nov 2008 - [LLVMdev] Available code-generation parallism

[LLVMdev] Available code-generation parallism

[LLVMdev] Available code-generation parallism

[LLVMdev] Available code-generation parallism

[LLVMdev] Available code-generation parallism

[LLVMdev] Available code-generation parallism

Apparently Analagous Threads