thr3ads.net - llvm dev - [LLVMdev] Improving Garbage Collection [Jul 2011]

If this information is useful, please help other people find it:
Share via:

Peter Lawrence

2011-Jul-18 18:00 UTC

[LLVMdev] Improving Garbage Collection

Talin,
            do you identify safe-points in the current or proposed  
llvm scheme, and if so how,
or are they implicit as being at all call sites (which begs the  
question what about leaves
in the call tree, how does GC get started at all in that case).


Peter Lawrence.

Talin

2011-Jul-19 05:53 UTC

head link

[LLVMdev] Improving Garbage Collection

On Mon, Jul 18, 2011 at 11:00 AM, Peter Lawrence
<peterl95124 at sbcglobal.net>wrote:
> Talin,
>           do you identify safe-points in the current or proposed llvm
> scheme, and if so how,
> or are they implicit as being at all call sites (which begs the question
> what about leaves
> in the call tree, how does GC get started at all in that case).
>
> The LLVM linker has a feature where you can specify what kind of safepoints your collector requires - the options are Loop, Return, PreCall and
PostCall. You can also override this behavior and examine each instruction
and return a boolean indicating whether it is or isn't a safe point.

Currently I only have function calls as safe points, although I may
eventually enable loops as well. As far as leaf functions go, consider that
the call to allocate memory is also a safe point - and if a function doesn't
allocate any memory then we don't care if the GC is involved or not.

One complication with the current scheme is that the frontend has to have a
sense of where the safe points are going to be. Because the current scheme
requires the frontend to insert additional loads and stores around safe
points (for spilling register values to memory so they can be traced), the
frontend has to be able to guess which function call might be a safe point -
but it can't know for sure due to the fact that optimization and inlining
(which happens much later) may cause the removal of the actual call
instruction. The safe but inefficient approach is to insert the extra loads
and stores around every call instruction.

Peter Lawrence.>
-- 
-- Talin
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20110718/b57dcd48/attachment.html>

Peter Lawrence

2011-Jul-19 17:06 UTC

head link

[LLVMdev] Improving Garbage Collection

Talin,
           how about having the front-end generate an llvm.safe.point 
() intrinsic call at
the desired safe points,  and having the addresses of the GC roots  
(at that point,
can vary from call to call) be the parameters (with noescape  
attribute) to the intrinsic,

IIUC currently the GC roots are tagged, and all analysis and  
transform optimizations
have to special case these tagged objects, but if instead their  
addresses were taken at
the safe points no special casing would have to be done -- all  
analysis and transform
optimizations already know how to deal with objects whose address is  
taken,

and since llvm does already have a "noescape" (not sure that's the
correct name?)
attribute for parameters, these addresses won't be misinterpreted by  
any alias
analysis either, and llvm is free to go ahead and keep these values  
in registers
between safe points  -- you can stop asking how to allow GC roots as  
SSA values,
any traditional load-store optimization pass will do it for you for  
free.

*** without you having to insert explicit load and store  
instructions, and having to
somehow mark them as non-delete-able, or always omit optimization  
passes that
you would otherwise like to have enabled ***

and also without you having to store NULL into a safe point to end  
its lifetime.
and I would suggest eliminating the gcroot() intrinsic as it's  
information content
would be redundant.


thoughts, comments ???


-Peter Lawrence.




On Jul 18, 2011, at 10:53 PM, Talin wrote:
> On Mon, Jul 18, 2011 at 11:00 AM, Peter Lawrence  
> <peterl95124 at sbcglobal.net> wrote:
> Talin,
>           do you identify safe-points in the current or proposed  
> llvm scheme, and if so how,
> or are they implicit as being at all call sites (which begs the  
> question what about leaves
> in the call tree, how does GC get started at all in that case).
>
> The LLVM linker has a feature where you can specify what kind of  
> safe points your collector requires - the options are Loop, Return,  
> PreCall and PostCall. You can also override this behavior and  
> examine each instruction and return a boolean indicating whether it  
> is or isn't a safe point.
>
> Currently I only have function calls as safe points, although I may  
> eventually enable loops as well. As far as leaf functions go,  
> consider that the call to allocate memory is also a safe point -  
> and if a function doesn't allocate any memory then we don't care if
> the GC is involved or not.
>
> One complication with the current scheme is that the frontend has  
> to have a sense of where the safe points are going to be. Because  
> the current scheme requires the frontend to insert additional loads  
> and stores around safe points (for spilling register values to  
> memory so they can be traced), the frontend has to be able to guess  
> which function call might be a safe point - but it can't know for  
> sure due to the fact that optimization and inlining (which happens  
> much later) may cause the removal of the actual call instruction.  
> The safe but inefficient approach is to insert the extra loads and  
> stores around every call instruction.
>
> Peter Lawrence.
>
> -- 
> -- Talin
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20110719/d23bc323/attachment.html>

Andrew Trick

2011-Jul-19 22:59 UTC

head link

[LLVMdev] Improving Garbage Collection

On Jul 18, 2011, at 10:53 PM, Talin wrote:
> Currently I only have function calls as safe points, although I may
eventually enable loops as well. As far as leaf functions go, consider that the
call to allocate memory is also a safe point - and if a function doesn't
allocate any memory then we don't care if the GC is involved or not.
That logic only applies to single-threaded apps (but it may still be good enough
for you in practice).

-Andy

Maybe Matching Threads

Search for more possibly parallel threads

llvm dev - Jul 2011 - [LLVMdev] Improving Garbage Collection

[LLVMdev] Improving Garbage Collection

[LLVMdev] Improving Garbage Collection

[LLVMdev] Improving Garbage Collection

[LLVMdev] Improving Garbage Collection

Maybe Matching Threads