thr3ads.net - llvm dev - [LLVMdev] x86 unwind support [Jul 2009]

If this information is useful, please help other people find it:
Share via:

Kenneth Uildriks

2009-Jul-18 16:22 UTC

[LLVMdev] x86 unwind support

According to this page:

http://lwn.net/Articles/252125/

data coming from L1 is only about three times as expensive as data
coming from a register.  So putting a register check after *every*
call is probably not going to be profitable, compared to a
thread-local global variable check after every invoke... if they
happen often on a thread, that variable will probably be in cache, and
if they don't happen often, the performance impact will be minimal.

Of course if most methods have variables with destructors, I'll end up
with a check of some kind after almost every (non-nounwind) call
anyway, so a register check would be better.  On the other hand,
implementing the register check would seem to require native codegen
changes at callsites as opposed to an IR-modifying pass with a
possible new intrinsic or two.

Anyway, here's my new plan:

1. A thread local global variable, type i8*, initialized to zero.
2. At invoke callsites, right before the invoke call a native method
(mysetjmp) that:

a. Saves ESI, EDI, EBX, EBP, ESP to a buffer alloca'd within the
method containing the invokesite..
b. Sets EAX to 0
c. Returns.

3. The return value of that native method (EAX) is checked, and if
nonzero, branch to unwind label.  Otherwise, save the value of the
thread-local-global into the buffer, write the address of that
alloca'd buffer into the thread-local global and make the call.

4. After the call returns, copy the old thread-local-global value out
of the alloca'd buffer back to the thread-local-global.

The unwind instruction will then:

1. Load the thread-local-global value.  If it's zero, there's nowhere
to unwind to, so abort.
2. Restore ESI, EDI, EBX, EBP, ESP, and the thread-local-global value
from the buffer.
3. Set EAX to 1.
4. Jump to 2c. (the return instruction for the native method mysetjmp).

The native method will return with all callee-saved registers restored
and a return value in EAX of 1, which will cause the following check
to branch to the unwind label.

Invoke sites only write five callee-saved registers to the stack, and
read/write one pointer to a single thread-local global variable, and
make one direct call.  Unwind sites make one direct call, read five
callee-saved registers from the stack (some distance up, so those
memory values might not be warm) and read/write one pointer to a
single thread-local global variable.

The next step would be to replace the mysetjmp call with a new
intrinsic, and then I'd have to save EIP and do an indirect jump to it
at the unwind site instead of jumping to a constant offset within the
native mysetjmp.  Making mylongjmp call a new intrinsic will
necessitate no other modifications.

On Thu, Jul 16, 2009 at 11:44 AM, Eli Friedman<eli.friedman at gmail.com>
wrote:> On Thu, Jul 16, 2009 at 9:10 AM, Kenneth Uildriks<kennethuil at
gmail.com> wrote:
>> 1. Which ones?  I know that Windows uses it for the "this"
pointer.
>
> The internal fastcc convention and the Windows fastcall convention off
> the top of my head.
>
>> Anyway, unless the callee is required to preserve it in a given
>> calling convention, that doesn't preclude us using it for a
*return*
>> value.  It would be checked after calls return, and wouldn't affect
>> the use of the register for passing values in before the call is made.
>>  The callee would set it right before return.
>
> Right, so that sounds okay.
>
>> 2. Does LLVM support nested functions?  I must have missed that.
>
> To the extent required to implement the gcc nested functions
> extension, yes.  The specific relevant behavior here is that if a
> parameter is marked with the nest attribute, it gets passed in ECX.
>
> -Eli
>
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>

Duncan Sands

2009-Jul-18 20:04 UTC

head link

[LLVMdev] x86 unwind support

Hi Kenneth, this way of implementing unwind won't interact properly with
dwarf exception handling.  That's rather bad.

Ciao,

Duncan.

Kenneth Uildriks

2009-Jul-18 22:48 UTC

head link

[LLVMdev] x86 unwind support

Arrgh.  Exception handling uses invoke and doesn't use unwind!  Or did
I just miss it?

Since unwind doesn't take any operands, is there *any* possible
implementation of unwind that fits with exception handling/invoke?

Implemented as an optional pass, my scheme can go unused when you're
using the g++ front-end or something else that uses __cxa_throw.

On Sat, Jul 18, 2009 at 3:04 PM, Duncan Sands<baldrick at free.fr>
wrote:> Hi Kenneth, this way of implementing unwind won't interact properly
with
> dwarf exception handling.  That's rather bad.
>
> Ciao,
>
> Duncan.
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>

Kenneth Uildriks

2009-Jul-19 15:18 UTC

head link

[LLVMdev] x86 unwind support

OK, I've read through http://www.llvm.org/docs/ExceptionHandling.html
several times now.

Let's see if I understand this...

1. Everywhere inside a "try" block, the C++ front-end emits
"invoke"
instructions instead of "call" instructions.  Without any
transformations, this "invoke" instruction compiles down to assembly
code that doesn't seem to do anything different from a "call"
instruction.  Also, "unwind" compiles down to nothing.  However, every
function gets some DWARF info compiled into it by LLVM, and part of it
is information about the invoke site.

2. To throw an exception, call __cxa_allocate_exception to allocate an
exception object, and __cxa_throw to throw it.

3. Every function gets some DWARF info complied into it by LLVM.  The
__cxa_throw function uses it to find the function that issued the
"invoke" and find the "landing pads" and jump to the right
landing pad
based on the exception type.

4. The landing pad uses exception-handling intrinsics to match the
exception type and to get the exception object.

The lowerinvoke pass adds SJLJ-based unwinding, which is a separate
mechanism based on GCC sjlj exception handling.

My proposed pass adds a lighter-weight setjmp/longjmp-style unwinding.

How do either of these prevent DWARF exception handling from working?
Would a landing pad expecting to get an exception object from the
exception intrinsics fail to get one in the case of an unwind and
crash?

Did I misunderstand anything I outlined above?

Is the exception-throwing function call expected to become an
intrinsic or an instruction in the future?  Will it replace unwind?

(Perhaps I should put all this aside and just have my compiler handle
my invoke/unwind logic instead of trying to use invoke/unwind
instructions.)

On Sat, Jul 18, 2009 at 3:04 PM, Duncan Sands<baldrick at free.fr>
wrote:> Hi Kenneth, this way of implementing unwind won't interact properly
with
> dwarf exception handling.  That's rather bad.
>
> Ciao,
>
> Duncan.
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>

Seemingly Similar Threads

Search for more apparently analagous threads

llvm dev - Jul 2009 - [LLVMdev] x86 unwind support

[LLVMdev] x86 unwind support

[LLVMdev] x86 unwind support

[LLVMdev] x86 unwind support

[LLVMdev] x86 unwind support

Seemingly Similar Threads