thr3ads.net - llvm dev - [LLVMdev] Using JIT code to code a program to call C++ [Mar 2012]

If this information is useful, please help other people find it:
Share via:

Paul J. Lucas

2012-Mar-15 00:02 UTC

[LLVMdev] Using JIT code to code a program to call C++

My project has a C++ library that I want to allow the user to use via some
programming language to be JIT'd to call functions in said library.  For the
sake of simplicity, assume the library has classes like:

	class item {
	public:
	  item();
	  item( int );
	  ~item();
	  // ...
	};

	class item_iterator {
	public:
	  virtual ~item_iterator();
	  virtual bool next( item *result ) = 0;
	};

	class singleton_iterator : public item_iterator {
	public:
	  singleton_iterator( item const &i );
	  // ...
	};

I'm aware that LLVM doesn't know anything about C++ and that one way to
call C++ functions is to wrap them in C thunks:

	extern "C" {

	  void thunk_item_M_new( item *addr ) {
	    new( addr ) item;
	  }

	  void thunk_singleton_iterator_M_new( singleton_iterator *addr, item *i ) {
	    new( addr ) singleton_iterator( *i );
	  }

	  bool thunk_iterator_M_next( item_iterator *that, item *result ) {
	    return that->next( result );
	  }

	} // extern "C"

The first problem is how to allocate an item from LLVM.  I know how to create
StructTypes and add fields to them, but I don't have to have to parallel the
C++ class layout -- that's tedious and error-prone.

The idea I got was simply to add a char[sizeof(T)] as the only field to a
StructType for a C++ class type:

  StructType *const llvm_item_type = StructType::create( llvm_ctx,
"item" );
  vector<Type*> llvm_struct_types;
  llvm_struct_types.push_back( ArrayType::get( IntegerType::get( llvm_ctx, 8 ),
sizeof( item ) ) );
  llvm_item_type->setBody( llvm_struct_types, false );
  PointerType *const llvm_item_ptr_type = PointerType::getUnqual( llvm_item_type
);

I would think that, because it's a StructType, the alignment would be
correct and the sizeof(item) would get the size right.  Will that work?  Is
there a better way?

The second problem is that, unlike the C++ class hierarchy, there's no
inheritance relationship between StructTypes.  If I create a Function that takes
an llvm_iterator_type but try to build a Function object using an
llvm_singleton_iterator_type, the LLVM verifyModule() function complains at me:
> Call parameter type does not match function signature!
So then I thought I'd simply use void* everywhere:

  Type *const llvm_void_type = Type::getVoidTy( llvm_ctx );
  PointerType *const llvm_void_ptr_type = PointerType::getUnqual( llvm_void_type
);

but verifyModule() still complains at me because, apparently, there's no
automatic casting to void* types in LLVM.  How can I solve this problem?

- Paul

Óscar Fuentes

2012-Mar-15 00:51 UTC

head link

[LLVMdev] Using JIT code to code a program to call C++

"Paul J. Lucas" <paul at lucasmail.org> writes:

[snip]
> I'm aware that LLVM doesn't know anything about C++ and that one
way
> to call C++ functions is to wrap them in C thunks:
Yes, this is the only sane way unless you are willing to replicate the
ABI of your chosen C++ implementation onto your system. Let's forget
about dealing with more than one C++ ABI ;-)

[snip]
> I would think that, because it's a StructType, the alignment would be
> correct and the sizeof(item) would get the size right.  Will that
> work?
Yes. I do the same on my compiler. Please note that this trick is only
necessary if you pretend to put C++ objects on your stack. If all C++
objects your JITted code creates are on the heap, you can simply use
`new' instead of placement `new' in the C function that creates the C++
object and deal with the returned pointer..
> Is there a better way?
It all depends on the charateristics of your language.

[snip]
> but verifyModule() still complains at me because, apparently, there's
> no automatic casting to void* types in LLVM.  How can I solve this
> problem?
You must do the cast yourself, or simply use void* for all pointers for
all your C++ objects and deal with its real type on the upper part of
the compiler.

As said above, it all depends on the details of your language.

Paul J. Lucas

2012-Mar-21 02:38 UTC

head link

[LLVMdev] Catching C++ exceptions, cleaning up, rethrowing

To recap, on Mar 14, 2012, I wrote:
> My project has a C++ library that I want to allow the user to use via some
programming language to be JIT'd to call functions in said library.  For the
sake of simplicity, assume the library has classes like:
> 
> 	class item_iterator {
> 	public:
> 	  virtual ~item_iterator();
> 	  virtual bool next( item *result ) = 0;
> 	};
> 
> I'm aware that LLVM doesn't know anything about C++ and that one
way to call C++ functions is to wrap them in C thunks:
> 
> 	extern "C" bool thunk_iterator_M_next( void *v_that, void
*v_result ) {
> 	  item_iterator *const that = static_cast<item_iterator*>( v_that );
> 	  item *const result = static_cast<item*>( v_result );
> 	  return that->next( result );
> 	}
> 
> 	extern "C" void thunk_iterator_M_delete( void *v_that ) {
> 	  item_iterator *const that = static_cast<item_iterator*>( v_that );
> 	  that->~item_iterator();
> 	}
Thanks to a previous answer, I now have everything working.  My next problem is
to deal with exceptions that may be thrown from the C++ functions that are
called via the thunks, e.g., what if that->next() throws an exception?  I
need to be able to catch it, call a clean-up function, and rethrow the exception
so the code calling the JIT'd code can deal with the exception.

I've read the docs on LLVM exceptions, but I don't see any examples.  A
little help?

One thought might be to try to handle all the C++ exception code in the thunks. 
The JIT'd code would create/maintain a simple array-of-structs like:

	struct dtor_pair {
	  void (*dtor_fn)(void*);
	  void *that;
	};
	dtor_pair dtor_pairs[10];

that contain a pointer to the thunk for a destructor and a pointer to the object
to be destructed.

As the JIT'd code creates objects on the stack, it populates the dtor_pairs
array.  This array could then be passed to every thunk:

	extern "C" bool thunk_iterator_M_next( void *v_that, void *v_result,
	                                       dtor_pairs *dtors ) {
	  try {
	    item_iterator *const that = static_cast<item_iterator*>( v_that );
	    item *const result = static_cast<item*>( v_result );
	    return that->next( result );
	  }
	  catch ( ... ) {
	    run_dtors( dtors );
	    throw;
	  }
	}

where run_dtors() would run through the array backwards calling the destructor
functions in reverse order of construction.

Would this work?  If so, then I wouldn't have to mess with handing C++
exceptions from LLVM.  But is there a better "LLVM way" to do what I
want?

- Paul

Bill Wendling

2012-Mar-22 07:28 UTC

head link

[LLVMdev] Catching C++ exceptions, cleaning up, rethrowing

On Mar 20, 2012, at 7:38 PM, Paul J. Lucas wrote:
> To recap, on Mar 14, 2012, I wrote:
> 
>> My project has a C++ library that I want to allow the user to use via
some programming language to be JIT'd to call functions in said library. 
For the sake of simplicity, assume the library has classes like:
>> 
>> 	class item_iterator {
>> 	public:
>> 	  virtual ~item_iterator();
>> 	  virtual bool next( item *result ) = 0;
>> 	};
>> 
>> I'm aware that LLVM doesn't know anything about C++ and that
one way to call C++ functions is to wrap them in C thunks:
>> 
>> 	extern "C" bool thunk_iterator_M_next( void *v_that, void
*v_result ) {
>> 	  item_iterator *const that = static_cast<item_iterator*>(
v_that );
>> 	  item *const result = static_cast<item*>( v_result );
>> 	  return that->next( result );
>> 	}
>> 
>> 	extern "C" void thunk_iterator_M_delete( void *v_that ) {
>> 	  item_iterator *const that = static_cast<item_iterator*>(
v_that );
>> 	  that->~item_iterator();
>> 	}
> 
> Thanks to a previous answer, I now have everything working.  My next
problem is to deal with exceptions that may be thrown from the C++ functions
that are called via the thunks, e.g., what if that->next() throws an
exception?  I need to be able to catch it, call a clean-up function, and rethrow
the exception so the code calling the JIT'd code can deal with the
exception.
> 
> I've read the docs on LLVM exceptions, but I don't see any
examples.  A little help?
> I don't think this has anything to do with LLVM's IR-level exception
system. It sounds to me like you just need a way to handle C++ exceptions inside
of the C++ code and then rethrow so that the JIT's caller can do its thing.
(Right?)

You could move the C++ code into a C++ function that catches all exceptions. The
C functions you provide would call the small bit of C++ code that would then
execute the "real" functionality. You would have to wrap/unwrap the
variables, of course. (There are examples of wrapping/unwrapping of variables in
LLVM's source tree.) That way you will get to use C++'s exception
handling system instead of creating your own, which is a huge massive
undertaking full of pitfalls. When you rethrow the exception, it will propagate
past the C function to the code calling the JIT'ed code.
> One thought might be to try to handle all the C++ exception code in the
thunks.  The JIT'd code would create/maintain a simple array-of-structs
like:
> 
> 	struct dtor_pair {
> 	  void (*dtor_fn)(void*);
> 	  void *that;
> 	};
> 	dtor_pair dtor_pairs[10];
> 
> that contain a pointer to the thunk for a destructor and a pointer to the
object to be destructed.
> 
> As the JIT'd code creates objects on the stack, it populates the
dtor_pairs array.  This array could then be passed to every thunk:
> 
> 	extern "C" bool thunk_iterator_M_next( void *v_that, void
*v_result,
> 	                                       dtor_pairs *dtors ) {
> 	  try {
> 	    item_iterator *const that = static_cast<item_iterator*>( v_that
);
> 	    item *const result = static_cast<item*>( v_result );
> 	    return that->next( result );
> 	  }
> 	  catch ( ... ) {
> 	    run_dtors( dtors );
> 	    throw;
> 	  }
> 	}
> 
> where run_dtors() would run through the array backwards calling the
destructor functions in reverse order of construction.
> 
> Would this work?  If so, then I wouldn't have to mess with handing C++
exceptions from LLVM.  But is there a better "LLVM way" to do what I
want?
> What you're doing is recreating what the personality function and DWARF
unwinding library do.

-bw

Apparently Analagous Threads

Search for more reasonably related threads

llvm dev - Mar 2012 - [LLVMdev] Using JIT code to code a program to call C++

[LLVMdev] Using JIT code to code a program to call C++

[LLVMdev] Using JIT code to code a program to call C++

[LLVMdev] Catching C++ exceptions, cleaning up, rethrowing

[LLVMdev] Catching C++ exceptions, cleaning up, rethrowing

Apparently Analagous Threads