thr3ads.net - llvm dev - [llvm-dev] The future of LLVM's C APIs: Notes and BoF. [Oct 2015]

If this information is useful, please help other people find it:
Share via:

Justin Bogner via llvm-dev

2015-Oct-17 18:20 UTC

[llvm-dev] The future of LLVM's C APIs: Notes and BoF.

(Moving this to llvm-dev)

On Friday, October 16, 2015, Justin Bogner <mail at justinbogner.com>
wrote:
> Some users of llvm-c want stable API interfaces into various parts of
> the LLVM infrasture, others want further ABI guarantees about this
> usage, and still others simply want a way to bind to LLVM through their
> language frontend’s existing FFI support for C.
>
> If we want to improve the situation for any of these users, we need to
> properly understand how these APIs are being used (or abused)
> today. Juergen and I will be hosting a BoF at the dev meeting where we
> can discuss what the requirements of a sustainable C API are, and how we
> can organize things in LLVM to support this.
>
> There's been a fair amount of discussion about this lately, and
it's
> pretty clear that the "hopefully stable bindings to whatever APIs
> somebody needed" approach isn't really good enough for anybody.
There
> are a couple of points that I think are fairly non-controversial, but
> will more or less drive the discussion at the BoF:
>
> 1. It isn't practical to keep a bindings API stable, unless the
>    underlying API is also stable.
>
> 2. Handrolling bindings as they're needed tends to leave conspicuous
>    gaps where some API is inaccessible for no good reason.
>
> So based on (1), we'll really want to create some purpose built APIs
> that we can keep stable for various tasks. What's needed here? People
> want to do things like building a pass manager, setting up a canned JIT
> config, and to some degree even emit IR. We'll discuss what's
practical
> and what people want, and hopefully strike a good balance.
>
> Similarly, (2) implies that if we really need a *full* bindings API
> we'll want to automate it. But what is a full bindings API? Who uses
it,
> and what do they want from it? If it's automated, should installing
LLVM
> install this API, or should we simply provide an easy way to generate
> it?
>
> In the end, I hope to have a good idea of what people are actually using
> these APIs for, and both to support stable API users less haphazardly
> and to make unstable API more thorough and/or easier to create.
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20151017/ac18d202/attachment.html>

Rodney M. Bates via llvm-dev

2015-Oct-20 20:16 UTC

head link

[llvm-dev] The future of LLVM's C APIs: Notes and BoF.

On 10/17/2015 01:20 PM, Justin Bogner via llvm-dev
wrote:> (Moving this to llvm-dev)
>
> On Friday, October 16, 2015, Justin Bogner <mail at justinbogner.com
<mailto:mail at justinbogner.com>> wrote:
>
>     Some users of llvm-c want stable API interfaces into various parts of
>     the LLVM infrasture, others want further ABI guarantees about this
>     usage, and still others simply want a way to bind to LLVM through their
>     language frontend’s existing FFI support for C.
>
>     If we want to improve the situation for any of these users, we need to
>     properly understand how these APIs are being used (or abused)
>     today. Juergen and I will be hosting a BoF at the dev meeting where we
>     can discuss what the requirements of a sustainable C API are, and how
we
>     can organize things in LLVM to support this.
>
>     There's been a fair amount of discussion about this lately, and
it's
>     pretty clear that the "hopefully stable bindings to whatever APIs
>     somebody needed" approach isn't really good enough for
anybody. There
>     are a couple of points that I think are fairly non-controversial, but
>     will more or less drive the discussion at the BoF:
>
>     1. It isn't practical to keep a bindings API stable, unless the
>         underlying API is also stable.
>
>     2. Handrolling bindings as they're needed tends to leave
conspicuous
>         gaps where some API is inaccessible for no good reason.
>
>     So based on (1), we'll really want to create some purpose built
APIs
>     that we can keep stable for various tasks. What's needed here?
People
>     want to do things like building a pass manager, setting up a canned JIT
>     config, and to some degree even emit IR. We'll discuss what's
practical
>     and what people want, and hopefully strike a good balance.
>
>     Similarly, (2) implies that if we really need a *full* bindings API
>     we'll want to automate it. But what is a full bindings API? Who
uses it,
>     and what do they want from it? If it's automated, should installing
LLVM
>     install this API, or should we simply provide an easy way to generate
>     it?
>
>     In the end, I hope to have a good idea of what people are actually
using
>     these APIs for, and both to support stable API users less haphazardly
>     and to make unstable API more thorough and/or easier to create.
>
One actual use:

A couple of us in the Modula3 support community are working on splicing a
llvm back end onto our Modula3 compiler front end.  We have zero-thickness
bindings written in Modula3 which match C bindings like core.h.  In our front
end executable, we only build llvm IR in memory, then use LLVMPrintModuleToFile
and/or LLVMWriteBitcodeToFile.  We only link in the needed parts of the llvm
infrastructure to build and write the llvm IR.  Then we run stock llc on the
IR code in the emitted file.

We have had to write an additional binding to DIBuilder for this purpose,
as well as to a few odd other C++ functions here and there.  The latest
version of this is for llvm 3.6.1, after the separation of metadata from
values.  It has had minimal testing, but is fairly complete.  Older bindings
to an older DIBuilder exist, and are at least partly working, but only
contain specifically needed functions.

This scheme is passing a majority of preexisting compiler tests of code
function, but there is not much testing of emitted debug info.  Speaking
for myself, better debugger support is one of the primary motivations for
using llvm.  Our older back ends use a really cobbled-up extension of
stabs+.  It's a mess, and lots of debugger function is difficult to
provide.

The DIBuilder binding includes .h and .cpp files similar to core, that provide
a C binding, with lots of [un]wrapping, etc.  Like core, it loses the type
hierarchy.  Modula3 has a type hierarchy, but machine-level representation
is undoubtedly not ABI compatible with any C++ compiler.  The current handling
of this is not thoroughly thought out, just using lots of unsafe casting of
pointers.  I would like to think of something better, but haven't had much
time
for that, so far.
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
-- 
Rodney Bates
rodney.m.bates at acm.org

deadal nix via llvm-dev

2015-Oct-22 01:52 UTC

head link

[llvm-dev] The future of LLVM's C APIs: Notes and BoF.

Yup that is quite similar to my use case, but in addition I use LLVM to JIT
as well from C using MCJIT.

I have a patch for a DIBuilder binding, but it does change the API a bit,
so depending on how this discussion goes, it can or cannot be added. It was
necessary as the current binding wrap all metadata in values, which is very
wasteful when dealing with a large amount of metadata. The breakage is
small (all my existing code still compile and work even using the old
binding API) but very real. I obviously think the change is good or I
wouldn't have made it, but that may be too much for others.

Back on point, the importants things for me to do through the C API are :
 - Write IR, including debug infos and other metadata (it is mostly good,
some IR specificities are not covered, like ordered loads/stores, but these
are edges cases that ca be added. I can prepare a patch if people want
these).
 - Read IR, so I can do transformations.
 - Codegen to object files
 - JIT (using MCJIT is fine, Orca would be better but making a good C api
for it may prove challenging).

I preferably would like that the ABI is not broken, as typecheking won't
work cross language boundary. That being said, I'm ok with this being a
"best practice" or whatever people want to call it rather than a
strong,
enforced, requirement. However, if it is not a strong, enforced
requirement, we need a way to test what versions of LLVM is currently in
use as to choose the right version of the function to call. As long as ABI
breakage are not too frequent, this is workable as far as I'm concerned,
and put less constraint on the evolution of the whole thing.

Last but not least, I started working on a test that would read IR and
regenerate it. This would enforce that IR is readable and writable from the
C API, and that breakage are not done unknowingly like it happened in the
past several times. I really would like this to move forward whatever the
policy is chosen. Random breakage + detective work to figure out what
happened are the worst when we could know things in an automated manner.

As a final note, I'd like for us to focus on resolving this soon enough so
that some work can be done related to the C API before 3.8 is out. Nothing
is on fire yet, but let's not let this slip too much.


2015-10-20 13:16 GMT-07:00 Rodney M. Bates via llvm-dev <
llvm-dev at lists.llvm.org>:
>
>
> On 10/17/2015 01:20 PM, Justin Bogner via llvm-dev wrote:
>
>> (Moving this to llvm-dev)
>>
>>
>> On Friday, October 16, 2015, Justin Bogner <mail at justinbogner.com
>> <mailto:mail at justinbogner.com>> wrote:
>>
>>     Some users of llvm-c want stable API interfaces into various parts
of
>>     the LLVM infrasture, others want further ABI guarantees about this
>>     usage, and still others simply want a way to bind to LLVM through
>> their
>>     language frontend’s existing FFI support for C.
>>
>>     If we want to improve the situation for any of these users, we need
to
>>     properly understand how these APIs are being used (or abused)
>>     today. Juergen and I will be hosting a BoF at the dev meeting where
we
>>     can discuss what the requirements of a sustainable C API are, and
how
>> we
>>     can organize things in LLVM to support this.
>>
>>     There's been a fair amount of discussion about this lately, and
it's
>>     pretty clear that the "hopefully stable bindings to whatever
APIs
>>     somebody needed" approach isn't really good enough for
anybody. There
>>     are a couple of points that I think are fairly non-controversial,
but
>>     will more or less drive the discussion at the BoF:
>>
>>     1. It isn't practical to keep a bindings API stable, unless the
>>         underlying API is also stable.
>>
>>     2. Handrolling bindings as they're needed tends to leave
conspicuous
>>         gaps where some API is inaccessible for no good reason.
>>
>>     So based on (1), we'll really want to create some purpose built
APIs
>>     that we can keep stable for various tasks. What's needed here?
People
>>     want to do things like building a pass manager, setting up a canned
>> JIT
>>     config, and to some degree even emit IR. We'll discuss
what's
>> practical
>>     and what people want, and hopefully strike a good balance.
>>
>>     Similarly, (2) implies that if we really need a *full* bindings API
>>     we'll want to automate it. But what is a full bindings API? Who
uses
>> it,
>>     and what do they want from it? If it's automated, should
installing
>> LLVM
>>     install this API, or should we simply provide an easy way to
generate
>>     it?
>>
>>     In the end, I hope to have a good idea of what people are actually
>> using
>>     these APIs for, and both to support stable API users less
haphazardly
>>     and to make unstable API more thorough and/or easier to create.
>>
>>
> One actual use:
>
> A couple of us in the Modula3 support community are working on splicing a
> llvm back end onto our Modula3 compiler front end.  We have zero-thickness
> bindings written in Modula3 which match C bindings like core.h.  In our
> front
> end executable, we only build llvm IR in memory, then use
> LLVMPrintModuleToFile
> and/or LLVMWriteBitcodeToFile.  We only link in the needed parts of the
> llvm
> infrastructure to build and write the llvm IR.  Then we run stock llc on
> the
> IR code in the emitted file.
>
> We have had to write an additional binding to DIBuilder for this purpose,
> as well as to a few odd other C++ functions here and there.  The latest
> version of this is for llvm 3.6.1, after the separation of metadata from
> values.  It has had minimal testing, but is fairly complete.  Older
> bindings
> to an older DIBuilder exist, and are at least partly working, but only
> contain specifically needed functions.
>
> This scheme is passing a majority of preexisting compiler tests of code
> function, but there is not much testing of emitted debug info.  Speaking
> for myself, better debugger support is one of the primary motivations for
> using llvm.  Our older back ends use a really cobbled-up extension of
> stabs+.  It's a mess, and lots of debugger function is difficult to
> provide.
>
> The DIBuilder binding includes .h and .cpp files similar to core, that
> provide
> a C binding, with lots of [un]wrapping, etc.  Like core, it loses the type
> hierarchy.  Modula3 has a type hierarchy, but machine-level representation
> is undoubtedly not ABI compatible with any C++ compiler.  The current
> handling
> of this is not thoroughly thought out, just using lots of unsafe casting of
> pointers.  I would like to think of something better, but haven't had
much
> time
> for that, so far.
>
>
>> _______________________________________________
>> LLVM Developers mailing list
>> llvm-dev at lists.llvm.org
>> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>>
>>
> --
> Rodney Bates
> rodney.m.bates at acm.org
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20151021/359e9a7e/attachment.html>

Markiyan Kushnir via llvm-dev

2015-Oct-22 12:44 UTC

head link

[llvm-dev] The future of LLVM's C APIs: Notes and BoF.

2015-10-17 21:20 GMT+03:00 Justin Bogner via llvm-dev <llvm-dev at
lists.llvm.org>:> (Moving this to llvm-dev)
>
> On Friday, October 16, 2015, Justin Bogner <mail at justinbogner.com>
wrote:
>>
>> Some users of llvm-c want stable API interfaces into various parts of
>> the LLVM infrasture, others want further ABI guarantees about this
>> usage, and still others simply want a way to bind to LLVM through their
>> language frontend’s existing FFI support for C.
>>
>> If we want to improve the situation for any of these users, we need to
>> properly understand how these APIs are being used (or abused)
>> today. Juergen and I will be hosting a BoF at the dev meeting where we
>> can discuss what the requirements of a sustainable C API are, and how
we
>> can organize things in LLVM to support this.
>>
>> There's been a fair amount of discussion about this lately, and
it's
>> pretty clear that the "hopefully stable bindings to whatever APIs
>> somebody needed" approach isn't really good enough for
anybody. There
>> are a couple of points that I think are fairly non-controversial, but
>> will more or less drive the discussion at the BoF:
>>
>> 1. It isn't practical to keep a bindings API stable, unless the
>>    underlying API is also stable.
>>
>> 2. Handrolling bindings as they're needed tends to leave
conspicuous
>>    gaps where some API is inaccessible for no good reason.
>>
>> So based on (1), we'll really want to create some purpose built
APIs
>> that we can keep stable for various tasks. What's needed here?
People
>> want to do things like building a pass manager, setting up a canned JIT
>> config, and to some degree even emit IR. We'll discuss what's
practical
>> and what people want, and hopefully strike a good balance.
>>
>> Similarly, (2) implies that if we really need a *full* bindings API
>> we'll want to automate it. But what is a full bindings API? Who
uses it,
>> and what do they want from it? If it's automated, should installing
LLVM
>> install this API, or should we simply provide an easy way to generate
>> it?
>>
>> In the end, I hope to have a good idea of what people are actually
using
>> these APIs for, and both to support stable API users less haphazardly
>> and to make unstable API more thorough and/or easier to create.
>
>
I'm using LLVM in my own DSL implemented in pure C, in-memory
compilation/execution of DSL within a long-running application.  I use
LLVM C API heavily to build LLVM modules, generate machine code out of
them and execute it.  Since I link LLVM components into my library
statically, I'm OK to have unstable yet rich C API (possibly
automatically generated, yet clearly documented in header files), and
adjust/re-compile my own things with the new LLVM release.  I realize
that I would not necessarily need the absolutely full set of LLVM API,
so I would like having some options in C API automatic generation.  My
use cases:

    - Using the full IR building capabilities;

    - running passes over modules and individual functions via pass
manager, configuring pass manager via pass manager builder (possibly
more advanced pass manager and/or pm builder uses);

    - I'm using LLVMCreateJITCompilerForModule() in 3.4, 3.5. (Tried
MCJIT in 3.6 but could not reach an equivalent of the former.);

    - getting pointers to compiled objects (LLVMGetPointerToGlobal)
and accessing/executing them at run time;

    - generating textual IR/assembly for debugging purposes;

    - during my DSL recompile, it's important to cleanly dispose the
LLVM context and other LLVM resources using C API in a long running
application (months);

    - something off this topic: nice to have an option to not
link/depend on libraries related to interactive console and/or
graphical features.



> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>

Reasonably Related Threads

Search for more reasonably related threads

llvm dev - Oct 2015 - The future of LLVM's C APIs: Notes and BoF.

[llvm-dev] The future of LLVM's C APIs: Notes and BoF.

[llvm-dev] The future of LLVM's C APIs: Notes and BoF.

[llvm-dev] The future of LLVM's C APIs: Notes and BoF.

[llvm-dev] The future of LLVM's C APIs: Notes and BoF.

Reasonably Related Threads