thr3ads.net - llvm dev - [LLVMdev] improving the ocaml binding's type safety [Mar 2008]

If this information is useful, please help other people find it:
Share via:

Gordon Henriksen

2008-Mar-16 02:33 UTC

[LLVMdev] improving the ocaml binding's type safety

Erick,

After some experimentation, I'd prefer the closed system. LLVM has  
some type peculiarities like the commonality between CallInst and  
InvokeInst. I find that the closed type system lets me express such  
constraints more naturally. Expressing these constraints explicitly in  
the open system involves annotating the C++ class hierarchy with extra  
variants which are unnecessary in the closed model.

Please use 'a Llvm.ty for Type and 'a Llvm.v for Value to save typing.  
These choices avoid conflicting with the common type binding t and the  
language keyword val, but promote these important types to the type  
names into the Llvm module (likely open'd) for brevity's sake.

I don't have a better suggestion than just naming the variant sum  
types Llvm.ll_____. I considered some other options, but decided I'm  
not fond of them in practice.

Thanks,
Gordon

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20080315/7c624f16/attachment.html>

Erick Tryzelaar

2008-Mar-16 07:21 UTC

head link

[LLVMdev] improving the ocaml binding's type safety

On Sat, Mar 15, 2008 at 7:33 PM, Gordon Henriksen
<gordonhenriksen at mac.com> wrote:> After some experimentation, I'd prefer the closed system. LLVM has some
type
> peculiarities like the commonality between CallInst and InvokeInst. I find
> that the closed type system lets me express such constraints more
naturally.
> Expressing these constraints explicitly in the open system involves
> annotating the C++ class hierarchy with extra variants which are
unnecessary
> in the closed model.
It looks like you might be right, and open variants might not be able
to handle the pseudo-shared functions like
llvm::CallInst::doesNotReturn and llvm::InvokeInst::doesNotReturn. We
can't do the naive

val does_not_return : [> `CallInst | `InvokeInst] t -> bool

Like we can with closed variants:

val does_not_return : [< llcallinst | llinvokeinst] t -> bool.

Although... what if we just add another variant? This would work:

type llcallinst = [ llinstruction | `CallInst | `CallSite ]
type llinvokeinst = [ llinstruction | `InvokeInst | `CallSite ]
val does_not_return : [> `CallSite] t -> bool

It makes the variant types a little more complicated, but end users
wouldn't work directly with the variants so there might not be that
much added complexity. They'd just specify "llcallinst t" and the
like. The variants would pretty much only get used in llvm.ml.

What are other problems that I'm missing? Have any other ideas on when
adding yet another variant really would break things down? I have to
think about this more. I just don't understand polymorphic types very
well.

> Please use 'a Llvm.ty for Type and 'a Llvm.v for Value to save
typing. These
> choices avoid conflicting with the common type binding t and the language
> keyword val, but promote these important types to the type names into the
> Llvm module (likely open'd) for brevity's sake.
>
> I don't have a better suggestion than just naming the variant sum types
> Llvm.ll_____. I considered some other options, but decided I'm not fond
of
> them in practice.
I think we'd need only one definition of "type 'a whatever".
The
phantom type would be enough to distinguish everything. For instance,
lablgtk has a "type 'a obj" that they use as the base type for all
of
their variants, which I might copy. We can even hide this type by
naming the variants something like "llfunction_variants" and then
"type llfunction = llfunction_variants obj" to have roughly the same
interface as before.

What about putting the types (and functions) in modules, like
ModuleProvider? It'd be like my scheme to break up llvm.ml into
multiple files without actually splitting it up :) Then you could open
Llvm and reference Value.t without obscuring anything. If not, then
maybe we should unpack ModuleProvider from a module (which I'd prefer
not to do). I was unsure of if it were better to add new functionality
in the top level or in a module, so I erred towards module. I'd also
like to be able to call "Module.create" instead of
"create_module". We
could even open or abbreviate the module name in scope for even
shorter function names than before.

Oh and one last thing that I've been meaning to ask for a long long
time. Do you think we could change the ordering of some of the
arguments? The builder functions, like:

external build_phi : (llvalue * llbasicblock) list -> string ->
llbuilder -> llvalue = "llvm_build_phi"

Have the builder as the last argument, instead of the first as it
normally is done in ocaml libraries. It also hampers currying, but I'm
not sure how often that would be used. The downside is that we'd have
to translate the order in llvm_ocaml.c, but since more of the
functions I want to do this to already have bindings in there, we
wouldn't really have any extra overhead.

Gordon Henriksen

2008-Mar-16 14:38 UTC

head link

[LLVMdev] improving the ocaml binding's type safety

On Mar 16, 2008, at 03:21, Erick Tryzelaar wrote:
> On Sat, Mar 15, 2008 at 7:33 PM, Gordon Henriksen <gordonhenriksen at
mac.com
> > wrote:
>
>> After some experimentation, I'd prefer the closed system. LLVM has
>> some type peculiarities like the commonality between CallInst and  
>> InvokeInst. I find that the closed type system lets me express such  
>> constraints more naturally. Expressing these constraints explicitly  
>> in the open system involves annotating the C++ class hierarchy with  
>> extra variants which are unnecessary in the closed model.
>
> It looks like you might be right, and open variants might not be  
> able to handle the pseudo-shared functions like  
> llvm::CallInst::doesNotReturn and llvm::InvokeInst::doesNotReturn.  
> We can't do the naive
>
> val does_not_return : [> `CallInst | `InvokeInst] t -> bool
>
> Like we can with closed variants:
>
> val does_not_return : [< llcallinst | llinvokeinst] t -> bool.
Exactly.
>> Please use 'a Llvm.ty for Type and 'a Llvm.v for Value to save
>> typing. These choices avoid conflicting with the common type  
>> binding t and the language keyword val, but promote these important  
>> types to the type names into the Llvm module (likely open'd) for  
>> brevity's sake.
>>
>
> For instance, lablgtk has a "type 'a obj" that they use as
the base
> type for all of their variants, which I might copy.
GTK+ has GtkObject at the root of its type system—an (open) tree of  
types. Value and Type have truly disjoint class hierarchies with no  
interfaces in common—a (closed) forest of types. Not only that, but  
there is cause to consider qualifying values with two phantom  
variants, which is not the case for types.

Please continue to use different abstract types, as we currently do  
(lltype and llvalue).
> I think we'd need only one definition of "type 'a
whatever". The
> phantom type would be enough to distinguish everything. We can even  
> hide this type by naming the variants something like  
> "llfunction_variants" and then "type llfunction =  
> llfunction_variants obj" to have roughly the same interface as before.
I am not convinced that your type definition there would work right.  
If it is truly compatible (user code can accept and return values of  
that type, let bind them, put them in structs and refs), please  
proceed (and provide such aliases for all variants). However, if it's  
only 'sort of' compatible, please skip the extra layer of complexity.
>> I don't have a better suggestion than just naming the variant sum  
>> types Llvm.ll_____. I considered some other options, but decided  
>> I'm not fond of them in practice.
>
> What about putting the types (and functions) in modules, like  
> ModuleProvider?
I do like the submodules for keeping odd classes in their own  
namespace, but I rather like how users of the core IR functions read  
in the functional style. Besides, I don't want to change my (user)  
code without good reason. (By contrast, getting type checking is a  
good reason to change my type declarations.)
> It'd be like my scheme to break up llvm.ml into multiple files  
> without actually splitting it up :) Then you could open Llvm and  
> reference Value.t without obscuring anything. We could even open or  
> abbreviate the module name in scope for even shorter function names  
> than before.
I thought [<Value.llconstant] Value.t was getting to be too verbose.  
Note that you don't need to worry about naming conflicts with the  
variants since these are all in the llvm:: namespace in C++.
> I was unsure of if it were better to add new functionality in the  
> top level or in a module, so I erred towards module.

Let's angle to keep the IR in the top-level and the rest in  
submodules. I think that creates a nice balance of brevity (and  
compat) for common, core functions and explicitness and isolation for  
the more obscure features of the infrastructure.
> Oh and one last thing that I've been meaning to ask for a long long  
> time. Do you think we could change the ordering of some of the  
> arguments? The builder functions, like:
At this point, please don't change it.

— Gordon

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20080316/26549f84/attachment.html>

Possibly Parallel Threads

Search for more possibly parallel threads

llvm dev - Mar 2008 - [LLVMdev] improving the ocaml binding's type safety

[LLVMdev] improving the ocaml binding's type safety

[LLVMdev] improving the ocaml binding's type safety

[LLVMdev] improving the ocaml binding's type safety

Possibly Parallel Threads