thr3ads.net - llvm dev - [LLVMdev] endian independence [Oct 2008]

If this information is useful, please help other people find it:
Share via:

Jay Foad

2008-Oct-27 10:14 UTC

[LLVMdev] endian independence

>> I'm already working on this myself. Would you be interested in
having
>> this work contributed back to LLVM?
>
> If this were to better support target independent languages, it would
> be very useful.  If you're just trying to *reduce* the endianness
> assumptions that leak through, I don't think it's a good approach.
> There is just no way to solve this problem with C.
Yes, I can see that the llvm part of this is more straightforward and
less controversial than the llvm-gcc part. Maybe I should submit the
llvm part (since it applies to all source languages) and keep the
llvm-gcc part as a local hack.
> How do you propose to handle things like:
>
> struct foo {
> #ifdef __LITTLE_ENDIAN__
>   int x, y;
> #else
>   int y, x;
> #endif
> };
I can't make all C programs work regardless of target endianness. This
one will only work on little-endian:

  int x = 1;
  assert(*(char *)&x == 1);

You've just highlighted another restriction that I'll have to impose:
you shouldn't expect to be able to detect target endianness at compile
time.

All I want is that, if you write your source code so that it doesn't
make assumptions about endianness, then the compiler and its
optimisations won't introduce any new assumptions about endianness.

Thanks,
Jay.

Jay Foad

2008-Oct-27 18:01 UTC

head link

[LLVMdev] endian independence

> Yes, I can see that the llvm part of this is more straightforward and
> less controversial than the llvm-gcc part. Maybe I should submit the
> llvm part (since it applies to all source languages) and keep the
> llvm-gcc part as a local hack.
Here (attached) is a patch for the llvm parts of this. It doesn't
introduce any new failures in "make check". Some points:

1. I don't understand why Module has its own DataLayout string, and
its own code to parse it (in getEndianness and getPointerSize).
Couldn't it have an instance of TargetData instead? Or is it just that
Module wants a concept of unknown endianness/pointer size, which
TargetData didn't support?

2. I'm assuming that for most code in lib/Target/, lib/CodeGen and
lib/ExecutionEngine you have a real CPU target with known endianness,
so I haven't changed any of that code to check for unknown endianness.

3. The Endianness enumeration should probably live somewhere other
than Module. But I don't know where.

Any comments?

Thanks,
Jay.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: endianness
Type: application/octet-stream
Size: 12339 bytes
Desc: not available
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20081027/015f2858/attachment.obj>

Chris Lattner

2008-Oct-27 18:05 UTC

head link

[LLVMdev] endian independence

On Oct 27, 2008, at 3:14 AM, Jay Foad wrote:
>>> I'm already working on this myself. Would you be interested in
>>> having
>>> this work contributed back to LLVM?
>>
>> If this were to better support target independent languages, it would
>> be very useful.  If you're just trying to *reduce* the endianness
>> assumptions that leak through, I don't think it's a good
approach.
>> There is just no way to solve this problem with C.
>
> Yes, I can see that the llvm part of this is more straightforward and
> less controversial than the llvm-gcc part. Maybe I should submit the
> llvm part (since it applies to all source languages) and keep the
> llvm-gcc part as a local hack.
Ok, if you want to address this in LLVM, the place to start is to make  
the optimizers completely targetdata-independent.  The best way to do  
this (IMO) is to change passes to use "getAnalysisToUpdate" instead of
"getAnalysis/AddRequired" on TargetData.  Then, change opt to only add
targetdata to the passmgr if a target data string exists in the module.

This would make the optimizers transparently take advantage of TD when  
available, but gracefully handle the cases when it isn't.

-Chris

Jay Foad

2008-Oct-27 18:16 UTC

head link

[LLVMdev] endian independence

> Ok, if you want to address this in LLVM, the place to start is to make
> the optimizers completely targetdata-independent.  The best way to do
> this (IMO) is to change passes to use "getAnalysisToUpdate"
instead of
> "getAnalysis/AddRequired" on TargetData.  Then, change opt to
only add
> targetdata to the passmgr if a target data string exists in the module.
>
> This would make the optimizers transparently take advantage of TD when
> available, but gracefully handle the cases when it isn't.
This sounds a bit all-or-nothing. In my case, I know everything about
target data *except* for the endianness, and I don't think I'd want to
disable any significant optimisations that depend on other aspects of
the target data. But maybe I'm an unusual case!

Thanks,
Jay.

Apparently Analagous Threads

Search for more apparently analagous threads

llvm dev - Oct 2008 - [LLVMdev] endian independence

[LLVMdev] endian independence

[LLVMdev] endian independence

[LLVMdev] endian independence

[LLVMdev] endian independence

Apparently Analagous Threads