thr3ads.net - llvm dev - [LLVMdev] Casting between address spaces and address space semantics [Jul 2008]

If this information is useful, please help other people find it:
Share via:

Matthijs Kooijman

2008-Jul-17 12:08 UTC

[LLVMdev] Casting between address spaces and address space semantics

Hi all,

I'm currently struggling a bit with some problems regarding address spaces
and
(implicit) casts. I'll explain some context first and then proceed to the
actual question I'd like to have answered.

In our target platform, we have a number of distinctly different memory banks.
To access these from our C code, we declare a global array for each memory,
with the address space attribute to mark in which memory the array should be
allocated. This allows the backend to map the load and store instructions from
and to this array on the right target instructions, depending on the memory
used. For example:

__attribute__((address_space(1))) char mem1[100];
__attribute__((address_space(2))) char mem2[100];

Now, we are using a function which reads a value from one of these memories
and does some processing. Since we want to execute this function for multiple
memories, we make it accept a pointer in the generic address space (ie, no
address space attribute):
	
	void do_stuff(char* mem);

Somewhere later on, we call the function as follows:

	do_stuff(mem1);
	...
	do_stuff(mem2);

As expected, the LLVM IR resulting from this contains a bitcast to remove the
address space from mem1 and mem2 (and also cast from [100 x i8] to i8*, but
that is less interesting). Now, this bitcast really isn't supported by
our backend (more specifically, the do_stuff function can't be codegen'd
since
we need to know from which address space we are reading at copile time).

To solve this, we make sure the do_stuff function gets inlined. When this
happens, our problems should go away, since we now know at compile time which
address space is used for every load and store instruction. However, the
bitcast that removes the address space won't go away.

What I would like to see, is that the bitcast be removed and the addrspace
annotated type be propagated to the gep/load/store instructions that use it.



However, this brings me to my actual question: How are address spaces
semantically defined? I see two options here:

a) Every address space has the full range of addresses and they completely
live side by side. This means that, for example, i32 addrspace(1) * 100 points
to a different piece of memory than i32 addrspace(2) * 100. Also, this means
that a bitcast from one address space to another (possibly 0), makes the
pointer point to something different when loaded.

b) Every address space is really a subspace of the full range of addresses,
but always disjoint. This means that, for example, i32 addrspace(1) * 100
points to the same memory as i32 addrspace(2) * 100, though one, or possibly
both of them can be invalid (since the pointer lies outside of that address
space). This also means that bitcasting a pointer from one address space to
another doesn't change it's meaning, though it can potentially become
invalid.

This approach also allows addrspace(0) to become a bit more special: Any
pointer can be valid in that addrspace. This means that casting to
addrspace(0) and then loading is semantically the same as loading directly
(though it can be that the hardware can't map something like that). This
approach would also allow for removing the bitcast in my above problem, since
the loads from the generic address space can be replaced by loads from a
specific address space, without changing semantics. This does not currently
happen, but I would be willing to implement this. Finally, this approach is
consistent with the address spaces defined by DSP-C.



There are probably a few more approaches, that are similar to the ones above.
I would suggest that whatever approach we pick, we should document it
somewhere (langref?), since leaving this stuff backend-specific makes
transformations not so useful. Being consistent with DSP-C is probably the
best track here?

Gr.

Matthijs
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: Digital signature
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20080717/53402465/attachment.sig>

Eli Friedman

2008-Jul-17 14:25 UTC

head link

[LLVMdev] Casting between address spaces and address space semantics

On Thu, Jul 17, 2008 at 5:08 AM, Matthijs Kooijman <matthijs at stdin.nl>
wrote:> Now, we are using a function which reads a value from one of these memories
> and does some processing. Since we want to execute this function for
multiple
> memories, we make it accept a pointer in the generic address space (ie, no
> address space attribute):
>
>        void do_stuff(char* mem);
The "generic address space" you're referring to is really just
address
space 0, at least in the current implementation.  Assuming alternate
address spaces are actually separate, passing a pointer from a
different address space to this function is undefined behavior.
> However, this brings me to my actual question: How are address spaces
> semantically defined? I see two options here:
>
> a) Every address space has the full range of addresses and they completely
> live side by side. This means that, for example, i32 addrspace(1) * 100
points
> to a different piece of memory than i32 addrspace(2) * 100. Also, this
means
> that a bitcast from one address space to another (possibly 0), makes the
> pointer point to something different when loaded.
>
> b) Every address space is really a subspace of the full range of addresses,
> but always disjoint. This means that, for example, i32 addrspace(1) * 100
> points to the same memory as i32 addrspace(2) * 100, though one, or
possibly
> both of them can be invalid (since the pointer lies outside of that address
> space). This also means that bitcasting a pointer from one address space to
> another doesn't change it's meaning, though it can potentially
become invalid.
Address spaces are platform-specific, so you can define them any way
you want, I suppose.  But if you can map the alternate address spaces
into address space 0, is there really any point to keeping the other
address spaces around?

-Eli

Mon P Wang

2008-Jul-17 16:48 UTC

head link

[LLVMdev] Casting between address spaces and address space semantics

In ISO/IEC WG14 n1169 on the C extensions to support embedded  
processors, any two address spaces must be disjoint, must be  
equivalent, or must be nested.  As Eli indicated, the actual  
relationship is platform specific depending on what makes the most  
sense for your hardware and how the program will behave will depend on  
that relationship.

   -- Mon Ping

On Jul 17, 2008, at 7:25 AM, Eli Friedman wrote:
> On Thu, Jul 17, 2008 at 5:08 AM, Matthijs Kooijman  
> <matthijs at stdin.nl> wrote:
>> Now, we are using a function which reads a value from one of these  
>> memories
>> and does some processing. Since we want to execute this function  
>> for multiple
>> memories, we make it accept a pointer in the generic address space  
>> (ie, no
>> address space attribute):
>>
>>       void do_stuff(char* mem);
>
> The "generic address space" you're referring to is really
just address
> space 0, at least in the current implementation.  Assuming alternate
> address spaces are actually separate, passing a pointer from a
> different address space to this function is undefined behavior.
>
>> However, this brings me to my actual question: How are address spaces
>> semantically defined? I see two options here:
>>
>> a) Every address space has the full range of addresses and they  
>> completely
>> live side by side. This means that, for example, i32 addrspace(1) *  
>> 100 points
>> to a different piece of memory than i32 addrspace(2) * 100. Also,  
>> this means
>> that a bitcast from one address space to another (possibly 0),  
>> makes the
>> pointer point to something different when loaded.
>>
>> b) Every address space is really a subspace of the full range of  
>> addresses,
>> but always disjoint. This means that, for example, i32 addrspace(1)  
>> * 100
>> points to the same memory as i32 addrspace(2) * 100, though one, or  
>> possibly
>> both of them can be invalid (since the pointer lies outside of that  
>> address
>> space). This also means that bitcasting a pointer from one address  
>> space to
>> another doesn't change it's meaning, though it can potentially
>> become invalid.
>
> Address spaces are platform-specific, so you can define them any way
> you want, I suppose.  But if you can map the alternate address spaces
> into address space 0, is there really any point to keeping the other
> address spaces around?
>
> -Eli
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev

Apparently Analagous Threads

Search for more apparently analagous threads

llvm dev - Jul 2008 - [LLVMdev] Casting between address spaces and address space semantics

[LLVMdev] Casting between address spaces and address space semantics

[LLVMdev] Casting between address spaces and address space semantics

[LLVMdev] Casting between address spaces and address space semantics

Apparently Analagous Threads