thr3ads.net - llvm dev - [LLVMdev] Proposal: New IR instruction for casting between address spaces [Sep 2012]

If this information is useful, please help other people find it:
Share via:

Villmow, Micah

2012-Sep-12 21:45 UTC

[LLVMdev] Proposal: New IR instruction for casting between address spaces

> -----Original Message-----
> From: Mon P Wang [mailto:monping at apple.com]
> Sent: Wednesday, September 12, 2012 1:12 PM
> To: Villmow, Micah
> Cc: Dan Gohman; llvmdev at cs.uiuc.edu
> Subject: Re: [LLVMdev] Proposal: New IR instruction for casting between
> address spaces
> 
> Hi,
> 
> On Sep 11, 2012, at 2:30 PM, Villmow, Micah wrote:
> 
> >
> >
> >> -----Original Message-----
> >> From: Dan Gohman [mailto:gohman at apple.com]
> >> Sent: Tuesday, September 11, 2012 1:28 PM
> >> To: Villmow, Micah
> >> Cc: llvmdev at cs.uiuc.edu
> >> Subject: Re: [LLVMdev] Proposal: New IR instruction for casting
> between
> >> address spaces
> >>
> >> On Sep 11, 2012, at 1:03 PM, "Villmow, Micah"
> <Micah.Villmow at amd.com>
> >> wrote:
> >>
> >>>
> >>> From: Villmow, Micah
> >>> Sent: Tuesday, September 11, 2012 12:51 PM
> >>> To: llvm-commits at cs.uiuc.edu
> >>> Subject: Proposal: New IR instruction for casting between
address
> >>> spaces
> >>>
> >>> Problem:
> >>> Bit casting between pointers of different address spaces only
works
> if
> >> all address space pointers are the same size. With changes from
> email
> >> chain [1][2], support for different pointer sizes breaks the
bitcast
> >> instruction since there is no guarantee that the pointer size for
> the
> >> address space is on the source and destination arguments are of
the
> same
> >> size.
> >>
> >> Can you comment on whether the need for this seems like a
> fundamental
> >> need, in your field, or more of a limitation of the current
> generation
> >> of architectures?
> > [Villmow, Micah] I think this is a little of both. While the current
> and previous generation of GPU architectures are limited in what they
> are capable of doing based on hardware restrictions, I also think there
> is a fundamental need. Not all devices will run with 64bit operations
> at full speed(or even have native instructions in any case), but memory
> sizes will soon eclipse what is addressable with 32bit pointers on non-
> PC systems. What this is causing is 32bit systems requiring addressing
> into 64bit memory and switching over to 64bit for address calculations
> destroys the performance advantage that the segmented memory provides.
> > In the CPU world, this isn't that much of a problem that I can
tell
> as they have already been solved(32bit vs 64bit math is 1-1 in most
> cases), but in non-CPU architectures, this is a huge performance
> penalty(64bit mul runs 6x slower than 32bit mul). So being able to
> switch to 32bit in the cases where it is required and switch to 64bit
> where it is required is a fundamental need that I don't think will go
> away even if the architectures improve their memory infrastructure.
> >>
> >>> Solution:
> >>> Remove the ability of bitcast to cast between pointers of
different
> >> address spaces and replace with an instruction that handles this
> case
> >> explicitely.
> >>>
> >>> Proposed changes:
> >>> *         Add restriction to the verifier on the bitcast
> instruction
> >> making bitcasting between address spaces illegal.
> >>> *         Change documentation[3] to state the bitcast to
pointers
> of
> >> different address spaces is illegal.
> >>> *         Add in a new IR node, addrspacecast, that allows
> conversions
> >> between address spaces
> >>> *         Updated the reader/writer to handle these cases
> >>> *         Update the documentation to insert the new IR node.
> >>> *         Add the following documentation:
> >>> 'addrspacecast .. to' Instruction
> >>>
> >>> Syntax:
> >>>
> >>>  <result> = addrspacecast <ty> <value> to
<ty2>             ;
> yields
> >> ty2
> >>> Overview:
> >>>
> >>> The ' addrspacecast ' instruction converts value to
type ty2
> without
> >> changing any bits.
> >>
> >> This is mildly imprecise, because the whole point of this
> instruction is
> >> that it can change the bit width.
> > [Villmow, Micah] Doh, cut and paste error, will fix it.
> >>
> >>>
> >>> Arguments:
> >>>
> >>> The ' addrspacecast ' instruction takes a value to
cast, which must
> be
> >> a non-aggregate first class value with a pointer type, and a type
to
> >> cast it to, which must also be a pointer type. The pointer types
of
> >> value and the destination type, ty2, must be identical, except for
> the
> >> address space.
> >>
> >> Having a "pointer type" is sufficient to imply that it
is a "non-
> >> aggregate first class value".
> >>
> >>>
> >>> Semantics:
> >>>
> >>> The ' addrspacecast ' instruction converts value to
type ty2. It
> >> converts the type to the type that is implied by the address space
> of
> >> the destination pointer. If the destination pointer is smaller
than
> the
> >> source pointer, the upper bits are truncated. If the inverse is
> true,
> >> the upper bits are sign extended, otherwise the operation is a no-
> op.
> >>
> >> Why sign-extended? Ptrtoint/inttoptr are zero-extended, and
it's
> >> surprising that addrspacecast would be different from them.
> > [Villmow, Micah] Take for example a pointer representing a negative
> pointer offset into a 16 bit address space, if this is converted to a
> 64bit address space, the upper 48 bits would be zero and your negative
> offset just became positive. The difference between these two
> instruction types is that addrspacecast does not explicitly convert to
> any size, only implicitly, so the bits would need to be filled
> correctly.
> >>
> 
> I view a pointer as pointing to a location in memory and not as an
> offset relative to some base register.  I think the proper semantic
> here is the same as inttoptr where it does a zero-extension.[Villmow, Micah] Yeah, but the pointer won't point to the same location if
the conversion from a smaller pointer to a larger pointer is zero extended.
Take two address spaces(1 and 2) that are 16 and 64 bits in size.
int(1) *a = 0xFFFFFFF9;
int(2) *b = *a;
Is b -10(SExt), or is it 4294967289(ZExt)?
This works for inttoptr and ptrtoint because there is an assumption that the
pointer is always the same size. Maybe we even need to extend ptrtoint/inttoptr
to handle this case by adding unsigned versions?
> 
>   -- Mon Ping
> 
> 
> 
> 
> >> Dan
> >>
> >
> >
> >
> > _______________________________________________
> > LLVM Developers mailing list
> > LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> > http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>

Mon Ping Wang

2012-Sep-13 08:55 UTC

head link

[LLVMdev] Proposal: New IR instruction for casting between address spaces

On Sep 12, 2012, at 2:45 PM, "Villmow, Micah" <Micah.Villmow at
amd.com> wrote:
> 
> 
>> -----Original Message-----
>> From: Mon P Wang [mailto:monping at apple.com]
>> Sent: Wednesday, September 12, 2012 1:12 PM
>> To: Villmow, Micah
>> Cc: Dan Gohman; llvmdev at cs.uiuc.edu
>> Subject: Re: [LLVMdev] Proposal: New IR instruction for casting between
>> address spaces
>> 
>> Hi,
>> 
>> On Sep 11, 2012, at 2:30 PM, Villmow, Micah wrote:
>> 
>>> 
>>> 
>>>> -----Original Message-----
>>>> From: Dan Gohman [mailto:gohman at apple.com]
>>>> Sent: Tuesday, September 11, 2012 1:28 PM
>>>> To: Villmow, Micah
>>>> Cc: llvmdev at cs.uiuc.edu
>>>> Subject: Re: [LLVMdev] Proposal: New IR instruction for casting
>> between
>>>> address spaces
>>>> 
>>>> On Sep 11, 2012, at 1:03 PM, "Villmow, Micah"
>> <Micah.Villmow at amd.com>
>>>> wrote:
>>>> 
>>>>> 
>>>>> From: Villmow, Micah
>>>>> Sent: Tuesday, September 11, 2012 12:51 PM
>>>>> To: llvm-commits at cs.uiuc.edu
>>>>> Subject: Proposal: New IR instruction for casting between
address
>>>>> spaces
>>>>> 
>>>>> Problem:
>>>>> Bit casting between pointers of different address spaces
only works
>> if
>>>> all address space pointers are the same size. With changes from
>> email
>>>> chain [1][2], support for different pointer sizes breaks the
bitcast
>>>> instruction since there is no guarantee that the pointer size
for
>> the
>>>> address space is on the source and destination arguments are of
the
>> same
>>>> size.
>>>> 
>>>> Can you comment on whether the need for this seems like a
>> fundamental
>>>> need, in your field, or more of a limitation of the current
>> generation
>>>> of architectures?
>>> [Villmow, Micah] I think this is a little of both. While the
current
>> and previous generation of GPU architectures are limited in what they
>> are capable of doing based on hardware restrictions, I also think there
>> is a fundamental need. Not all devices will run with 64bit operations
>> at full speed(or even have native instructions in any case), but memory
>> sizes will soon eclipse what is addressable with 32bit pointers on non-
>> PC systems. What this is causing is 32bit systems requiring addressing
>> into 64bit memory and switching over to 64bit for address calculations
>> destroys the performance advantage that the segmented memory provides.
>>> In the CPU world, this isn't that much of a problem that I can
tell
>> as they have already been solved(32bit vs 64bit math is 1-1 in most
>> cases), but in non-CPU architectures, this is a huge performance
>> penalty(64bit mul runs 6x slower than 32bit mul). So being able to
>> switch to 32bit in the cases where it is required and switch to 64bit
>> where it is required is a fundamental need that I don't think will
go
>> away even if the architectures improve their memory infrastructure.
>>>> 
>>>>> Solution:
>>>>> Remove the ability of bitcast to cast between pointers of
different
>>>> address spaces and replace with an instruction that handles
this
>> case
>>>> explicitely.
>>>>> 
>>>>> Proposed changes:
>>>>> *         Add restriction to the verifier on the bitcast
>> instruction
>>>> making bitcasting between address spaces illegal.
>>>>> *         Change documentation[3] to state the bitcast to
pointers
>> of
>>>> different address spaces is illegal.
>>>>> *         Add in a new IR node, addrspacecast, that allows
>> conversions
>>>> between address spaces
>>>>> *         Updated the reader/writer to handle these cases
>>>>> *         Update the documentation to insert the new IR
node.
>>>>> *         Add the following documentation:
>>>>> 'addrspacecast .. to' Instruction
>>>>> 
>>>>> Syntax:
>>>>> 
>>>>> <result> = addrspacecast <ty> <value> to
<ty2>             ;
>> yields
>>>> ty2
>>>>> Overview:
>>>>> 
>>>>> The ' addrspacecast ' instruction converts value to
type ty2
>> without
>>>> changing any bits.
>>>> 
>>>> This is mildly imprecise, because the whole point of this
>> instruction is
>>>> that it can change the bit width.
>>> [Villmow, Micah] Doh, cut and paste error, will fix it.
>>>> 
>>>>> 
>>>>> Arguments:
>>>>> 
>>>>> The ' addrspacecast ' instruction takes a value to
cast, which must
>> be
>>>> a non-aggregate first class value with a pointer type, and a
type to
>>>> cast it to, which must also be a pointer type. The pointer
types of
>>>> value and the destination type, ty2, must be identical, except
for
>> the
>>>> address space.
>>>> 
>>>> Having a "pointer type" is sufficient to imply that
it is a "non-
>>>> aggregate first class value".
>>>> 
>>>>> 
>>>>> Semantics:
>>>>> 
>>>>> The ' addrspacecast ' instruction converts value to
type ty2. It
>>>> converts the type to the type that is implied by the address
space
>> of
>>>> the destination pointer. If the destination pointer is smaller
than
>> the
>>>> source pointer, the upper bits are truncated. If the inverse is
>> true,
>>>> the upper bits are sign extended, otherwise the operation is a
no-
>> op.
>>>> 
>>>> Why sign-extended? Ptrtoint/inttoptr are zero-extended, and
it's
>>>> surprising that addrspacecast would be different from them.
>>> [Villmow, Micah] Take for example a pointer representing a negative
>> pointer offset into a 16 bit address space, if this is converted to a
>> 64bit address space, the upper 48 bits would be zero and your negative
>> offset just became positive. The difference between these two
>> instruction types is that addrspacecast does not explicitly convert to
>> any size, only implicitly, so the bits would need to be filled
>> correctly.
>>>> 
>> 
>> I view a pointer as pointing to a location in memory and not as an
>> offset relative to some base register.  I think the proper semantic
>> here is the same as inttoptr where it does a zero-extension.
> [Villmow, Micah] Yeah, but the pointer won't point to the same location
if the conversion from a smaller pointer to a larger pointer is zero extended.
> Take two address spaces(1 and 2) that are 16 and 64 bits in size.
> int(1) *a = 0xFFFFFFF9;
> int(2) *b = *a;
> Is b -10(SExt), or is it 4294967289(ZExt)?
I think you mean if is it -10 (Sext) or 65529 (Zext from 16b to 64b)? 

I would expect the same result if I wrote
  int(1) *a = 0x0FFF9;
  int(2) *b = *a;

In C, integer to point conversions are implementation defined and depends on
what the addressing structure of the execution environment is. Given the current
definition of ptrtoint and intoptr, I feel that the addressing structure feels
like a flat memory model starting from 0 and the value "b" should be
65529.  In your example where we know the largest pointer is 64b, I would expect
the final result to be the same as doing a ptrtoint from int(1) to i64 and
intotptr to int(2)*.

-- Mon Ping
> This works for inttoptr and ptrtoint because there is an assumption that
the pointer is always the same size. Maybe we even need to extend
ptrtoint/inttoptr to handle this case by adding unsigned versions?



> 
>> 
>>  -- Mon Ping
>> 
>> 
>> 
>> 
>>>> Dan
>>>> 
>>> 
>>> 
>>> 
>>> _______________________________________________
>>> LLVM Developers mailing list
>>> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
>>> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>> 
> 
> 
> 
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev

Villmow, Micah

2012-Sep-13 14:51 UTC

head link

[LLVMdev] Proposal: New IR instruction for casting between address spaces

> -----Original Message-----
> From: Mon Ping Wang [mailto:monping at apple.com]
> Sent: Thursday, September 13, 2012 1:55 AM
> To: Villmow, Micah
> Cc: llvmdev at cs.uiuc.edu
> Subject: Re: [LLVMdev] Proposal: New IR instruction for casting between
> address spaces
> 
> 
> On Sep 12, 2012, at 2:45 PM, "Villmow, Micah" <Micah.Villmow
at amd.com>
> wrote:
> 
> >
> >
> >> -----Original Message-----
> >> From: Mon P Wang [mailto:monping at apple.com]
> >> Sent: Wednesday, September 12, 2012 1:12 PM
> >> To: Villmow, Micah
> >> Cc: Dan Gohman; llvmdev at cs.uiuc.edu
> >> Subject: Re: [LLVMdev] Proposal: New IR instruction for casting
> >> between address spaces
> >>
> >> Hi,
> >>
> >> On Sep 11, 2012, at 2:30 PM, Villmow, Micah wrote:
> >>
> >>>
> >>>
> >>>> -----Original Message-----
> >>>> From: Dan Gohman [mailto:gohman at apple.com]
> >>>> Sent: Tuesday, September 11, 2012 1:28 PM
> >>>> To: Villmow, Micah
> >>>> Cc: llvmdev at cs.uiuc.edu
> >>>> Subject: Re: [LLVMdev] Proposal: New IR instruction for
casting
> >> between
> >>>> address spaces
> >>>>
> >>>> On Sep 11, 2012, at 1:03 PM, "Villmow, Micah"
> >> <Micah.Villmow at amd.com>
> >>>> wrote:
> >>>>
> >>>>>
> >>>>> From: Villmow, Micah
> >>>>> Sent: Tuesday, September 11, 2012 12:51 PM
> >>>>> To: llvm-commits at cs.uiuc.edu
> >>>>> Subject: Proposal: New IR instruction for casting
between address
> >>>>> spaces
> >>>>>
> >>>>> Problem:
> >>>>> Bit casting between pointers of different address
spaces only
> >>>>> works
> >> if
> >>>> all address space pointers are the same size. With changes
from
> >> email
> >>>> chain [1][2], support for different pointer sizes breaks
the
> >>>> bitcast instruction since there is no guarantee that the
pointer
> >>>> size for
> >> the
> >>>> address space is on the source and destination arguments
are of the
> >> same
> >>>> size.
> >>>>
> >>>> Can you comment on whether the need for this seems like a
> >> fundamental
> >>>> need, in your field, or more of a limitation of the
current
> >> generation
> >>>> of architectures?
> >>> [Villmow, Micah] I think this is a little of both. While the
current
> >> and previous generation of GPU architectures are limited in what
they
> >> are capable of doing based on hardware restrictions, I also think
> >> there is a fundamental need. Not all devices will run with 64bit
> >> operations at full speed(or even have native instructions in any
> >> case), but memory sizes will soon eclipse what is addressable with
> >> 32bit pointers on non- PC systems. What this is causing is 32bit
> >> systems requiring addressing into 64bit memory and switching over
to
> >> 64bit for address calculations destroys the performance advantage
> that the segmented memory provides.
> >>> In the CPU world, this isn't that much of a problem that I
can tell
> >> as they have already been solved(32bit vs 64bit math is 1-1 in
most
> >> cases), but in non-CPU architectures, this is a huge performance
> >> penalty(64bit mul runs 6x slower than 32bit mul). So being able to
> >> switch to 32bit in the cases where it is required and switch to
64bit
> >> where it is required is a fundamental need that I don't think
will go
> >> away even if the architectures improve their memory
infrastructure.
> >>>>
> >>>>> Solution:
> >>>>> Remove the ability of bitcast to cast between pointers
of
> >>>>> different
> >>>> address spaces and replace with an instruction that
handles this
> >> case
> >>>> explicitely.
> >>>>>
> >>>>> Proposed changes:
> >>>>> *         Add restriction to the verifier on the
bitcast
> >> instruction
> >>>> making bitcasting between address spaces illegal.
> >>>>> *         Change documentation[3] to state the bitcast
to pointers
> >> of
> >>>> different address spaces is illegal.
> >>>>> *         Add in a new IR node, addrspacecast, that
allows
> >> conversions
> >>>> between address spaces
> >>>>> *         Updated the reader/writer to handle these
cases
> >>>>> *         Update the documentation to insert the new
IR node.
> >>>>> *         Add the following documentation:
> >>>>> 'addrspacecast .. to' Instruction
> >>>>>
> >>>>> Syntax:
> >>>>>
> >>>>> <result> = addrspacecast <ty>
<value> to <ty2>             ;
> >> yields
> >>>> ty2
> >>>>> Overview:
> >>>>>
> >>>>> The ' addrspacecast ' instruction converts
value to type ty2
> >> without
> >>>> changing any bits.
> >>>>
> >>>> This is mildly imprecise, because the whole point of this
> >> instruction is
> >>>> that it can change the bit width.
> >>> [Villmow, Micah] Doh, cut and paste error, will fix it.
> >>>>
> >>>>>
> >>>>> Arguments:
> >>>>>
> >>>>> The ' addrspacecast ' instruction takes a
value to cast, which
> >>>>> must
> >> be
> >>>> a non-aggregate first class value with a pointer type, and
a type
> >>>> to cast it to, which must also be a pointer type. The
pointer types
> >>>> of value and the destination type, ty2, must be identical,
except
> >>>> for
> >> the
> >>>> address space.
> >>>>
> >>>> Having a "pointer type" is sufficient to imply
that it is a "non-
> >>>> aggregate first class value".
> >>>>
> >>>>>
> >>>>> Semantics:
> >>>>>
> >>>>> The ' addrspacecast ' instruction converts
value to type ty2. It
> >>>> converts the type to the type that is implied by the
address space
> >> of
> >>>> the destination pointer. If the destination pointer is
smaller than
> >> the
> >>>> source pointer, the upper bits are truncated. If the
inverse is
> >> true,
> >>>> the upper bits are sign extended, otherwise the operation
is a no-
> >> op.
> >>>>
> >>>> Why sign-extended? Ptrtoint/inttoptr are zero-extended,
and it's
> >>>> surprising that addrspacecast would be different from
them.
> >>> [Villmow, Micah] Take for example a pointer representing a
negative
> >> pointer offset into a 16 bit address space, if this is converted
to a
> >> 64bit address space, the upper 48 bits would be zero and your
> >> negative offset just became positive. The difference between these
> >> two instruction types is that addrspacecast does not explicitly
> >> convert to any size, only implicitly, so the bits would need to be
> >> filled correctly.
> >>>>
> >>
> >> I view a pointer as pointing to a location in memory and not as an
> >> offset relative to some base register.  I think the proper
semantic
> >> here is the same as inttoptr where it does a zero-extension.
> > [Villmow, Micah] Yeah, but the pointer won't point to the same
> location if the conversion from a smaller pointer to a larger pointer is
> zero extended.
> > Take two address spaces(1 and 2) that are 16 and 64 bits in size.
> > int(1) *a = 0xFFFFFFF9;
> > int(2) *b = *a;
> > Is b -10(SExt), or is it 4294967289(ZExt)?
> 
> I think you mean if is it -10 (Sext) or 65529 (Zext from 16b to 64b)?
> 
> I would expect the same result if I wrote
>   int(1) *a = 0x0FFF9;
>   int(2) *b = *a;
> 
> In C, integer to point conversions are implementation defined and
> depends on what the addressing structure of the execution environment
> is. Given the current definition of ptrtoint and intoptr, I feel that
> the addressing structure feels like a flat memory model starting from 0
> and the value "b" should be 65529.  In your example where we know
the
> largest pointer is 64b, I would expect the final result to be the same
> as doing a ptrtoint from int(1) to i64 and intotptr to int(2)*.[Villmow, Micah] So then if there is already a way to do this, what really is
the benefit of adding a new instruction?
Also there is a typo in my example, the second assignment should not have the
'*'. I can add a new instruction if that
is the recommended behavior, but I think it would also be fine to force ptrtoint
and inttoptr, although it does take one instruction more.
> 
> -- Mon Ping
> 
> > This works for inttoptr and ptrtoint because there is an assumption
> that the pointer is always the same size. Maybe we even need to extend
> ptrtoint/inttoptr to handle this case by adding unsigned versions?
> 
> 
> 
> 
> 
> >
> >>
> >>  -- Mon Ping
> >>
> >>
> >>
> >>
> >>>> Dan
> >>>>
> >>>
> >>>
> >>>
> >>> _______________________________________________
> >>> LLVM Developers mailing list
> >>> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> >>> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
> >>
> >
> >
> >
> > _______________________________________________
> > LLVM Developers mailing list
> > LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> > http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>

Apparently Analagous Threads

Search for more apparently analagous threads

llvm dev - Sep 2012 - [LLVMdev] Proposal: New IR instruction for casting between address spaces

[LLVMdev] Proposal: New IR instruction for casting between address spaces

[LLVMdev] Proposal: New IR instruction for casting between address spaces

[LLVMdev] Proposal: New IR instruction for casting between address spaces

Apparently Analagous Threads