thr3ads.net - similar to: "[LLVMdev] Unnecessary i16 -> i32 type promotion"

Displaying 20 results from an estimated 8000 matches similar to: "[LLVMdev] Unnecessary i16 -> i32 type promotion"

2011 Jun 23

[LLVMdev] type promotion i16 -> i32

Hello, I'm developing a llvm backend. It seems that, if i16 is not a legal type (no register can hold i16 types in RegisterInfo.td and as a RegisterClass in SelLowering.cpp), i16 should be promoted to i32. Nonotheless, this simple program: int main(){ volatile short a; a= 3; return 0; } which is trasformed in this IR: define i32 @main() nounwind readnone { entry: %a = alloca i16,

[LLVMdev] promotion of return value.

2009 Mar 13

[LLVMdev] promotion of return value.

Hi Rafael, if the nasty hack I mentioned in the PR was removed then the following code should work for both X86 and ARM: define signext i16 @f() nounwind { entry: %0 = tail call signext i16 @h() nounwind %1 = sext i16 %0 to i32 tail call void @g(i32 %1) nounwind %2 = load i16* @x, align 2 ret i16 %2 } This is what llvm-gcc currently produces. On x86 the

[LLVMdev] Pointer sizes, GetElementPtr, and offset sizes

2008 May 02

[LLVMdev] Pointer sizes, GetElementPtr, and offset sizes

The LLVA and LLVM papers motivate the GetElementPtr instruction by arguing that it abstracts implementation details, in particular pointer size, from the compiler. While it does this fine for pointer addresses, it does not manage it for address offsets. Consider the following code: $ cat test.c int main() { int *x[2]; int **y = &x[1]; return (y - x); } $ llvm-gcc -O3 -c test.c

[LLVMdev] promotion of return value.

2009 Mar 14

[LLVMdev] promotion of return value.

Hi, > What we are trying to do is to add new attributes (more maybe added > later): > > > sign_ext_from_i8, sign_ext_from_i16 > > zero_ext_from_i8, zero_ext_from_i16 > > to function definition so (assuming that both caller and callee are > generated in the same front-end) the caller will know if the callee has > already extended the return value or not, then

[LLVMdev] Promoting i16 load to i32

2011 Feb 07

[LLVMdev] Promoting i16 load to i32

Hi, I'm working on an LLVM backend for an architecture which does not natively support half-word loads. I'm having trouble getting LLVM to promote i16 to i32 loads for me - should I expect LLVM to be able to do this, are do I have to write a custom lowerer? This post (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019929.html) gave me the impression that it should be possible,

[LLVMdev] BlockAddress is a "User"

2010 Feb 26

[LLVMdev] BlockAddress is a "User"

I've been playing around with the new IndirectBr and BlockAddress types. I'm finding that in CodeGen, during "EliminateMostlyEmptyBlocks", BlockAddresses are not updated to point to the newly merged block if the original block was eliminated. This is causing me problems. Mind you, I'm experimenting with this using the Sparc backend, which could be the source of blame, but

[LLVMdev] MemoryDependenceAnalysis Bug or Feature?

2010 Jul 18

[LLVMdev] MemoryDependenceAnalysis Bug or Feature?

Yes, I'm not arguing that there is a dependence, just that it's not a clobber dependence. The case of a load is already considered earlier in that function and with isLoad == false it returns MemDepResult::getDef(). My question is: why should a read-only call (which yields AliasAnalysis::Ref and is handled in this code fragment) be any different from e.g. a load. Isn't a read-only

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

2012 Jul 31

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

Micah, I think that your patch is missing the necessary modifications in lib/VMCore/ValueTypes.cpp to EVT::getEVTString() and EVT::getTypeForEVT. -Hal On Fri, 27 Jul 2012 22:54:24 +0000 "Villmow, Micah" <Micah.Villmow at amd.com> wrote: > Vector promotion which is new in LLVM 3.1 is broken for sub32 bit > types. The problem is in the VectorLegalizer::PromoteVectorOp. The

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

2012 Jul 28

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

I think that you attached the wrong patch. The attached patch is the one which adds the new MVT types. From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu] On Behalf Of Villmow, Micah Sent: Saturday, July 28, 2012 01:54 To: Developers Mailing List Subject: [LLVMdev] Vector promotion broken for <2 x [i8|i16]> Vector promotion which is new in LLVM 3.1 is broken for

[RFC] jump threading on std::pair<int, bool>

2018 Mar 08

[RFC] jump threading on std::pair<int, bool>

Hi, While comparing the code by LLVM and GCC for some major libraries, I found that LLVM fails to apply jump threading with a method whose return type is std::pair<int, bool> (actually, any pair of 32-bit values like std::pair<bool, int> and std::pair<int, int>). For example, jump threading does not work for the if statement in func. std::pair<int, bool> callee(int v) {

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

2012 Jul 30

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

>Though I am having trouble in understanding how x86 supports the <1 x i64> type. Based on looking at the code, it should fail because v1i64 is not supported on the x86 platform as >far as I can tell. The Type-Legalizer can handle vector types in the following ways: 1. Split - this splits vectors into two halves. For example on SSE4, <4 x i64> is split to <2 x i64> 2.

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

2012 Aug 01

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

Micah, One more thing ;) -- llvm::getEnumName in utils/TableGen/CodeGenTarget.cpp -Hal On Tue, 31 Jul 2012 21:02:02 +0000 "Villmow, Micah" <Micah.Villmow at amd.com> wrote: > Ahh yep, thanks for catching that, new patch attached. > > > -----Original Message----- > > From: Hal Finkel [mailto:hfinkel at anl.gov] > > Sent: Tuesday, July 31, 2012 1:40 PM

Unfolded additions of constants after promotion of @llvm.ctlz.i16 on SystemZ

2019 Feb 08

Unfolded additions of constants after promotion of @llvm.ctlz.i16 on SystemZ

Hi, SystemZ supports @llvm.ctlz.i64() natively with a single instruction (FLOGR), and lesser bitwidth versions of the intrinsic are promoted to i64. For some reason, this leads to unfolded additions of constants as shown below: This function: define i16 @fun(i16 %arg) { %1 = tail call i16 @llvm.ctlz.i16(i16 %arg, i1 false) ret i16 %1 } ,gives this optimized DAG as input to instruction

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

2019 Nov 14

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

For any of the examples shown below, if the logical equivalent using cmp + other IR instructions is no more than the number of IR instructions as the variant that uses shift, we should consider reversing the canonicalization. To make that happen, you would need to show that at least the minimal cases have codegen that is equal or better using the cmp form for at least a few in-tree targets. My

[LLVMdev] 16bit loads being promoted to 32bit?

2009 Feb 13

[LLVMdev] 16bit loads being promoted to 32bit?

I have the following function: define void @test_fc_0_kernel(i16 signext %x, i16 signext %y, i16 addrspace(11)* %input, i32 addrspace(11)* %result) { entry: %call = tail call i32 @get_id(i32 0) ; <i32> [#uses=2] %cmp = icmp slt i16 %x, %y ; <i1> [#uses=1] br i1 %cmp, label %if.then, label %if.end

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

2012 Jul 31

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

Ahh yep, thanks for catching that, new patch attached. > -----Original Message----- > From: Hal Finkel [mailto:hfinkel at anl.gov] > Sent: Tuesday, July 31, 2012 1:40 PM > To: Villmow, Micah > Cc: Developers Mailing List > Subject: Re: [LLVMdev] Vector promotion broken for <2 x [i8|i16]> > > Micah, > > I think that your patch is missing the necessary

[LLVMdev] BlockAddress is a "User"

2010 Feb 26

[LLVMdev] BlockAddress is a "User"

My apologies. This problem was indeed with my changes to the backend. Next time I will more carefully examine the source of the problem. :) On Fri, Feb 26, 2010 at 12:40 PM, Marc de Kruijf <dekruijf at wisc.edu> wrote: > I've been playing around with the new IndirectBr and BlockAddress types. > I'm finding that in CodeGen, during "EliminateMostlyEmptyBlocks", >

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

2012 Jul 30

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

No, that is correct. I am adding the new types so that I can bitcast v2i8 into a v1i16 and then perform the 'and' operation and have legalize types turn the v1i16 into a scalar. Though I am having trouble in understanding how x86 supports the <1 x i64> type. Based on looking at the code, it should fail because v1i64 is not supported on the x86 platform as far as I can tell. Micah

Intel AMX programming model discussion.

2020 Aug 15

Intel AMX programming model discussion.

Hi Philip, Your idea make sense to me in my first thought. Thank you for the idea. I will take more time to think it over to see it can help to reduce the complexity of tile register allocation. Yuanke From: Philip Reames <listmail at philipreames.com> Sent: Saturday, August 15, 2020 11:29 AM To: Luo, Yuanke <yuanke.luo at intel.com>; llvm-dev at lists.llvm.org; florian_hahn at

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

2012 Jul 27

[LLVMdev] Vector promotion broken for <2 x [i8|i16]>

Vector promotion which is new in LLVM 3.1 is broken for sub32 bit types. The problem is in the VectorLegalizer::PromoteVectorOp. The function getTypeToPromoteTo will return a <2 x i32> for a <2 x i8>, <2 x i16> or <4 x i8>. The problem is that there are no vectors of size 1 defined for i32 or i16. The attached patch fixes these issues. This can be reproduced by setting in

similar to: [LLVMdev] Unnecessary i16 -> i32 type promotion