Eli, I understand the need to widen unsigned types. However, I ran into a problem with the current GEP/subscript that clang has. AVX2 gather instructions rely on a 64-bit base pointer and a vector of 32-bit indices. Usually, when vectorizing programs, it is possible to detect that the GEP base pointer is uniform and that the index is variant (and needs to be vectorized). This works really nice for 32-bit programs, except that on 64-bit systems clang drops the information that the index is a 32-bit value. If I had this information then I would have been able to use the AVX2 instruction in many cases. Do you see a way to overcome this problem ? Since InstCombiner has this optimization, can we talk about dropping this optimization in clang ? Thanks, Nadav -----Original Message----- From: Eli Friedman [mailto:eli.friedman at gmail.com] Sent: Saturday, December 10, 2011 23:48 To: Rotem, Nadav Cc: cfe-dev at cs.uiuc.edu Developers Subject: Re: [cfe-dev] GEP index type On Sat, Dec 10, 2011 at 9:20 AM, Rotem, Nadav <nadav.rotem at intel.com> wrote:> Hi, > > I have a question regarding the LLVM-IR which is generated by clang. When I compile the code below on a 64-bit system, clang generates a GEP instruction with a 64 bit index. > > int demo(int *A, short i) { return A[i]; } > > becomes: > > %4 = sext i16 %3 to i64 > %6 = getelementptr inbounds i32* %5, i64 %4 > > I would expect to see an i16 index, since the index is known to be within the 0..16 bit range. Why does clang widen the index to a 64bit value ?Primarily because this exposes the sign extension to the optimizer, which is generally good; iIRC, instcombine does the same thing. Also, in some cases we have to do the extension explicitly to preserve the IR semantics: for example, suppose "i" was declared as an unsigned short. -Eli --------------------------------------------------------------------- Intel Israel (74) Limited This e-mail and any attachments may contain confidential material for the sole use of the intended recipient(s). Any review or distribution by others is strictly prohibited. If you are not the intended recipient, please contact the sender and delete all copies.
Hi Nadav,> I understand the need to widen unsigned types. However, I ran into a problem with the current GEP/subscript that clang has. > > AVX2 gather instructions rely on a 64-bit base pointer and a vector of 32-bit indices. Usually, when vectorizing programs, it is possible to detect that the GEP base pointer is uniform and that the index is variant (and needs to be vectorized). This works really nice for 32-bit programs, except that on 64-bit systems clang drops the information that the index is a 32-bit value. If I had this information then I would have been able to use the AVX2 instruction in many cases. > > Do you see a way to overcome this problem ? Since InstCombiner has this optimization, can we talk about dropping this optimization in clang ?I think this is an LLVM optimization: GEP indices are extended to an index type with the same size as a pointer. Ciao, Duncan.> Nadav > > > > > -----Original Message----- > From: Eli Friedman [mailto:eli.friedman at gmail.com] > Sent: Saturday, December 10, 2011 23:48 > To: Rotem, Nadav > Cc: cfe-dev at cs.uiuc.edu Developers > Subject: Re: [cfe-dev] GEP index type > > On Sat, Dec 10, 2011 at 9:20 AM, Rotem, Nadav<nadav.rotem at intel.com> wrote: >> Hi, >> >> I have a question regarding the LLVM-IR which is generated by clang. When I compile the code below on a 64-bit system, clang generates a GEP instruction with a 64 bit index. >> >> int demo(int *A, short i) { return A[i]; } >> >> becomes: >> >> %4 = sext i16 %3 to i64 >> %6 = getelementptr inbounds i32* %5, i64 %4 >> >> I would expect to see an i16 index, since the index is known to be within the 0..16 bit range. Why does clang widen the index to a 64bit value ? > > Primarily because this exposes the sign extension to the optimizer, > which is generally good; iIRC, instcombine does the same thing. > > Also, in some cases we have to do the extension explicitly to preserve > the IR semantics: for example, suppose "i" was declared as an unsigned > short. > > -Eli > --------------------------------------------------------------------- > Intel Israel (74) Limited > > This e-mail and any attachments may contain confidential material for > the sole use of the intended recipient(s). Any review or distribution > by others is strictly prohibited. If you are not the intended > recipient, please contact the sender and delete all copies. > > > _______________________________________________ > LLVM Developers mailing list > LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu > http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
Duncan, Eli said that Clang also has this optimization. Also, try to compile this code using clang w/o optimizations: int demo(int *A, short i) { return A[i]; } The generated GEP will have i64 indices. Thanks, Nadav -----Original Message----- From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu] On Behalf Of Duncan Sands Sent: Sunday, December 11, 2011 10:39 To: llvmdev at cs.uiuc.edu Subject: Re: [LLVMdev] [cfe-dev] GEP index type Hi Nadav,> I understand the need to widen unsigned types. However, I ran into a problem with the current GEP/subscript that clang has. > > AVX2 gather instructions rely on a 64-bit base pointer and a vector of 32-bit indices. Usually, when vectorizing programs, it is possible to detect that the GEP base pointer is uniform and that the index is variant (and needs to be vectorized). This works really nice for 32-bit programs, except that on 64-bit systems clang drops the information that the index is a 32-bit value. If I had this information then I would have been able to use the AVX2 instruction in many cases. > > Do you see a way to overcome this problem ? Since InstCombiner has this optimization, can we talk about dropping this optimization in clang ?I think this is an LLVM optimization: GEP indices are extended to an index type with the same size as a pointer. Ciao, Duncan.> Nadav > > > > > -----Original Message----- > From: Eli Friedman [mailto:eli.friedman at gmail.com] > Sent: Saturday, December 10, 2011 23:48 > To: Rotem, Nadav > Cc: cfe-dev at cs.uiuc.edu Developers > Subject: Re: [cfe-dev] GEP index type > > On Sat, Dec 10, 2011 at 9:20 AM, Rotem, Nadav<nadav.rotem at intel.com> wrote: >> Hi, >> >> I have a question regarding the LLVM-IR which is generated by clang. When I compile the code below on a 64-bit system, clang generates a GEP instruction with a 64 bit index. >> >> int demo(int *A, short i) { return A[i]; } >> >> becomes: >> >> %4 = sext i16 %3 to i64 >> %6 = getelementptr inbounds i32* %5, i64 %4 >> >> I would expect to see an i16 index, since the index is known to be within the 0..16 bit range. Why does clang widen the index to a 64bit value ? > > Primarily because this exposes the sign extension to the optimizer, > which is generally good; iIRC, instcombine does the same thing. > > Also, in some cases we have to do the extension explicitly to preserve > the IR semantics: for example, suppose "i" was declared as an unsigned > short. > > -Eli > --------------------------------------------------------------------- > Intel Israel (74) Limited > > This e-mail and any attachments may contain confidential material for > the sole use of the intended recipient(s). Any review or distribution > by others is strictly prohibited. If you are not the intended > recipient, please contact the sender and delete all copies. > > > _______________________________________________ > LLVM Developers mailing list > LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu > http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev_______________________________________________ LLVM Developers mailing list LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev --------------------------------------------------------------------- Intel Israel (74) Limited This e-mail and any attachments may contain confidential material for the sole use of the intended recipient(s). Any review or distribution by others is strictly prohibited. If you are not the intended recipient, please contact the sender and delete all copies.
Possibly Parallel Threads
- [LLVMdev] [cfe-dev] GEP index type
- [LLVMdev] [llvm-commits] Vectors of Pointers and Vector-GEP
- [LLVMdev] [llvm-commits] Vectors of Pointers and Vector-GEP
- [LLVMdev] [llvm-commits] Vectors of Pointers and Vector-GEP
- [LLVMdev] [llvm-commits] Vectors of Pointers and Vector-GEP