Chuck Zhao
2011-Jan-21 20:50 UTC
[LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
I need to figure out all LLVM Instructions that may write to memory. In http://llvm.org/docs/tutorial/OCamlLangImpl7.html, it mentions that "In LLVM, all memory accesses are explicit with load/store instructions, and it is carefully designed not to have (or need) an "address-of" operator." I take this as "StoreInst is the only one that writes to memory". However, this doesn't seem to be enough. Consider: ... int a, b, d; d = a + b; ... The above code is turned into LLVM IR: %0 = load i32* @a, align 4 %1 = load i32* @b, align 4 %2 = add nsw i32 %1, %0 store i32 %2, i32* @d, align 4 Is it possible that temps such as %0, %1 and/or %2 will NOT being register allocated later in the compilation stage, and thus left in memory? The above code, when converted back to C level, looks like this: ... unsigned int llvm_cbe_tmp__6; unsigned int llvm_cbe_tmp__7; unsigned int llvm_cbe_tmp__8; unsigned int llvm_cbe_tmp__9; llvm_cbe_tmp__6 = *(&a); llvm_cbe_tmp__7 = *(&b); llvm_cbe_tmp__8 = ((unsigned int )(((unsigned int )llvm_cbe_tmp__7) + ((unsigned int )llvm_cbe_tmp__6))); *(&d) = llvm_cbe_tmp__8; llvm_cbe_tmp__9 = /*tail*/ printf(((&_OC_str.array[((signed int )0u)])), llvm_cbe_tmp__8); ... It seems the compiler-generated temps are _actually_ left on stack, and writes to them are actually writes to stack memory (via load, add, ...). I am confused here. Could somebody help to clarify it? Thank you Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20110121/a53772e3/attachment.html>
John Criswell
2011-Jan-21 22:33 UTC
[LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
On 1/21/11 2:50 PM, Chuck Zhao wrote:> I need to figure out all LLVM Instructions that may write to memory. > > In http://llvm.org/docs/tutorial/OCamlLangImpl7.html, it mentions that > "In LLVM, all memory accesses are explicit with load/store > instructions, and it is carefully designed not to have (or need) an > "address-of" operator." > > I take this as "StoreInst is the only one that writes to memory".There are intrinsic functions which write to memory also, such as memcpy.> > However, this doesn't seem to be enough.Your observation is correct. Strictly speaking, any instruction can write to memory after code generation because it may access a stack spill slot or a function parameter which the ABI places on the stack. When the Language Reference Manual talks about writing to memory, it is talking about writing to memory that is visible at the LLVM IR level. The stack frame is invisible at the LLVM IR level. Put another way, "memory" is a set of memory locations which can be explicitly accessed by LLVM load and store instructions and are not in SSA form; it is not all of the memory within the computer. If you're interested in finding instructions that write to RAM (including writes to stack spill slots), it may be better to work on Machine Instructions within the code generator framework. -- John T.> > Consider: > ... > int a, b, d; > d = a + b; > ... > > The above code is turned into LLVM IR: > %0 = load i32* @a, align 4 > %1 = load i32* @b, align 4 > %2 = add nsw i32 %1, %0 > store i32 %2, i32* @d, align 4 > > Is it possible that temps such as %0, %1 and/or %2 will NOT being register allocated later in the compilation stage, and thus left in memory? > > The above code, when converted back to C level, looks like this: > ... > unsigned int llvm_cbe_tmp__6; > unsigned int llvm_cbe_tmp__7; > unsigned int llvm_cbe_tmp__8; > unsigned int llvm_cbe_tmp__9; > > llvm_cbe_tmp__6 = *(&a); > llvm_cbe_tmp__7 = *(&b); > llvm_cbe_tmp__8 = ((unsigned int )(((unsigned int )llvm_cbe_tmp__7) + ((unsigned int )llvm_cbe_tmp__6))); > *(&d) = llvm_cbe_tmp__8; > llvm_cbe_tmp__9 = /*tail*/ printf(((&_OC_str.array[((signed int )0u)])), llvm_cbe_tmp__8); > ... > > It seems the compiler-generated temps are _actually_ left on stack, and writes to them are actually writes to stack memory (via load, add, ...). > > > > I am confused here. > Could somebody help to clarify it? > > Thank you > > Chuck > > >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20110121/e5a2fa13/attachment.html>
Chuck Zhao
2011-Jan-21 23:00 UTC
[LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
John, Thanks for the reply. I agree with your comments that the "Memory" LLVM Spec refers to doesn't include stack. Let me leverage a bit further: If I need to work on high-level IRs (not machine dependent, not in the code-gen stage), is it reasonable to assume that ALL LLVM IRs that have a result field will have potential to write stack? E.g. <result> = add<ty> <op1>,<op2> /; yields {ty}:result/ br i1<cond>, label<iftrue>, label<iffalse> br label<dest> /; Unconditional branch/ ADD can (potential) write stack to store its result, while BR will NEVER write stack because its doesn't have a result. Thank you Chuck On 1/21/2011 5:33 PM, John Criswell wrote:> On 1/21/11 2:50 PM, Chuck Zhao wrote: >> I need to figure out all LLVM Instructions that may write to memory. >> >> In http://llvm.org/docs/tutorial/OCamlLangImpl7.html, it mentions that >> "In LLVM, all memory accesses are explicit with load/store >> instructions, and it is carefully designed not to have (or need) an >> "address-of" operator." >> >> I take this as "StoreInst is the only one that writes to memory". > > There are intrinsic functions which write to memory also, such as memcpy. >> >> However, this doesn't seem to be enough. > > Your observation is correct. Strictly speaking, any instruction can > write to memory after code generation because it may access a stack > spill slot or a function parameter which the ABI places on the stack. > > When the Language Reference Manual talks about writing to memory, it > is talking about writing to memory that is visible at the LLVM IR > level. The stack frame is invisible at the LLVM IR level. Put > another way, "memory" is a set of memory locations which can be > explicitly accessed by LLVM load and store instructions and are not in > SSA form; it is not all of the memory within the computer. > > If you're interested in finding instructions that write to RAM > (including writes to stack spill slots), it may be better to work on > Machine Instructions within the code generator framework. > > -- John T. > > >> >> Consider: >> ... >> int a, b, d; >> d = a + b; >> ... >> >> The above code is turned into LLVM IR: >> %0 = load i32* @a, align 4 >> %1 = load i32* @b, align 4 >> %2 = add nsw i32 %1, %0 >> store i32 %2, i32* @d, align 4 >> >> Is it possible that temps such as %0, %1 and/or %2 will NOT being register allocated later in the compilation stage, and thus left in memory? >> >> The above code, when converted back to C level, looks like this: >> ... >> unsigned int llvm_cbe_tmp__6; >> unsigned int llvm_cbe_tmp__7; >> unsigned int llvm_cbe_tmp__8; >> unsigned int llvm_cbe_tmp__9; >> >> llvm_cbe_tmp__6 = *(&a); >> llvm_cbe_tmp__7 = *(&b); >> llvm_cbe_tmp__8 = ((unsigned int )(((unsigned int )llvm_cbe_tmp__7) + ((unsigned int )llvm_cbe_tmp__6))); >> *(&d) = llvm_cbe_tmp__8; >> llvm_cbe_tmp__9 = /*tail*/ printf(((&_OC_str.array[((signed int )0u)])), llvm_cbe_tmp__8); >> ... >> >> It seems the compiler-generated temps are _actually_ left on stack, and writes to them are actually writes to stack memory (via load, add, ...). >> >> >> >> I am confused here. >> Could somebody help to clarify it? >> >> Thank you >> >> Chuck >> >> >> >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20110121/9276a7a2/attachment.html>
Duncan Sands
2011-Jan-22 11:17 UTC
[LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
Hi Chuck,> I need to figure out all LLVM Instructions that may write to memory.I->mayWriteToMemory() Ciao, Duncan.
Chuck Zhao
2011-Jan-22 14:07 UTC
[LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
Duncan, I looked at this function even before starting the discussions. I think it respects LLVM's principle that only Loads and Stores (+VAArg, and maybe Call) that can access global memory, but doesn't address the issues of accessing stack frames or stack objects. Thank you for the hint. Chuck On 1/22/2011 6:17 AM, Duncan Sands wrote:> Hi Chuck, > >> I need to figure out all LLVM Instructions that may write to memory. > I->mayWriteToMemory() > > Ciao, Duncan. > _______________________________________________ > LLVM Developers mailing list > LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu > http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
Apparently Analagous Threads
- [LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
- [LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
- [LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
- [LLVMdev] all LLVM Instructions that may write to memory -- other than StoreInst?
- [LLVMdev] how to handle inline asm code when converting c++ to C