mohie pokhriyal via llvm-dev
2017-Oct-21 11:51 UTC
[llvm-dev] Finding the entry point function in a LLVM IR
Hi , Given the following LLVM IR : define i32 @foo(i32 %l) #0 { entry: %add = add nsw i32 %l, 3 ret i32 %add } ; Function Attrs: nounwind ssp uwtable define i32 @boo(i32 %k) #0 { entry: %call = call i32 @foo(i32 %k) ret i32 %call } ; Function Attrs: nounwind ssp uwtable define i32 @main() #0 { entry: %add = add nsw i32 1, 2 %add2 = add nsw i32 2, %add ret i32 %add2 } I want to be able to find out that main is the entry point function of the program. main and boo both do not have any predecessors or successors , such that I can make a cfg to figure out who’s calling whom ? Is there a way I can achieve this ? Thanks a ton for the help!! Thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20171021/7ef0c592/attachment.html>
David Chisnall via llvm-dev
2017-Oct-23 08:03 UTC
[llvm-dev] Finding the entry point function in a LLVM IR
On 21 Oct 2017, at 12:51, mohie pokhriyal via llvm-dev <llvm-dev at lists.llvm.org> wrote:> > I want to be able to find out that main is the entry point function of the program. > main and boo both do not have any predecessors or successors , such that I can make a cfg to figure out who’s calling whom ? > > Is there a way I can achieve this ?The fact that main is the entry point is not known to LLVM (except in a couple of places that special-case main, such as the internalise pass), because it is an artefact of C/C++, not a generic property. On most *NIX platforms, the real entry point for a program is something like __start or _start, which then call main. In most compilation units, there is no single entry point, because they do not contain the program entry point and so can be entered by any externally visible function. It might help if you explained why you need this. David
mats petersson via llvm-dev
2017-Oct-23 11:12 UTC
[llvm-dev] Finding the entry point function in a LLVM IR
If you want to know which functions are (or may be) called from where, in the entire program, then you will need to do some sort of "LLVM-IR Linking" (there are tools that will do that for you, such as "llvm-link"). Of course, even then, there's possible cases where it's impossible to know whether a function is ACTUALLY called until at runtime - function pointers, including those in vtables, may or may not actually get called, depending on the exact dynamic behaviour of the code. Then there will be functions implemented outside of the LLVM-IR for the program anyway. atexit is a good example of a function that takes a function pointer. Your code will not know what (if anything) atexit does with that function pointer, or if/when that function gets called. Of course, we, as humans, know how atexit works and when the function gets called, but some code will not know that unless you write code to understand it's behaviour - and there are many types of functions that take a pointer to a function, some of which are much more complex than atexit - for example signal handlers or call-backs that gets called on errors. Imagine a function being called only when a memory allocation fails... As David says, it's of course highly dependent on what you are trying to achieve, exactly what approach you should take (or if there is an approach that is meaningful at all). On 23 October 2017 at 09:03, David Chisnall via llvm-dev < llvm-dev at lists.llvm.org> wrote:> On 21 Oct 2017, at 12:51, mohie pokhriyal via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > > > > I want to be able to find out that main is the entry point function of > the program. > > main and boo both do not have any predecessors or successors , such that > I can make a cfg to figure out who’s calling whom ? > > > > Is there a way I can achieve this ? > > The fact that main is the entry point is not known to LLVM (except in a > couple of places that special-case main, such as the internalise pass), > because it is an artefact of C/C++, not a generic property. On most *NIX > platforms, the real entry point for a program is something like __start or > _start, which then call main. In most compilation units, there is no > single entry point, because they do not contain the program entry point and > so can be entered by any externally visible function. > > It might help if you explained why you need this. > > David > > _______________________________________________ > LLVM Developers mailing list > llvm-dev at lists.llvm.org > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20171023/f3c02815/attachment.html>