thr3ads.net - llvm dev - [llvm-dev] What can the optimizer assume about the memory a global function pointer points to? [Apr 2020]

If this information is useful, please help other people find it:
Share via:

Manuel Jacob via llvm-dev

2020-Apr-16 00:45 UTC

[llvm-dev] What can the optimizer assume about the memory a global function pointer points to?

A function declaration declares a function pointer to the memory where 
the machine code will be at runtime. Besides providing the ability to 
call the function, that pointer can also be used, after bitcasting it, 
to modify the machine code implementing the function. What does the 
optimizer assume about the memory containing the machine code?

The following is an example where Alive2 assumes that transforming @src 
to @tgt is correct (note that @f is marked readnone):


declare i32 @f() readnone
declare void @modify_f()

define i32 @src() {
   call void @modify_f()
   %r = call i32 @f()
   ret i32 %r
}

define i32 @tgt() {
   %r = call i32 @f()
   call void @modify_f()
   ret i32 %r
}


Is this actually a correct transformation?

If yes, what is the exact rule?

Several possibilities come to my mind:

* The memory at @f is assumed to be constant. If this is the case, how 
can it be communicated to the optimizer that the memory is modified?
* In the following part of the definition of the "readnone" attribute,
"memory" includes the machine code of the callee: "On a function,
this
attribute indicates that the function computes its result (or decides to 
unwind an exception) based strictly on its arguments, without 
dereferencing any pointer arguments or otherwise accessing any mutable 
state (e.g. memory, control registers, etc) visible to caller 
functions.". However, then the following part would be inconsistent (if 
executing machine code is considered reading): "If a readnone function 
reads or writes memory visible to the program, or has other 
side-effects, the behavior is undefined.".

Hiroshi Yamauchi via llvm-dev

2020-Apr-16 16:26 UTC

head link

[llvm-dev] What can the optimizer assume about the memory a global function pointer points to?

I am not sure if the IR spec that explicitly talks about this, but I'm
under the impression that the code memory is assumed to be constant or
abstracted out at the IR level and the IR optimizer does not need to think
about the code getting modified or just treats it as undefined behavior.

On Wed, Apr 15, 2020 at 5:46 PM Manuel Jacob via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> A function declaration declares a function pointer to the memory where
> the machine code will be at runtime. Besides providing the ability to
> call the function, that pointer can also be used, after bitcasting it,
> to modify the machine code implementing the function. What does the
> optimizer assume about the memory containing the machine code?
>
> The following is an example where Alive2 assumes that transforming @src
> to @tgt is correct (note that @f is marked readnone):
>
>
> declare i32 @f() readnone
> declare void @modify_f()
>
> define i32 @src() {
>    call void @modify_f()
>    %r = call i32 @f()
>    ret i32 %r
> }
>
> define i32 @tgt() {
>    %r = call i32 @f()
>    call void @modify_f()
>    ret i32 %r
> }
>
>
> Is this actually a correct transformation?
>
> If yes, what is the exact rule?
>
> Several possibilities come to my mind:
>
> * The memory at @f is assumed to be constant. If this is the case, how
> can it be communicated to the optimizer that the memory is modified?
> * In the following part of the definition of the "readnone"
attribute,
> "memory" includes the machine code of the callee: "On a
function, this
> attribute indicates that the function computes its result (or decides to
> unwind an exception) based strictly on its arguments, without
> dereferencing any pointer arguments or otherwise accessing any mutable
> state (e.g. memory, control registers, etc) visible to caller
> functions.". However, then the following part would be inconsistent
(if
> executing machine code is considered reading): "If a readnone function
> reads or writes memory visible to the program, or has other
> side-effects, the behavior is undefined.".
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20200416/b6bc1abc/attachment.html>

Manuel Jacob via llvm-dev

2020-Apr-16 18:46 UTC

head link

[llvm-dev] What can the optimizer assume about the memory a global function pointer points to?

On 2020-04-16 18:26, Hiroshi Yamauchi wrote:> I am not sure if the IR spec that explicitly talks about this, but I'm
> under the impression that the code memory is assumed to be constant or
> abstracted out at the IR level and the IR optimizer does not need to 
> think
> about the code getting modified or just treats it as undefined 
> behavior.
One case where the Language Reference explicitly mentions the 
possibility of modifying machine code is Prologue Data: 
http://llvm.org/docs/LangRef.html#prologue-data

So at least the prologue part is seemingly not considered constant. The 
question of what ordering restrictions are placed between calls and code 
potentially modifying the callee machine code is still unclear, though.

llvm dev - Apr 2020 - What can the optimizer assume about the memory a global function pointer points to?

[llvm-dev] What can the optimizer assume about the memory a global function pointer points to?

[llvm-dev] What can the optimizer assume about the memory a global function pointer points to?

[llvm-dev] What can the optimizer assume about the memory a global function pointer points to?