thr3ads.net - llvm dev - [LLVMdev] [RFC] Embedding command line options in bitcode (PR21471) [Dec 2014]

If this information is useful, please help other people find it:
Share via:

Akira Hatanaka

2014-Dec-03 00:31 UTC

[LLVMdev] [RFC] Embedding command line options in bitcode (PR21471)

On Tue, Dec 2, 2014 at 3:21 PM, Eric Christopher <echristo at gmail.com>
wrote:
> On Mon Dec 01 2014 at 4:22:15 PM Bob Wilson <bob.wilson at apple.com>
wrote:
>
>> Thanks for your feedback, Eric.
>>
>> I still think we may be talking past each other a little bit, but
rather
>> than
>>
>
> Might be, sorry if so :(
>
>
>> delving further into the details right now, I’ve suggested to Akira
that
>> he look into how we should handle other kinds of options. I’m hoping
that
>> as we look at more of them, we will gain some insight into the approach
>> that we want to take here. This patch really only deals with the easy
>> cases, and whether we use Akira’s approach or not, I’m not especially
>> worried about this kind of option.
>>
>> Let’s come back to this in a little while after we’ve looked at what to
>> do for some other options…..
>>
>>
> I apologize if this seems a bit rambly, but I'm trying to get a lot of
> thoughts on this down:
>
> FWIW I'm looking at some of this at the moment which is why I spoke to
a
> few of the different options and needing to control TargetMachine creation.
> Just to continue with some of the discussion (at least to point some things
> the right direction and get feedback) let's think about the -target-abi
> option for ARM. It sets the ABI for the backend on the target machine. It
> probably shouldn't change on a per function basis (wait, except for the
> calling convention which can be changed via an attribute, fun!), so we want
> it to control global data layout and valid types which means we need it on
> the TargetMachine. The question is how to pass this information into the
> backend for such an option - right now, if we wanted, we could punt to
> using the command line options that currently exist and the existing
> ParseCommandLineOptions call from clang.
>
>When option -target-abi=apcs-gnu for ARM is used for clang,
EmitAssemblyHelper::CreateTargetMachine
passes FeatruresStr="+apcs" to Target::createTargetMachine. Do we have
anything else we have to pass to the backend in this case?

> That said, it's a pretty awful hack to use them. For some of my
particular
> needs I've got some code around unifying the DataLayout creation
between
> the front end and back end, but it's a partial solution for the general
> problem and I'll probably run into more issues in this arena.
>
> So, trying to come up with ways to communicate all of this via the API so
> that we can set ABIs for targets that need it (ARM, MIPS, others), as well
> as probably other options (anything passed via -backend-option is a good
> candidate).
>
> One thought is expanding and/or deriving target specific versions of
> TargetOptions and populating that on creation from the front end. We'd
want
> to be careful with the target specific bits and constructing defaults,
> probably something off of the bits in Targets.cpp would be appropriate. A
> lot of this type of code leads us down the TargetSupport library or
> something of the sort - classes to help describe or create backend
> constructs.
>
> Anyhow, this is my current thinking on how to do API building of
> TargetMachine etc. (Alternately a TargetMachineBuilder? But that sounds
> complicated in all of the same ways without any particular gain).
>
> -eric
>
>
>> On Dec 1, 2014, at 10:53 AM, Eric Christopher <echristo at
gmail.com> wrote:
>>
>> Hi Bob,
>>
>> I've been giving this some thought, comments inline:
>>
>> On Wed Nov 19 2014 at 6:24:18 PM Bob Wilson <bob.wilson at
apple.com> wrote:
>>
>>> On Nov 19, 2014, at 4:52 PM, Eric Christopher <echristo at
gmail.com>
>>> wrote:
>>>
>>>
>>>
>>> On Wed Nov 19 2014 at 4:39:42 PM Akira Hatanaka <ahatanak at
gmail.com>
>>> wrote:
>>>
>>>> On Wed, Nov 19, 2014 at 3:28 PM, Eric Christopher <echristo
at gmail.com>
>>>> wrote:
>>>>
>>>>> So my general concern here is that lots of command line
options that
>>>>> don't need to be or shouldn't be IR level
constructs become oddly named
>>>>> string options. It's bad enough that we're doing
that with the target cpu
>>>>> and target feature string let alone the rest of it.
>>>>>
>>>>> If we're talking about things that end up changing the
subtarget used
>>>>> we definitely need to make this part of the lookup key - at
which point
>>>>> we're going to want a unified interface to all of them.
I haven't applied
>>>>> your patch to see what the IR changes look like here.
>>>>>
>>>>>
>>>> With the patches applied, the attributes clang embeds in the IR
look
>>>> like this:
>>>>
>>>> $ clang -cc1 ... -mllvm -arm-long-calls -mllvm -arm-reserve-r9
>>>> -menable-no-nans
>>>>
>>>> attributes #0 = { nounwind readonly ssp
"NoNaNsFPMath"
>>>> "arm-long-calls"="1"
"arm-reserve-r9"="1"
"less-precise-fpmad"="false"
>>>> "no-frame-pointer-elim"="true"
"no-frame-pointer-elim-non-leaf"
>>>> "no-infs-fp-math"="false"
"stack-protector-buffer-size"="8"
>>>> "target-cpu"="arm7tdmi"
"target-features"="-neon,-crypto,+apcs"
>>>> "unsafe-fp-math"="false"
"use-soft-float"="true" }
>>>>
>>>> I'm guessing you want the front-end to convert command line
options
>>>> like "arm-long-calls" or "arm-reserve-r9"
to something that doesn't look as
>>>> odd?
>>>>
>>>
>>> Right, that was part of the idea, the rest is that we define things
that
>>> actually make sense as front end command line options and use those
to
>>> communicate via IR constructs or api calls I.e. reserved registers
don't
>>> need to be a specific back end option, but rather can be passed
along when
>>> creating a target machine, etc.
>>>
>>> -eric
>>>
>>>
>>> No, reserved registers (I assume you’re referring to
“arm-reserve-r9” in
>>> Akira’s example) don’t necessarily need to be backend options. But,
those
>>> kinds of options do need to be recorded in the IR and the target
machine
>>> needs to reflect the relevant IR settings. Otherwise, we’ll still
be left
>>> with a broken LTO model. E.G., I build one LTO object with
“arm-reserve-r9”
>>> and another without it, but the LTO compilation still needs to
respect
>>> those options for the code in each object. The details of how a
particular
>>> option should be handled will vary depending on the option, and we
need to
>>> discuss each one separately.
>>>
>>>
>> ABI breaking options should probably be considered part of
>> TargetMachine/Subtarget creation and handled accordingly (if it's
subtarget
>> creation it can be on the function as a feature string attribute, if it
>> affects global code generation it needs to be at TargetMachine creation
>> time).
>>
>>
>>> It’s pretty clear, though, that there is a class of backend options
that
>>> translate nicely to function attributes. This RFC is about the
mechanism
>>> that we use to handle those (without specifying which specific
options will
>>> be handled that way).
>>>
>>>
>> Fair enough. For function attributes I can see a desire to have (as
Chris
>> posted in an email from someone at synopsys) a generic target specific
>> dictionary mapping on functions for backend specific uses. I just
haven't
>> seen anything in the thread that would be applicable for this rather
than
>> an actual function attribute. That said, I would like some sort of
>> delineation for target function attributes as well. Thoughts on that?
>> Perhaps it could look like the string dictionary we've mentioned,
maybe not.
>>
>>
>>>
>>>
>>>> From looking at the patch it seems that the command line option
for
>>>>> NaNs should be a general attribute on the function rather
than a
>>>>> TargetMachine option anyhow, so it seems that should be
migrated down.
>>>>> Whether or not it is exposed as an llvm attribute or a
string based key
>>>>> value pair is an interesting discussion. Also, would we
want to, instead,
>>>>> expose the backend options as clang IR level options? Seems
to make sense
>>>>> from that perspective and then they can add the appropriate
attributes to
>>>>> the functions themselves. (-mlongcalls for the long calls
option for
>>>>> example).
>>>>>
>>>>> Thanks!
>>>>>
>>>>> -eric
>>>>>
>>>>
>>> Again, I think we can evaluate specific options on a case-by-case
basis.
>>> For some of them, it might make sense to expose them as clang
options. Like
>>> you, I had previously assumed we would teach the front-end about
all the
>>> backend options. I was a little surprised by the approach Akira
suggested
>>> here, but after giving it some thought, I think it’s a great idea.
There
>>> are a LOT of target-specific backend options. Some of them do not
need to
>>> be IR, but many do. Teaching clang about all of them would be a lot
of
>>> work, and I don’t see any clear benefit from that. This approach
lets clang
>>> remain relatively independent of the backend options. We still need
to
>>> think carefully about which backend options use this mechanism, of
course.
>>>
>>>
>> I'm unconvinced here...
>>
>>
>>> Are there other reasons why you think we should add all those
options to
>>> clang? I just don’t see much to be gained by that.
>>>
>>>
>> .... I think that anything you'd like to expose should probably be
a
>> front end option. If nothing else I don't see how you plan on
annotating
>> the IR. Basically you're constructing a parallel option hierarchy
and that
>> seems... redundant. I agree that it's nice for tools to be able to
override
>> things in the backend - for testing if nothing else. We do this a lot
by
>> checking options and values in the code. A way of streamlining this
would
>> be nice - but I don't think that's the goal of these patches
no?
>>
>>
>>> Do you have objections to the use of string-based key/value pairs?
I
>>> believe the intention all along, going back to Bill’s work on this
a few
>>> years ago, was that we would add new attributes for
target-independent
>>> things and use strings for the target-specific options. That seems
like a
>>> reasonable starting point to me.
>>>
>>>
>> Only some as I mentioned above - I haven't seen anything that needs
to be
>> target specific in this way, but I'm open to the idea :)
>>
>> As a note, this is all for function level things. We need a solution
for
>> non-function/module level attributes and I think that's going to
look like
>> API calls into the backend for TargetMachine creation - there may be a
list
>> of things that would turn into module level attributes, but that is a
>> direction fraught with danger and I'd like to avoid that as much as
>> possible.
>>
>> (As a note, the particular patch that Akira has is unacceptable because
>> of the changes to the subtarget lookup, but that's a separate issue
:)
>>
>> -eric
>>
>>
>>>
>>>>> On Tue Nov 18 2014 at 12:26:49 PM Akira Hatanaka
<ahatanak at gmail.com>
>>>>> wrote:
>>>>>
>>>>>> Updated patch is attached. Note this is just a
work-in-progress patch
>>>>>> and I plan to address the feedback comments later if
this patch is in the
>>>>>> right direction.
>>>>>>
>>>>>> This is how the command line options are parsed and
used by the
>>>>>> backend passes:
>>>>>>
>>>>>> 1. Tools such as clang and llc call
cl::ParseCommandLineOptions. Any
>>>>>> of the options that should be written to the bitcode as
function or module
>>>>>> attributes (these options are declared as CodeGenOpt,
which is a subclass
>>>>>> of cl::opt) are saved to Function or ModuleIntAttrs in
CodeGenOptions.cpp.
>>>>>> 2. setFunctionAttribute is called to move the function
and module
>>>>>> options saved in step 1 to the bitcode. Also, options
stored in
>>>>>> TargetOptions are moved to the bitcode too.
Pre-existing attributes in the
>>>>>> bitcode are overwritten by the options in the command
line.
>>>>>> 3. The backend passes and subtarget classes call
getFunctionAttribute
>>>>>> to read the attributes stored to the bitcode in step 2.
>>>>>> Function::hasFnAttribute can be called directly (for
example, NoNaNsFPMath
>>>>>> in the patch), if the default value is known to be
"false".
>>>>>>
>>>>>> I also made the following changes in the patch:
>>>>>>
>>>>>> 1. In the constructor of MachineFunction, call the
version of
>>>>>> getSubtargetImpl that takes a Function parameter, so
that it gets the
>>>>>> Subtarget specific to the Function being compiled.
>>>>>> 2. Change ARMTargetMachine::SubtargetMap to be a
DenseMap<Function*,
>>>>>> unique_ptr<ARMSubtarget>>. This is just a
temporary change to ease
>>>>>> debugging and should be reverted to a StringMap or
changed to another type
>>>>>> of map later.
>>>>>>
>>>>>>
>>>>>> On Mon, Nov 17, 2014 at 11:40 AM, Eric Christopher
<echristo at gmail.
>>>>>> com> wrote:
>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On Mon Nov 17 2014 at 9:56:40 AM Bob Wilson
<bob.wilson at apple.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>>
>>>>>>>> > On Nov 14, 2014, at 2:44 PM, Duncan P. N.
Exon Smith <
>>>>>>>> dexonsmith at apple.com> wrote:
>>>>>>>> >
>>>>>>>> > +chrisb
>>>>>>>> >
>>>>>>>> >> On 2014-Nov-13, at 16:33, Akira
Hatanaka <ahatanak at gmail.com>
>>>>>>>> wrote:
>>>>>>>> >>
>>>>>>>> >> I'm working on fixing PR21471,
which is about embedding codegen
>>>>>>>> command line options into the bitcode as
function or module-level
>>>>>>>> attributes so that they don't get ignored
when doing LTO.
>>>>>>>> >>
>>>>>>>> >>
http://llvm.org/bugs/show_bug.cgi?id=21471
>>>>>>>> >>
>>>>>>>> >> I have an initial patch (attached to
this email) which enables
>>>>>>>> clang/llvm to recognize one command line
option, write it to the IR, and
>>>>>>>> read it out in a backend pass. I'm looking
to get feedback from the
>>>>>>>> community on whether I'm headed in the
right direction or whether there are
>>>>>>>> alternate ideas before I go all the way on
fixing the PR. Specifically, I'd
>>>>>>>> like to know the answers to the following
questions:
>>>>>>>> >>
>>>>>>>> >> 1. How do we make sure we continue to
be able to use the command
>>>>>>>> line options we've been using for llc and
other tools?
>>>>>>>> >> 2. How to handle cases where two
functions in a module have
>>>>>>>> different sets of command line options?
>>>>>>>> >> 3. Where should the command line
options or module/function
>>>>>>>> attributes be stored once they are read out
from the IR?
>>>>>>>> >>
>>>>>>>> >> The short description of the approach
I took in my patch is that
>>>>>>>> command line options that are important to
codegen are collected by
>>>>>>>> cl::ParseCommandLineOptions, written to the
bitcode as function or module
>>>>>>>> attributes, and read out directly by the
optimization passes that need
>>>>>>>> them. cl::opt options are replaced with
CodeGenOpt options (subclass of
>>>>>>>> cl::opt), which are needed only to parse the
command line and provide the
>>>>>>>> default value when the corresponding options
are not in the bitcode.
>>>>>>>> >
>>>>>>>> > I like this approach, since it means the
frontend doesn't have to
>>>>>>>> understand
>>>>>>>> > options in order to pass them on to the
backend.
>>>>>>>>
>>>>>>>> I agree. It’s not the approach that was I was
expecting, but after
>>>>>>>> reading the patches, I like it.
>>>>>>>>
>>>>>>>> If we can get consensus on the approach, I
suggest that we stage
>>>>>>>> the work to first adopt Akira’s patches and
then we can incrementally
>>>>>>>> convert various options over to use it. There
may be complications for some
>>>>>>>> options, so I’d like to see separate patches
for each one (possibly with a
>>>>>>>> few similar options combined in one patch).
>>>>>>>>
>>>>>>>>
>>>>>>> I'm not sold quite yet :)
>>>>>>>
>>>>>>> Akira: Can you show an example of how you expect
this to work in the
>>>>>>> IR and how the backend is going to process?
>>>>>>>
>>>>>>> Thanks.
>>>>>>>
>>>>>>> -eric
>>>>>>>
>>>>>>>
>>>>>>>> >
>>>>>>>> > The static variables should be
straightforward to migrate to an
>>>>>>>> LLVMContext
>>>>>>>> > once ParseCommandLineOptions stores things
there instead of in
>>>>>>>> globals.
>>>>>>>> >
>>>>>>>> >> diff --git
a/lib/CodeGen/CodeGenOption.cpp
>>>>>>>> b/lib/CodeGen/CodeGenOption.cpp
>>>>>>>> >> new file mode 100644
>>>>>>>> >> index 0000000..2d74c2f
>>>>>>>> >> --- /dev/null
>>>>>>>> >> +++ b/lib/CodeGen/CodeGenOption.cpp
>>>>>>>> >> @@ -0,0 +1,59 @@
>>>>>>>> >> +//===- CodeGen/CodeGenOptions.cpp -
Code-gen option.
>>>>>>>>  --*- C++ -*-===//
>>>>>>>> >> +//
>>>>>>>> >> +//                     The LLVM
Compiler Infrastructure
>>>>>>>> >> +//
>>>>>>>> >> +// This file is distributed under the
University of Illinois
>>>>>>>> Open Source
>>>>>>>> >> +// License. See LICENSE.TXT for
details.
>>>>>>>> >> +//
>>>>>>>> >>
+//===------------------------------------------------------
>>>>>>>> ----------------===//
>>>>>>>> >> +//
>>>>>>>> >>
+//===------------------------------------------------------
>>>>>>>> ----------------===//
>>>>>>>> >> +
>>>>>>>> >> +#include
"llvm/CodeGen/CodeGenOption.h"
>>>>>>>> >> +#include
"llvm/IR/Attributes.h"
>>>>>>>> >> +#include "llvm/IR/Module.h"
>>>>>>>> >> +
>>>>>>>> >> +using namespace llvm;
>>>>>>>> >> +
>>>>>>>> >> +static std::map<std::string,
bool> FunctionBoolAttrs;
>>>>>>>> >> +static std::map<std::string,
bool> ModuleBoolAttrs;
>>>>>>>> >> +
>>>>>>>> >
>>>>>>>> > @Chris, should these be using
ManagedStatic?
>>>>>>>> >
>>>>>>>> >
_______________________________________________
>>>>>>>> > LLVM Developers mailing list
>>>>>>>> > LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>>> >
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>>>
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> LLVM Developers mailing list
>>>>>>>> LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>>>
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>>>
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> LLVM Developers mailing list
>>>>>>> LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>
>>>>>>
>>>
>>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20141202/f31d1118/attachment.html>

Eric Christopher

2014-Dec-03 00:38 UTC

head link

[LLVMdev] [RFC] Embedding command line options in bitcode (PR21471)

On Tue Dec 02 2014 at 4:31:43 PM Akira Hatanaka <ahatanak at gmail.com>
wrote:
> On Tue, Dec 2, 2014 at 3:21 PM, Eric Christopher <echristo at
gmail.com>
> wrote:
>
>> On Mon Dec 01 2014 at 4:22:15 PM Bob Wilson <bob.wilson at
apple.com> wrote:
>>
>>> Thanks for your feedback, Eric.
>>>
>>> I still think we may be talking past each other a little bit, but
rather
>>> than
>>>
>>
>> Might be, sorry if so :(
>>
>>
>>> delving further into the details right now, I’ve suggested to Akira
that
>>> he look into how we should handle other kinds of options. I’m
hoping that
>>> as we look at more of them, we will gain some insight into the
approach
>>> that we want to take here. This patch really only deals with the
easy
>>> cases, and whether we use Akira’s approach or not, I’m not
especially
>>> worried about this kind of option.
>>>
>>> Let’s come back to this in a little while after we’ve looked at
what to
>>> do for some other options…..
>>>
>>>
>> I apologize if this seems a bit rambly, but I'm trying to get a lot
of
>> thoughts on this down:
>>
>> FWIW I'm looking at some of this at the moment which is why I spoke
to a
>> few of the different options and needing to control TargetMachine
creation.
>> Just to continue with some of the discussion (at least to point some
things
>> the right direction and get feedback) let's think about the
-target-abi
>> option for ARM. It sets the ABI for the backend on the target machine.
It
>> probably shouldn't change on a per function basis (wait, except for
the
>> calling convention which can be changed via an attribute, fun!), so we
want
>> it to control global data layout and valid types which means we need it
on
>> the TargetMachine. The question is how to pass this information into
the
>> backend for such an option - right now, if we wanted, we could punt to
>> using the command line options that currently exist and the existing
>> ParseCommandLineOptions call from clang.
>>
>>
> When option -target-abi=apcs-gnu for ARM is used for clang,
EmitAssemblyHelper::CreateTargetMachine
> passes FeatruresStr="+apcs" to Target::createTargetMachine. Do we
have
> anything else we have to pass to the backend in this case?
>
Not yet, but the option can be a backend option rather than a subtarget
feature - and I'm probably going to do that in the short term as it's no
worse than the current status, see the next line here below :)

-eric

>
>
>> That said, it's a pretty awful hack to use them. For some of my
>> particular needs I've got some code around unifying the DataLayout
creation
>> between the front end and back end, but it's a partial solution for
the
>> general problem and I'll probably run into more issues in this
arena.
>>
>> So, trying to come up with ways to communicate all of this via the API
so
>> that we can set ABIs for targets that need it (ARM, MIPS, others), as
well
>> as probably other options (anything passed via -backend-option is a
good
>> candidate).
>>
>> One thought is expanding and/or deriving target specific versions of
>> TargetOptions and populating that on creation from the front end.
We'd want
>> to be careful with the target specific bits and constructing defaults,
>> probably something off of the bits in Targets.cpp would be appropriate.
A
>> lot of this type of code leads us down the TargetSupport library or
>> something of the sort - classes to help describe or create backend
>> constructs.
>>
>> Anyhow, this is my current thinking on how to do API building of
>> TargetMachine etc. (Alternately a TargetMachineBuilder? But that sounds
>> complicated in all of the same ways without any particular gain).
>>
>> -eric
>>
>>
>>> On Dec 1, 2014, at 10:53 AM, Eric Christopher <echristo at
gmail.com>
>>> wrote:
>>>
>>> Hi Bob,
>>>
>>> I've been giving this some thought, comments inline:
>>>
>>> On Wed Nov 19 2014 at 6:24:18 PM Bob Wilson <bob.wilson at
apple.com>
>>> wrote:
>>>
>>>> On Nov 19, 2014, at 4:52 PM, Eric Christopher <echristo at
gmail.com>
>>>> wrote:
>>>>
>>>>
>>>>
>>>> On Wed Nov 19 2014 at 4:39:42 PM Akira Hatanaka <ahatanak at
gmail.com>
>>>> wrote:
>>>>
>>>>> On Wed, Nov 19, 2014 at 3:28 PM, Eric Christopher
<echristo at gmail.com>
>>>>>  wrote:
>>>>>
>>>>>> So my general concern here is that lots of command line
options that
>>>>>> don't need to be or shouldn't be IR level
constructs become oddly named
>>>>>> string options. It's bad enough that we're
doing that with the target cpu
>>>>>> and target feature string let alone the rest of it.
>>>>>>
>>>>>> If we're talking about things that end up changing
the subtarget used
>>>>>> we definitely need to make this part of the lookup key
- at which point
>>>>>> we're going to want a unified interface to all of
them. I haven't applied
>>>>>> your patch to see what the IR changes look like here.
>>>>>>
>>>>>>
>>>>> With the patches applied, the attributes clang embeds in
the IR look
>>>>> like this:
>>>>>
>>>>> $ clang -cc1 ... -mllvm -arm-long-calls -mllvm
-arm-reserve-r9
>>>>> -menable-no-nans
>>>>>
>>>>> attributes #0 = { nounwind readonly ssp
"NoNaNsFPMath"
>>>>> "arm-long-calls"="1"
"arm-reserve-r9"="1"
"less-precise-fpmad"="false"
>>>>> "no-frame-pointer-elim"="true"
"no-frame-pointer-elim-non-leaf"
>>>>> "no-infs-fp-math"="false"
"stack-protector-buffer-size"="8"
>>>>> "target-cpu"="arm7tdmi"
"target-features"="-neon,-crypto,+apcs"
>>>>> "unsafe-fp-math"="false"
"use-soft-float"="true" }
>>>>>
>>>>> I'm guessing you want the front-end to convert command
line options
>>>>> like "arm-long-calls" or
"arm-reserve-r9" to something that doesn't look as
>>>>> odd?
>>>>>
>>>>
>>>> Right, that was part of the idea, the rest is that we define
things
>>>> that actually make sense as front end command line options and
use those to
>>>> communicate via IR constructs or api calls I.e. reserved
registers don't
>>>> need to be a specific back end option, but rather can be passed
along when
>>>> creating a target machine, etc.
>>>>
>>>> -eric
>>>>
>>>>
>>>> No, reserved registers (I assume you’re referring to
“arm-reserve-r9”
>>>> in Akira’s example) don’t necessarily need to be backend
options. But,
>>>> those kinds of options do need to be recorded in the IR and the
target
>>>> machine needs to reflect the relevant IR settings. Otherwise,
we’ll still
>>>> be left with a broken LTO model. E.G., I build one LTO object
with
>>>> “arm-reserve-r9” and another without it, but the LTO
compilation still
>>>> needs to respect those options for the code in each object. The
details of
>>>> how a particular option should be handled will vary depending
on the
>>>> option, and we need to discuss each one separately.
>>>>
>>>>
>>> ABI breaking options should probably be considered part of
>>> TargetMachine/Subtarget creation and handled accordingly (if
it's subtarget
>>> creation it can be on the function as a feature string attribute,
if it
>>> affects global code generation it needs to be at TargetMachine
creation
>>> time).
>>>
>>>
>>>> It’s pretty clear, though, that there is a class of backend
options
>>>> that translate nicely to function attributes. This RFC is about
the
>>>> mechanism that we use to handle those (without specifying which
specific
>>>> options will be handled that way).
>>>>
>>>>
>>> Fair enough. For function attributes I can see a desire to have (as
>>> Chris posted in an email from someone at synopsys) a generic target
>>> specific dictionary mapping on functions for backend specific uses.
I just
>>> haven't seen anything in the thread that would be applicable
for this
>>> rather than an actual function attribute. That said, I would like
some sort
>>> of delineation for target function attributes as well. Thoughts on
that?
>>> Perhaps it could look like the string dictionary we've
mentioned, maybe not.
>>>
>>>
>>>>
>>>>
>>>>> From looking at the patch it seems that the command line
option for
>>>>>> NaNs should be a general attribute on the function
rather than a
>>>>>> TargetMachine option anyhow, so it seems that should be
migrated down.
>>>>>> Whether or not it is exposed as an llvm attribute or a
string based key
>>>>>> value pair is an interesting discussion. Also, would we
want to, instead,
>>>>>> expose the backend options as clang IR level options?
Seems to make sense
>>>>>> from that perspective and then they can add the
appropriate attributes to
>>>>>> the functions themselves. (-mlongcalls for the long
calls option for
>>>>>> example).
>>>>>>
>>>>>> Thanks!
>>>>>>
>>>>>> -eric
>>>>>>
>>>>>
>>>> Again, I think we can evaluate specific options on a
case-by-case
>>>> basis. For some of them, it might make sense to expose them as
clang
>>>> options. Like you, I had previously assumed we would teach the
front-end
>>>> about all the backend options. I was a little surprised by the
approach
>>>> Akira suggested here, but after giving it some thought, I think
it’s a
>>>> great idea. There are a LOT of target-specific backend options.
Some of
>>>> them do not need to be IR, but many do. Teaching clang about
all of them
>>>> would be a lot of work, and I don’t see any clear benefit from
that. This
>>>> approach lets clang remain relatively independent of the
backend options.
>>>> We still need to think carefully about which backend options
use this
>>>> mechanism, of course.
>>>>
>>>>
>>> I'm unconvinced here...
>>>
>>>
>>>> Are there other reasons why you think we should add all those
options
>>>> to clang? I just don’t see much to be gained by that.
>>>>
>>>>
>>> .... I think that anything you'd like to expose should probably
be a
>>> front end option. If nothing else I don't see how you plan on
annotating
>>> the IR. Basically you're constructing a parallel option
hierarchy and that
>>> seems... redundant. I agree that it's nice for tools to be able
to override
>>> things in the backend - for testing if nothing else. We do this a
lot by
>>> checking options and values in the code. A way of streamlining this
would
>>> be nice - but I don't think that's the goal of these
patches no?
>>>
>>>
>>>> Do you have objections to the use of string-based key/value
pairs? I
>>>> believe the intention all along, going back to Bill’s work on
this a few
>>>> years ago, was that we would add new attributes for
target-independent
>>>> things and use strings for the target-specific options. That
seems like a
>>>> reasonable starting point to me.
>>>>
>>>>
>>> Only some as I mentioned above - I haven't seen anything that
needs to
>>> be target specific in this way, but I'm open to the idea :)
>>>
>>> As a note, this is all for function level things. We need a
solution for
>>> non-function/module level attributes and I think that's going
to look like
>>> API calls into the backend for TargetMachine creation - there may
be a list
>>> of things that would turn into module level attributes, but that is
a
>>> direction fraught with danger and I'd like to avoid that as
much as
>>> possible.
>>>
>>> (As a note, the particular patch that Akira has is unacceptable
because
>>> of the changes to the subtarget lookup, but that's a separate
issue :)
>>>
>>> -eric
>>>
>>>
>>>>
>>>>>> On Tue Nov 18 2014 at 12:26:49 PM Akira Hatanaka
<ahatanak at gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>> Updated patch is attached. Note this is just a
work-in-progress
>>>>>>> patch and I plan to address the feedback comments
later if this patch is in
>>>>>>> the right direction.
>>>>>>>
>>>>>>> This is how the command line options are parsed and
used by the
>>>>>>> backend passes:
>>>>>>>
>>>>>>> 1. Tools such as clang and llc call
cl::ParseCommandLineOptions. Any
>>>>>>> of the options that should be written to the
bitcode as function or module
>>>>>>> attributes (these options are declared as
CodeGenOpt, which is a subclass
>>>>>>> of cl::opt) are saved to Function or ModuleIntAttrs
in CodeGenOptions.cpp.
>>>>>>> 2. setFunctionAttribute is called to move the
function and module
>>>>>>> options saved in step 1 to the bitcode. Also,
options stored in
>>>>>>> TargetOptions are moved to the bitcode too.
Pre-existing attributes in the
>>>>>>> bitcode are overwritten by the options in the
command line.
>>>>>>> 3. The backend passes and subtarget classes call
>>>>>>> getFunctionAttribute to read the attributes stored
to the bitcode in step
>>>>>>> 2. Function::hasFnAttribute can be called directly
(for example,
>>>>>>> NoNaNsFPMath in the patch), if the default value is
known to be "false".
>>>>>>>
>>>>>>> I also made the following changes in the patch:
>>>>>>>
>>>>>>> 1. In the constructor of MachineFunction, call the
version of
>>>>>>> getSubtargetImpl that takes a Function parameter,
so that it gets the
>>>>>>> Subtarget specific to the Function being compiled.
>>>>>>> 2. Change ARMTargetMachine::SubtargetMap to be a
DenseMap<Function*,
>>>>>>> unique_ptr<ARMSubtarget>>. This is just a
temporary change to ease
>>>>>>> debugging and should be reverted to a StringMap or
changed to another type
>>>>>>> of map later.
>>>>>>>
>>>>>>>
>>>>>>> On Mon, Nov 17, 2014 at 11:40 AM, Eric Christopher
<echristo at gmail.
>>>>>>> com> wrote:
>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Mon Nov 17 2014 at 9:56:40 AM Bob Wilson
<bob.wilson at apple.com>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>>
>>>>>>>>> > On Nov 14, 2014, at 2:44 PM, Duncan P.
N. Exon Smith <
>>>>>>>>> dexonsmith at apple.com> wrote:
>>>>>>>>> >
>>>>>>>>> > +chrisb
>>>>>>>>> >
>>>>>>>>> >> On 2014-Nov-13, at 16:33, Akira
Hatanaka <ahatanak at gmail.com>
>>>>>>>>> wrote:
>>>>>>>>> >>
>>>>>>>>> >> I'm working on fixing PR21471,
which is about embedding codegen
>>>>>>>>> command line options into the bitcode as
function or module-level
>>>>>>>>> attributes so that they don't get
ignored when doing LTO.
>>>>>>>>> >>
>>>>>>>>> >>
http://llvm.org/bugs/show_bug.cgi?id=21471
>>>>>>>>> >>
>>>>>>>>> >> I have an initial patch (attached
to this email) which enables
>>>>>>>>> clang/llvm to recognize one command line
option, write it to the IR, and
>>>>>>>>> read it out in a backend pass. I'm
looking to get feedback from the
>>>>>>>>> community on whether I'm headed in the
right direction or whether there are
>>>>>>>>> alternate ideas before I go all the way on
fixing the PR. Specifically, I'd
>>>>>>>>> like to know the answers to the following
questions:
>>>>>>>>> >>
>>>>>>>>> >> 1. How do we make sure we continue
to be able to use the
>>>>>>>>> command line options we've been using
for llc and other tools?
>>>>>>>>> >> 2. How to handle cases where two
functions in a module have
>>>>>>>>> different sets of command line options?
>>>>>>>>> >> 3. Where should the command line
options or module/function
>>>>>>>>> attributes be stored once they are read out
from the IR?
>>>>>>>>> >>
>>>>>>>>> >> The short description of the
approach I took in my patch is
>>>>>>>>> that command line options that are
important to codegen are collected by
>>>>>>>>> cl::ParseCommandLineOptions, written to the
bitcode as function or module
>>>>>>>>> attributes, and read out directly by the
optimization passes that need
>>>>>>>>> them. cl::opt options are replaced with
CodeGenOpt options (subclass of
>>>>>>>>> cl::opt), which are needed only to parse
the command line and provide the
>>>>>>>>> default value when the corresponding
options are not in the bitcode.
>>>>>>>>> >
>>>>>>>>> > I like this approach, since it means
the frontend doesn't have
>>>>>>>>> to understand
>>>>>>>>> > options in order to pass them on to
the backend.
>>>>>>>>>
>>>>>>>>> I agree. It’s not the approach that was I
was expecting, but after
>>>>>>>>> reading the patches, I like it.
>>>>>>>>>
>>>>>>>>> If we can get consensus on the approach, I
suggest that we stage
>>>>>>>>> the work to first adopt Akira’s patches and
then we can incrementally
>>>>>>>>> convert various options over to use it.
There may be complications for some
>>>>>>>>> options, so I’d like to see separate
patches for each one (possibly with a
>>>>>>>>> few similar options combined in one patch).
>>>>>>>>>
>>>>>>>>>
>>>>>>>> I'm not sold quite yet :)
>>>>>>>>
>>>>>>>> Akira: Can you show an example of how you
expect this to work in
>>>>>>>> the IR and how the backend is going to process?
>>>>>>>>
>>>>>>>> Thanks.
>>>>>>>>
>>>>>>>> -eric
>>>>>>>>
>>>>>>>>
>>>>>>>>> >
>>>>>>>>> > The static variables should be
straightforward to migrate to an
>>>>>>>>> LLVMContext
>>>>>>>>> > once ParseCommandLineOptions stores
things there instead of in
>>>>>>>>> globals.
>>>>>>>>> >
>>>>>>>>> >> diff --git
a/lib/CodeGen/CodeGenOption.cpp
>>>>>>>>> b/lib/CodeGen/CodeGenOption.cpp
>>>>>>>>> >> new file mode 100644
>>>>>>>>> >> index 0000000..2d74c2f
>>>>>>>>> >> --- /dev/null
>>>>>>>>> >> +++
b/lib/CodeGen/CodeGenOption.cpp
>>>>>>>>> >> @@ -0,0 +1,59 @@
>>>>>>>>> >> +//===- CodeGen/CodeGenOptions.cpp
- Code-gen option.
>>>>>>>>>  --*- C++ -*-===//
>>>>>>>>> >> +//
>>>>>>>>> >> +//                     The LLVM
Compiler Infrastructure
>>>>>>>>> >> +//
>>>>>>>>> >> +// This file is distributed under
the University of Illinois
>>>>>>>>> Open Source
>>>>>>>>> >> +// License. See LICENSE.TXT for
details.
>>>>>>>>> >> +//
>>>>>>>>> >>
+//===------------------------------------------------------
>>>>>>>>> ----------------===//
>>>>>>>>> >> +//
>>>>>>>>> >>
+//===------------------------------------------------------
>>>>>>>>> ----------------===//
>>>>>>>>> >> +
>>>>>>>>> >> +#include
"llvm/CodeGen/CodeGenOption.h"
>>>>>>>>> >> +#include
"llvm/IR/Attributes.h"
>>>>>>>>> >> +#include
"llvm/IR/Module.h"
>>>>>>>>> >> +
>>>>>>>>> >> +using namespace llvm;
>>>>>>>>> >> +
>>>>>>>>> >> +static std::map<std::string,
bool> FunctionBoolAttrs;
>>>>>>>>> >> +static std::map<std::string,
bool> ModuleBoolAttrs;
>>>>>>>>> >> +
>>>>>>>>> >
>>>>>>>>> > @Chris, should these be using
ManagedStatic?
>>>>>>>>> >
>>>>>>>>> >
_______________________________________________
>>>>>>>>> > LLVM Developers mailing list
>>>>>>>>> > LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>>>> >
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
_______________________________________________
>>>>>>>>> LLVM Developers mailing list
>>>>>>>>> LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>>>>
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>>>>
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> LLVM Developers mailing list
>>>>>>>> LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>>>
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>>
>>>>>>>
>>>>
>>>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20141203/d5bf02da/attachment.html>

Akira Hatanaka

2014-Dec-03 19:39 UTC

head link

[LLVMdev] [RFC] Embedding command line options in bitcode (PR21471)

On Tue, Dec 2, 2014 at 4:38 PM, Eric Christopher <echristo at gmail.com>
wrote:
>
>
> On Tue Dec 02 2014 at 4:31:43 PM Akira Hatanaka <ahatanak at
gmail.com>
> wrote:
>
>> On Tue, Dec 2, 2014 at 3:21 PM, Eric Christopher <echristo at
gmail.com>
>> wrote:
>>
>>> On Mon Dec 01 2014 at 4:22:15 PM Bob Wilson <bob.wilson at
apple.com>
>>> wrote:
>>>
>>>> Thanks for your feedback, Eric.
>>>>
>>>> I still think we may be talking past each other a little bit,
but
>>>> rather than
>>>>
>>>
>>> Might be, sorry if so :(
>>>
>>>
>>>> delving further into the details right now, I’ve suggested to
Akira
>>>> that he look into how we should handle other kinds of options.
I’m hoping
>>>> that as we look at more of them, we will gain some insight into
the
>>>> approach that we want to take here. This patch really only
deals with the
>>>> easy cases, and whether we use Akira’s approach or not, I’m not
especially
>>>> worried about this kind of option.
>>>>
>>>> Let’s come back to this in a little while after we’ve looked at
what to
>>>> do for some other options…..
>>>>
>>>>
>>> I apologize if this seems a bit rambly, but I'm trying to get a
lot of
>>> thoughts on this down:
>>>
>>> FWIW I'm looking at some of this at the moment which is why I
spoke to a
>>> few of the different options and needing to control TargetMachine
creation.
>>> Just to continue with some of the discussion (at least to point
some things
>>> the right direction and get feedback) let's think about the
-target-abi
>>> option for ARM. It sets the ABI for the backend on the target
machine. It
>>> probably shouldn't change on a per function basis (wait, except
for the
>>> calling convention which can be changed via an attribute, fun!), so
we want
>>> it to control global data layout and valid types which means we
need it on
>>> the TargetMachine. The question is how to pass this information
into the
>>> backend for such an option - right now, if we wanted, we could punt
to
>>> using the command line options that currently exist and the
existing
>>> ParseCommandLineOptions call from clang.
>>>
>>>
>> When option -target-abi=apcs-gnu for ARM is used for clang,
EmitAssemblyHelper::CreateTargetMachine
>> passes FeatruresStr="+apcs" to Target::createTargetMachine.
Do we have
>> anything else we have to pass to the backend in this case?
>>
>
> Not yet, but the option can be a backend option rather than a subtarget
> feature - and I'm probably going to do that in the short term as
it's no
> worse than the current status, see the next line here below :)
>
> -eric
>
>
>>
>>
>>> That said, it's a pretty awful hack to use them. For some of my
>>> particular needs I've got some code around unifying the
DataLayout creation
>>> between the front end and back end, but it's a partial solution
for the
>>> general problem and I'll probably run into more issues in this
arena.
>>>
>>> So, trying to come up with ways to communicate all of this via the
API
>>> so that we can set ABIs for targets that need it (ARM, MIPS,
others), as
>>> well as probably other options (anything passed via -backend-option
is a
>>> good candidate).
>>>
>>> One thought is expanding and/or deriving target specific versions
of
>>> TargetOptions and populating that on creation from the front end.
We'd want
>>> to be careful with the target specific bits and constructing
defaults,
>>> probably something off of the bits in Targets.cpp would be
appropriate. A
>>> lot of this type of code leads us down the TargetSupport library or
>>> something of the sort - classes to help describe or create backend
>>> constructs.
>>>
>>> Anyhow, this is my current thinking on how to do API building of
>>> TargetMachine etc. (Alternately a TargetMachineBuilder? But that
sounds
>>> complicated in all of the same ways without any particular gain).
>>>
>>>
If we wanted to, in EmitAssemblyHelper::CreateTargetMachine, we could pass
module-level CodeGenOpts.BackendOptions (backend options that don't need to
change on a per-function basis) to Target::createTargetMachine. We can
either pass it directly to the function or add a new field in
llvm::TargetOptions that holds the options. If we can do that, we don't
have to call ParseCommandLineOptions from
EmitAssemblyHelper::CreateTargetMachine.


> -eric
>>>
>>>
>>>> On Dec 1, 2014, at 10:53 AM, Eric Christopher <echristo at
gmail.com>
>>>> wrote:
>>>>
>>>> Hi Bob,
>>>>
>>>> I've been giving this some thought, comments inline:
>>>>
>>>> On Wed Nov 19 2014 at 6:24:18 PM Bob Wilson <bob.wilson at
apple.com>
>>>> wrote:
>>>>
>>>>> On Nov 19, 2014, at 4:52 PM, Eric Christopher <echristo
at gmail.com>
>>>>> wrote:
>>>>>
>>>>>
>>>>>
>>>>> On Wed Nov 19 2014 at 4:39:42 PM Akira Hatanaka
<ahatanak at gmail.com>
>>>>> wrote:
>>>>>
>>>>>> On Wed, Nov 19, 2014 at 3:28 PM, Eric Christopher
<echristo at gmail.com
>>>>>> > wrote:
>>>>>>
>>>>>>> So my general concern here is that lots of command
line options that
>>>>>>> don't need to be or shouldn't be IR level
constructs become oddly named
>>>>>>> string options. It's bad enough that we're
doing that with the target cpu
>>>>>>> and target feature string let alone the rest of it.
>>>>>>>
>>>>>>> If we're talking about things that end up
changing the subtarget
>>>>>>> used we definitely need to make this part of the
lookup key - at which
>>>>>>> point we're going to want a unified interface
to all of them. I haven't
>>>>>>> applied your patch to see what the IR changes look
like here.
>>>>>>>
>>>>>>>
>>>>>> With the patches applied, the attributes clang embeds
in the IR look
>>>>>> like this:
>>>>>>
>>>>>> $ clang -cc1 ... -mllvm -arm-long-calls -mllvm
-arm-reserve-r9
>>>>>> -menable-no-nans
>>>>>>
>>>>>> attributes #0 = { nounwind readonly ssp
"NoNaNsFPMath"
>>>>>> "arm-long-calls"="1"
"arm-reserve-r9"="1"
"less-precise-fpmad"="false"
>>>>>> "no-frame-pointer-elim"="true"
"no-frame-pointer-elim-non-leaf"
>>>>>> "no-infs-fp-math"="false"
"stack-protector-buffer-size"="8"
>>>>>> "target-cpu"="arm7tdmi"
"target-features"="-neon,-crypto,+apcs"
>>>>>> "unsafe-fp-math"="false"
"use-soft-float"="true" }
>>>>>>
>>>>>> I'm guessing you want the front-end to convert
command line options
>>>>>> like "arm-long-calls" or
"arm-reserve-r9" to something that doesn't look as
>>>>>> odd?
>>>>>>
>>>>>
>>>>> Right, that was part of the idea, the rest is that we
define things
>>>>> that actually make sense as front end command line options
and use those to
>>>>> communicate via IR constructs or api calls I.e. reserved
registers don't
>>>>> need to be a specific back end option, but rather can be
passed along when
>>>>> creating a target machine, etc.
>>>>>
>>>>> -eric
>>>>>
>>>>>
>>>>> No, reserved registers (I assume you’re referring to
“arm-reserve-r9”
>>>>> in Akira’s example) don’t necessarily need to be backend
options. But,
>>>>> those kinds of options do need to be recorded in the IR and
the target
>>>>> machine needs to reflect the relevant IR settings.
Otherwise, we’ll still
>>>>> be left with a broken LTO model. E.G., I build one LTO
object with
>>>>> “arm-reserve-r9” and another without it, but the LTO
compilation still
>>>>> needs to respect those options for the code in each object.
The details of
>>>>> how a particular option should be handled will vary
depending on the
>>>>> option, and we need to discuss each one separately.
>>>>>
>>>>>
>>>> ABI breaking options should probably be considered part of
>>>> TargetMachine/Subtarget creation and handled accordingly (if
it's subtarget
>>>> creation it can be on the function as a feature string
attribute, if it
>>>> affects global code generation it needs to be at TargetMachine
creation
>>>> time).
>>>>
>>>>
>>>>> It’s pretty clear, though, that there is a class of backend
options
>>>>> that translate nicely to function attributes. This RFC is
about the
>>>>> mechanism that we use to handle those (without specifying
which specific
>>>>> options will be handled that way).
>>>>>
>>>>>
>>>> Fair enough. For function attributes I can see a desire to have
(as
>>>> Chris posted in an email from someone at synopsys) a generic
target
>>>> specific dictionary mapping on functions for backend specific
uses. I just
>>>> haven't seen anything in the thread that would be
applicable for this
>>>> rather than an actual function attribute. That said, I would
like some sort
>>>> of delineation for target function attributes as well. Thoughts
on that?
>>>> Perhaps it could look like the string dictionary we've
mentioned, maybe not.
>>>>
>>>>
>>>>>
>>>>>
>>>>>> From looking at the patch it seems that the command
line option for
>>>>>>> NaNs should be a general attribute on the function
rather than a
>>>>>>> TargetMachine option anyhow, so it seems that
should be migrated down.
>>>>>>> Whether or not it is exposed as an llvm attribute
or a string based key
>>>>>>> value pair is an interesting discussion. Also,
would we want to, instead,
>>>>>>> expose the backend options as clang IR level
options? Seems to make sense
>>>>>>> from that perspective and then they can add the
appropriate attributes to
>>>>>>> the functions themselves. (-mlongcalls for the long
calls option for
>>>>>>> example).
>>>>>>>
>>>>>>> Thanks!
>>>>>>>
>>>>>>> -eric
>>>>>>>
>>>>>>
>>>>> Again, I think we can evaluate specific options on a
case-by-case
>>>>> basis. For some of them, it might make sense to expose them
as clang
>>>>> options. Like you, I had previously assumed we would teach
the front-end
>>>>> about all the backend options. I was a little surprised by
the approach
>>>>> Akira suggested here, but after giving it some thought, I
think it’s a
>>>>> great idea. There are a LOT of target-specific backend
options. Some of
>>>>> them do not need to be IR, but many do. Teaching clang
about all of them
>>>>> would be a lot of work, and I don’t see any clear benefit
from that. This
>>>>> approach lets clang remain relatively independent of the
backend options.
>>>>> We still need to think carefully about which backend
options use this
>>>>> mechanism, of course.
>>>>>
>>>>>
>>>> I'm unconvinced here...
>>>>
>>>>
>>>>> Are there other reasons why you think we should add all
those options
>>>>> to clang? I just don’t see much to be gained by that.
>>>>>
>>>>>
>>>> .... I think that anything you'd like to expose should
probably be a
>>>> front end option. If nothing else I don't see how you plan
on annotating
>>>> the IR. Basically you're constructing a parallel option
hierarchy and that
>>>> seems... redundant. I agree that it's nice for tools to be
able to override
>>>> things in the backend - for testing if nothing else. We do this
a lot by
>>>> checking options and values in the code. A way of streamlining
this would
>>>> be nice - but I don't think that's the goal of these
patches no?
>>>>
>>>>
>>>>> Do you have objections to the use of string-based key/value
pairs? I
>>>>> believe the intention all along, going back to Bill’s work
on this a few
>>>>> years ago, was that we would add new attributes for
target-independent
>>>>> things and use strings for the target-specific options.
That seems like a
>>>>> reasonable starting point to me.
>>>>>
>>>>>
>>>> Only some as I mentioned above - I haven't seen anything
that needs to
>>>> be target specific in this way, but I'm open to the idea :)
>>>>
>>>> As a note, this is all for function level things. We need a
solution
>>>> for non-function/module level attributes and I think that's
going to look
>>>> like API calls into the backend for TargetMachine creation -
there may be a
>>>> list of things that would turn into module level attributes,
but that is a
>>>> direction fraught with danger and I'd like to avoid that as
much as
>>>> possible.
>>>>
>>>> (As a note, the particular patch that Akira has is unacceptable
because
>>>> of the changes to the subtarget lookup, but that's a
separate issue :)
>>>>
>>>> -eric
>>>>
>>>>
>>>>>
>>>>>>> On Tue Nov 18 2014 at 12:26:49 PM Akira Hatanaka
<ahatanak at gmail.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Updated patch is attached. Note this is just a
work-in-progress
>>>>>>>> patch and I plan to address the feedback
comments later if this patch is in
>>>>>>>> the right direction.
>>>>>>>>
>>>>>>>> This is how the command line options are parsed
and used by the
>>>>>>>> backend passes:
>>>>>>>>
>>>>>>>> 1. Tools such as clang and llc call
cl::ParseCommandLineOptions.
>>>>>>>> Any of the options that should be written to
the bitcode as function or
>>>>>>>> module attributes (these options are declared
as CodeGenOpt, which is a
>>>>>>>> subclass of cl::opt) are saved to Function or
ModuleIntAttrs in
>>>>>>>> CodeGenOptions.cpp.
>>>>>>>> 2. setFunctionAttribute is called to move the
function and module
>>>>>>>> options saved in step 1 to the bitcode. Also,
options stored in
>>>>>>>> TargetOptions are moved to the bitcode too.
Pre-existing attributes in the
>>>>>>>> bitcode are overwritten by the options in the
command line.
>>>>>>>> 3. The backend passes and subtarget classes
call
>>>>>>>> getFunctionAttribute to read the attributes
stored to the bitcode in step
>>>>>>>> 2. Function::hasFnAttribute can be called
directly (for example,
>>>>>>>> NoNaNsFPMath in the patch), if the default
value is known to be "false".
>>>>>>>>
>>>>>>>> I also made the following changes in the patch:
>>>>>>>>
>>>>>>>> 1. In the constructor of MachineFunction, call
the version of
>>>>>>>> getSubtargetImpl that takes a Function
parameter, so that it gets the
>>>>>>>> Subtarget specific to the Function being
compiled.
>>>>>>>> 2. Change ARMTargetMachine::SubtargetMap to be
a
>>>>>>>> DenseMap<Function*,
unique_ptr<ARMSubtarget>>. This is just a temporary
>>>>>>>> change to ease debugging and should be reverted
to a StringMap or changed
>>>>>>>> to another type of map later.
>>>>>>>>
>>>>>>>>
>>>>>>>> On Mon, Nov 17, 2014 at 11:40 AM, Eric
Christopher <echristo at gmail.
>>>>>>>> com> wrote:
>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Mon Nov 17 2014 at 9:56:40 AM Bob Wilson
<bob.wilson at apple.com>
>>>>>>>>> wrote:
>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> > On Nov 14, 2014, at 2:44 PM,
Duncan P. N. Exon Smith <
>>>>>>>>>> dexonsmith at apple.com> wrote:
>>>>>>>>>> >
>>>>>>>>>> > +chrisb
>>>>>>>>>> >
>>>>>>>>>> >> On 2014-Nov-13, at 16:33,
Akira Hatanaka <ahatanak at gmail.com>
>>>>>>>>>> wrote:
>>>>>>>>>> >>
>>>>>>>>>> >> I'm working on fixing
PR21471, which is about embedding
>>>>>>>>>> codegen command line options into the
bitcode as function or module-level
>>>>>>>>>> attributes so that they don't get
ignored when doing LTO.
>>>>>>>>>> >>
>>>>>>>>>> >>
http://llvm.org/bugs/show_bug.cgi?id=21471
>>>>>>>>>> >>
>>>>>>>>>> >> I have an initial patch
(attached to this email) which enables
>>>>>>>>>> clang/llvm to recognize one command
line option, write it to the IR, and
>>>>>>>>>> read it out in a backend pass. I'm
looking to get feedback from the
>>>>>>>>>> community on whether I'm headed in
the right direction or whether there are
>>>>>>>>>> alternate ideas before I go all the way
on fixing the PR. Specifically, I'd
>>>>>>>>>> like to know the answers to the
following questions:
>>>>>>>>>> >>
>>>>>>>>>> >> 1. How do we make sure we
continue to be able to use the
>>>>>>>>>> command line options we've been
using for llc and other tools?
>>>>>>>>>> >> 2. How to handle cases where
two functions in a module have
>>>>>>>>>> different sets of command line options?
>>>>>>>>>> >> 3. Where should the command
line options or module/function
>>>>>>>>>> attributes be stored once they are read
out from the IR?
>>>>>>>>>> >>
>>>>>>>>>> >> The short description of the
approach I took in my patch is
>>>>>>>>>> that command line options that are
important to codegen are collected by
>>>>>>>>>> cl::ParseCommandLineOptions, written to
the bitcode as function or module
>>>>>>>>>> attributes, and read out directly by
the optimization passes that need
>>>>>>>>>> them. cl::opt options are replaced with
CodeGenOpt options (subclass of
>>>>>>>>>> cl::opt), which are needed only to
parse the command line and provide the
>>>>>>>>>> default value when the corresponding
options are not in the bitcode.
>>>>>>>>>> >
>>>>>>>>>> > I like this approach, since it
means the frontend doesn't have
>>>>>>>>>> to understand
>>>>>>>>>> > options in order to pass them on
to the backend.
>>>>>>>>>>
>>>>>>>>>> I agree. It’s not the approach that was
I was expecting, but
>>>>>>>>>> after reading the patches, I like it.
>>>>>>>>>>
>>>>>>>>>> If we can get consensus on the
approach, I suggest that we stage
>>>>>>>>>> the work to first adopt Akira’s patches
and then we can incrementally
>>>>>>>>>> convert various options over to use it.
There may be complications for some
>>>>>>>>>> options, so I’d like to see separate
patches for each one (possibly with a
>>>>>>>>>> few similar options combined in one
patch).
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>> I'm not sold quite yet :)
>>>>>>>>>
>>>>>>>>> Akira: Can you show an example of how you
expect this to work in
>>>>>>>>> the IR and how the backend is going to
process?
>>>>>>>>>
>>>>>>>>> Thanks.
>>>>>>>>>
>>>>>>>>> -eric
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>> >
>>>>>>>>>> > The static variables should be
straightforward to migrate to an
>>>>>>>>>> LLVMContext
>>>>>>>>>> > once ParseCommandLineOptions
stores things there instead of in
>>>>>>>>>> globals.
>>>>>>>>>> >
>>>>>>>>>> >> diff --git
a/lib/CodeGen/CodeGenOption.cpp
>>>>>>>>>> b/lib/CodeGen/CodeGenOption.cpp
>>>>>>>>>> >> new file mode 100644
>>>>>>>>>> >> index 0000000..2d74c2f
>>>>>>>>>> >> --- /dev/null
>>>>>>>>>> >> +++
b/lib/CodeGen/CodeGenOption.cpp
>>>>>>>>>> >> @@ -0,0 +1,59 @@
>>>>>>>>>> >> +//===-
CodeGen/CodeGenOptions.cpp - Code-gen option.
>>>>>>>>>>  --*- C++ -*-===//
>>>>>>>>>> >> +//
>>>>>>>>>> >> +//                     The
LLVM Compiler Infrastructure
>>>>>>>>>> >> +//
>>>>>>>>>> >> +// This file is distributed
under the University of Illinois
>>>>>>>>>> Open Source
>>>>>>>>>> >> +// License. See LICENSE.TXT
for details.
>>>>>>>>>> >> +//
>>>>>>>>>> >>
+//===------------------------------------------------------
>>>>>>>>>> ----------------===//
>>>>>>>>>> >> +//
>>>>>>>>>> >>
+//===------------------------------------------------------
>>>>>>>>>> ----------------===//
>>>>>>>>>> >> +
>>>>>>>>>> >> +#include
"llvm/CodeGen/CodeGenOption.h"
>>>>>>>>>> >> +#include
"llvm/IR/Attributes.h"
>>>>>>>>>> >> +#include
"llvm/IR/Module.h"
>>>>>>>>>> >> +
>>>>>>>>>> >> +using namespace llvm;
>>>>>>>>>> >> +
>>>>>>>>>> >> +static
std::map<std::string, bool> FunctionBoolAttrs;
>>>>>>>>>> >> +static
std::map<std::string, bool> ModuleBoolAttrs;
>>>>>>>>>> >> +
>>>>>>>>>> >
>>>>>>>>>> > @Chris, should these be using
ManagedStatic?
>>>>>>>>>> >
>>>>>>>>>> >
_______________________________________________
>>>>>>>>>> > LLVM Developers mailing list
>>>>>>>>>> > LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>>>>> >
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
_______________________________________________
>>>>>>>>>> LLVM Developers mailing list
>>>>>>>>>> LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>>>>>
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
_______________________________________________
>>>>>>>>> LLVM Developers mailing list
>>>>>>>>> LLVMdev at cs.uiuc.edu        
http://llvm.cs.uiuc.edu
>>>>>>>>>
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>>>>>>>>
>>>>>>>>
>>>>>
>>>>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20141203/bfbcecf5/attachment.html>

llvm dev - Dec 2014 - [LLVMdev] [RFC] Embedding command line options in bitcode (PR21471)

[LLVMdev] [RFC] Embedding command line options in bitcode (PR21471)

[LLVMdev] [RFC] Embedding command line options in bitcode (PR21471)

[LLVMdev] [RFC] Embedding command line options in bitcode (PR21471)