Displaying 20 results from an estimated 100 matches similar to: "help/hints/suggestions/tips please: how to give _generic_ compilation for a particular ISA a non-zero LoopMicroOpBufferSize?"
2018 Mar 06
2
[RFC] llvm-mca: a static performance analysis tool
On Tue, Mar 6, 2018 at 5:55 AM, Andrew Trick via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
>
>
> On Mar 5, 2018, at 6:28 PM, Matthias Braun <mbraun at apple.com> wrote:
>
>
>
> On Mar 5, 2018, at 6:14 PM, Andrew Trick via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
>
>
>
> On Mar 5, 2018, at 3:38 PM, Quentin Colombet <qcolombet at
2018 Mar 06
0
[RFC] llvm-mca: a static performance analysis tool
> On Mar 6, 2018, at 4:20 AM, Andrea Di Biagio <andrea.dibiagio at gmail.com> wrote:
>
> To be clear then, resolveSchedClass should be moved from TargetSchedModel into MCSchedModel (which is where I originally wanted it). Any TargetInstrInfo APIs called from SchedPredicate should be moved to MCInstrInfo, which should be straightforward but annoying.
>
> Personally, I
2017 Oct 17
2
getCacheSize() / subtarget machine id
Hi,
while implementing SystemZTTI:getCacheSize(), it became clear that there
really isn't a simple way to just ask the Subtarget for the current
subtarget machine model. I was thinking like something of an enum that
would also reflect the subtarget series (and would allow >= and similar
operations).
I would like to ask what the ideas are on how this should be done best.
Some
2018 Mar 06
0
[RFC] llvm-mca: a static performance analysis tool
> On Mar 5, 2018, at 6:28 PM, Matthias Braun <mbraun at apple.com> wrote:
>
>
>
>> On Mar 5, 2018, at 6:14 PM, Andrew Trick via llvm-dev <llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>> wrote:
>>
>>
>>
>>> On Mar 5, 2018, at 3:38 PM, Quentin Colombet <qcolombet at apple.com <mailto:qcolombet at
2018 Mar 06
3
[RFC] llvm-mca: a static performance analysis tool
> On Mar 5, 2018, at 6:14 PM, Andrew Trick via llvm-dev <llvm-dev at lists.llvm.org> wrote:
>
>
>
>> On Mar 5, 2018, at 3:38 PM, Quentin Colombet <qcolombet at apple.com <mailto:qcolombet at apple.com>> wrote:
>>
>> When Ahmed and I worked on the decompiler, we first targeted MC. Going to MI was more difficult and really wouldn’t have gotten us a
2017 Aug 23
2
Subtarget Initialization in <ARCH>TargetMachine constructor
Thanks, Alex. See my comments below.
On Wed, Aug 23, 2017 at 12:59 AM, Alex Bradbury <asb at asbradbury.org> wrote:
> On 22 August 2017 at 23:39, Y Song via llvm-dev <llvm-dev at lists.llvm.org> wrote:
>> Hi,
>
> Hi Yonghong.
>
>> I found some different discrepancy on how Subtarget is created
>> between some arch specific TargetMachine constructor.
2017 Sep 22
2
SchedClasses
Hi all,
I am looking at the scheduling model of the ThunderX2. I am trying to figure out the cost of the LDADDALX instruction. The following program’s output is:
name LDADDALX; class 872
microops 65535
I would have assumed that the microops are less than 20. The ThunderX2 has a detailed cost model for LSE. Could somebody tell me what I am doing wrong?
Cheers,
Tom
#define GET_REGINFO_ENUM
2017 Sep 29
0
SchedClasses
> On Sep 22, 2017, at 10:34 AM, Thorsten Schütt via llvm-dev <llvm-dev at lists.llvm.org> wrote:
>
> Hi all,
>
> I am looking at the scheduling model of the ThunderX2. I am trying to figure out the cost of the LDADDALX instruction. The following program’s output is:
> name LDADDALX; class 872
> microops 65535
> I would have assumed that the microops are less than
2013 Apr 30
1
[LLVMdev] Instruction Scheduling - migration from v3.1 to v3.2
On Apr 26, 2013, at 3:53 AM, Martin J. O'Riordan <Martin.ORiordan at movidius.com> wrote:
> I am migrating the llvm/clang derived compiler for our processor from the
> v3.1 to v3.2 codebase. This has mostly gone well except that instruction
> latency scheduling is no longer happening.
>
> The people who implemented this previously sub-classed 'ScheduleDAGInstrs'
2015 Nov 16
3
DFAPacketizer, Scheduling and LoadLatency
I'm unclear how does DFAPacketizer and the scheduler know a given
instruction is a load.
Here is what I'm talking about
Let's assume my VLIW target is described as follows:
def MyTargetItineraries :
ProcessorItineraries<[Slot0, Slot1], [], [
..............................
InstrItinData<RI, [InstrStage<1, [Slot0, Slot1]>]>,
2018 Mar 06
0
[RFC] llvm-mca: a static performance analysis tool
> On Mar 5, 2018, at 3:38 PM, Quentin Colombet <qcolombet at apple.com> wrote:
>
> When Ahmed and I worked on the decompiler, we first targeted MC. Going to MI was more difficult and really wouldn’t have gotten us a lot of benefits. Instead, Ahmed pushed for directly decompiling to IR (look for dagger).
Thanks for the pointer Quentin.
> I would actually be in favor for more
2015 Nov 17
2
DFAPacketizer, Scheduling and LoadLatency
> In particular, the LoadLatency is used in defaultDefLatency:
>
> /// Return the default expected latency for a def based on it's opcode.
> unsigned TargetInstrInfo::defaultDefLatency(
> const MCSchedModel &SchedModel, const MachineInstr *DefMI) const {
> if (DefMI->isTransient())
> return 0;
> if (DefMI->mayLoad())
> return
2018 Mar 05
2
[RFC] llvm-mca: a static performance analysis tool
Thanks Andrea for working on this!
I’ve been willing to do this for quite some time now. Looks like procrastination was the right approach here ;).
> On Mar 2, 2018, at 9:33 AM, Andrew Trick via llvm-dev <llvm-dev at lists.llvm.org> wrote:
>
> +Ahmed
>
>> On Mar 2, 2018, at 6:42 AM, Andrea Di Biagio <andrea.dibiagio at gmail.com <mailto:andrea.dibiagio at
2017 Aug 22
2
Subtarget Initialization in <ARCH>TargetMachine constructor
Hi,
I found some different discrepancy on how Subtarget is created
between some arch specific TargetMachine constructor.
For example, for BPF/Lanai:
BPFTargetMachine::BPFTargetMachine(const Target &T, const Triple &TT,
StringRef CPU, StringRef FS,
const TargetOptions &Options,
2014 May 23
2
[LLVMdev] clang -O2 versus opt -O2 | llc | clang
----- Original Message -----
> From: "Tim Northover" <t.p.northover at gmail.com>
> To: "Sanjay Patel" <spatel at rotateright.com>
> Cc: "LLVM Developers Mailing List" <llvmdev at cs.uiuc.edu>
> Sent: Friday, May 23, 2014 2:22:55 PM
> Subject: Re: [LLVMdev] clang -O2 versus opt -O2 | llc | clang
>
> Hi Sanjay,
>
> >
2017 Sep 30
1
SchedClasses
On Fri, Sep 29, 2017 at 7:51 PM, Andrew Trick via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
>
>
> > On Sep 22, 2017, at 10:34 AM, Thorsten Schütt via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
> >
>
> >
> > #define GET_REGINFO_ENUM
> > #include "AArch64GenRegisterInfo.inc"
> >
> > #define GET_INSTRINFO_ENUM
>
2014 May 23
2
[LLVMdev] clang -O2 versus opt -O2 | llc | clang
I'm investigating a miscompilation bug (
http://llvm.org/bugs/show_bug.cgi?id=19823 ), but I've run into a problem:
the output of the program is different when I compile the IR with clang
compared to opt | llc | clang. Any clues on how to resolve this difference?
$ ./opt -O1 19823.ll | ./llc | ./clang -x assembler - -o a.out ; ./a.out ;
echo $?
1
$ ./opt -O2 19823.ll | ./llc | ./clang -x
2017 Jun 14
2
Can USBstikA boot USBstkB ?!
PS. Since I've now only got a laptop, I mostly use TinyCore linux;
which is also sysylinx booted; and makes provision for
*THE SLOW REACTION TIME OF SOME USB DEVICES*;
which may be the subtle cause of my problem in trying to boot
a second USBdevice [other than the <boot device>].
WDYS?
On 6/14/17, eas lab <lab.eas at gmail.com> wrote:
>> I don't think that syslinux be
2015 Jan 18
1
Error loading vesamenu.c32
> On Sun, Jan 18, 2015 at 6:44 AM, Angel <angelv64 at gmail.com> wrote:
> > -----BEGIN PGP SIGNED MESSAGE-----
> > Hash: SHA256
> >
> > Hello.
> >
> > I'm trying to setup an multiboot USB, mainly to install Debian using
> > netinst images.
> >
> > I've mounted one of iso image and copied to a directory in USB (made
> >
2017 Jun 14
0
Can USBstikA boot USBstkB ?!
]You seem to be assuming that changing from a direct SATA and IDE
] connections to some USB adapter would have no impact on how the
] respective HDDs are recognized/detected by the BIOS.
] You seem to be
] assuming that the BIOS from 2 different PCs/Laptops would recognize all
] your devices in the same exact way and that there would be no changes
] in behavior or in supported features by the