thr3ads.net - similar to: "help/hints/suggestions/tips please: how to give _generic_ compilation for a particular ISA a non-zero LoopMicroOpBufferSize?"

Displaying 20 results from an estimated 100 matches similar to: "help/hints/suggestions/tips please: how to give _generic_ compilation for a particular ISA a non-zero LoopMicroOpBufferSize?"

[RFC] llvm-mca: a static performance analysis tool

2018 Mar 06

[RFC] llvm-mca: a static performance analysis tool

On Tue, Mar 6, 2018 at 5:55 AM, Andrew Trick via llvm-dev < llvm-dev at lists.llvm.org> wrote: > > > On Mar 5, 2018, at 6:28 PM, Matthias Braun <mbraun at apple.com> wrote: > > > > On Mar 5, 2018, at 6:14 PM, Andrew Trick via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > > > > On Mar 5, 2018, at 3:38 PM, Quentin Colombet <qcolombet at

[RFC] llvm-mca: a static performance analysis tool

2018 Mar 06

[RFC] llvm-mca: a static performance analysis tool

> On Mar 6, 2018, at 4:20 AM, Andrea Di Biagio <andrea.dibiagio at gmail.com> wrote: > > To be clear then, resolveSchedClass should be moved from TargetSchedModel into MCSchedModel (which is where I originally wanted it). Any TargetInstrInfo APIs called from SchedPredicate should be moved to MCInstrInfo, which should be straightforward but annoying. > > Personally, I

getCacheSize() / subtarget machine id

2017 Oct 17

getCacheSize() / subtarget machine id

Hi, while implementing SystemZTTI:getCacheSize(), it became clear that there really isn't a simple way to just ask the Subtarget for the current subtarget machine model. I was thinking like something of an enum that would also reflect the subtarget series (and would allow >= and similar operations). I would like to ask what the ideas are on how this should be done best. Some

[RFC] llvm-mca: a static performance analysis tool

2018 Mar 06

[RFC] llvm-mca: a static performance analysis tool

> On Mar 5, 2018, at 6:28 PM, Matthias Braun <mbraun at apple.com> wrote: > > > >> On Mar 5, 2018, at 6:14 PM, Andrew Trick via llvm-dev <llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>> wrote: >> >> >> >>> On Mar 5, 2018, at 3:38 PM, Quentin Colombet <qcolombet at apple.com <mailto:qcolombet at

[RFC] llvm-mca: a static performance analysis tool

2018 Mar 06

[RFC] llvm-mca: a static performance analysis tool

> On Mar 5, 2018, at 6:14 PM, Andrew Trick via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > > >> On Mar 5, 2018, at 3:38 PM, Quentin Colombet <qcolombet at apple.com <mailto:qcolombet at apple.com>> wrote: >> >> When Ahmed and I worked on the decompiler, we first targeted MC. Going to MI was more difficult and really wouldn’t have gotten us a

Subtarget Initialization in <ARCH>TargetMachine constructor

2017 Aug 23

Subtarget Initialization in <ARCH>TargetMachine constructor

Thanks, Alex. See my comments below. On Wed, Aug 23, 2017 at 12:59 AM, Alex Bradbury <asb at asbradbury.org> wrote: > On 22 August 2017 at 23:39, Y Song via llvm-dev <llvm-dev at lists.llvm.org> wrote: >> Hi, > > Hi Yonghong. > >> I found some different discrepancy on how Subtarget is created >> between some arch specific TargetMachine constructor.

SchedClasses

2017 Sep 22

SchedClasses

Hi all, I am looking at the scheduling model of the ThunderX2. I am trying to figure out the cost of the LDADDALX instruction. The following program’s output is: name LDADDALX; class 872 microops 65535 I would have assumed that the microops are less than 20. The ThunderX2 has a detailed cost model for LSE. Could somebody tell me what I am doing wrong? Cheers, Tom #define GET_REGINFO_ENUM

SchedClasses

2017 Sep 29

SchedClasses

> On Sep 22, 2017, at 10:34 AM, Thorsten Schütt via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > Hi all, > > I am looking at the scheduling model of the ThunderX2. I am trying to figure out the cost of the LDADDALX instruction. The following program’s output is: > name LDADDALX; class 872 > microops 65535 > I would have assumed that the microops are less than

[LLVMdev] Instruction Scheduling - migration from v3.1 to v3.2

2013 Apr 30

[LLVMdev] Instruction Scheduling - migration from v3.1 to v3.2

On Apr 26, 2013, at 3:53 AM, Martin J. O'Riordan <Martin.ORiordan at movidius.com> wrote: > I am migrating the llvm/clang derived compiler for our processor from the > v3.1 to v3.2 codebase. This has mostly gone well except that instruction > latency scheduling is no longer happening. > > The people who implemented this previously sub-classed 'ScheduleDAGInstrs'

DFAPacketizer, Scheduling and LoadLatency

2015 Nov 16

DFAPacketizer, Scheduling and LoadLatency

I'm unclear how does DFAPacketizer and the scheduler know a given instruction is a load. Here is what I'm talking about Let's assume my VLIW target is described as follows: def MyTargetItineraries : ProcessorItineraries<[Slot0, Slot1], [], [ .............................. InstrItinData<RI, [InstrStage<1, [Slot0, Slot1]>]>,

[RFC] llvm-mca: a static performance analysis tool

2018 Mar 06

[RFC] llvm-mca: a static performance analysis tool

> On Mar 5, 2018, at 3:38 PM, Quentin Colombet <qcolombet at apple.com> wrote: > > When Ahmed and I worked on the decompiler, we first targeted MC. Going to MI was more difficult and really wouldn’t have gotten us a lot of benefits. Instead, Ahmed pushed for directly decompiling to IR (look for dagger). Thanks for the pointer Quentin. > I would actually be in favor for more

DFAPacketizer, Scheduling and LoadLatency

2015 Nov 17

DFAPacketizer, Scheduling and LoadLatency

> In particular, the LoadLatency is used in defaultDefLatency: > > /// Return the default expected latency for a def based on it's opcode. > unsigned TargetInstrInfo::defaultDefLatency( > const MCSchedModel &SchedModel, const MachineInstr *DefMI) const { > if (DefMI->isTransient()) > return 0; > if (DefMI->mayLoad()) > return

[RFC] llvm-mca: a static performance analysis tool

2018 Mar 05

[RFC] llvm-mca: a static performance analysis tool

Thanks Andrea for working on this! I’ve been willing to do this for quite some time now. Looks like procrastination was the right approach here ;). > On Mar 2, 2018, at 9:33 AM, Andrew Trick via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > +Ahmed > >> On Mar 2, 2018, at 6:42 AM, Andrea Di Biagio <andrea.dibiagio at gmail.com <mailto:andrea.dibiagio at

Subtarget Initialization in <ARCH>TargetMachine constructor

2017 Aug 22

Subtarget Initialization in <ARCH>TargetMachine constructor

Hi, I found some different discrepancy on how Subtarget is created between some arch specific TargetMachine constructor. For example, for BPF/Lanai: BPFTargetMachine::BPFTargetMachine(const Target &T, const Triple &TT, StringRef CPU, StringRef FS, const TargetOptions &Options,

[LLVMdev] clang -O2 versus opt -O2 | llc | clang

2014 May 23

[LLVMdev] clang -O2 versus opt -O2 | llc | clang

----- Original Message ----- > From: "Tim Northover" <t.p.northover at gmail.com> > To: "Sanjay Patel" <spatel at rotateright.com> > Cc: "LLVM Developers Mailing List" <llvmdev at cs.uiuc.edu> > Sent: Friday, May 23, 2014 2:22:55 PM > Subject: Re: [LLVMdev] clang -O2 versus opt -O2 | llc | clang > > Hi Sanjay, > > >

SchedClasses

2017 Sep 30

SchedClasses

On Fri, Sep 29, 2017 at 7:51 PM, Andrew Trick via llvm-dev < llvm-dev at lists.llvm.org> wrote: > > > > On Sep 22, 2017, at 10:34 AM, Thorsten Schütt via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > > > > > > > #define GET_REGINFO_ENUM > > #include "AArch64GenRegisterInfo.inc" > > > > #define GET_INSTRINFO_ENUM >

[LLVMdev] clang -O2 versus opt -O2 | llc | clang

2014 May 23

[LLVMdev] clang -O2 versus opt -O2 | llc | clang

I'm investigating a miscompilation bug ( http://llvm.org/bugs/show_bug.cgi?id=19823 ), but I've run into a problem: the output of the program is different when I compile the IR with clang compared to opt | llc | clang. Any clues on how to resolve this difference? $ ./opt -O1 19823.ll | ./llc | ./clang -x assembler - -o a.out ; ./a.out ; echo $? 1 $ ./opt -O2 19823.ll | ./llc | ./clang -x

Can USBstikA boot USBstkB ?!

2017 Jun 14

Can USBstikA boot USBstkB ?!

PS. Since I've now only got a laptop, I mostly use TinyCore linux; which is also sysylinx booted; and makes provision for *THE SLOW REACTION TIME OF SOME USB DEVICES*; which may be the subtle cause of my problem in trying to boot a second USBdevice [other than the <boot device>]. WDYS? On 6/14/17, eas lab <lab.eas at gmail.com> wrote: >> I don't think that syslinux be

Error loading vesamenu.c32

2015 Jan 18

Error loading vesamenu.c32

> On Sun, Jan 18, 2015 at 6:44 AM, Angel <angelv64 at gmail.com> wrote: > > -----BEGIN PGP SIGNED MESSAGE----- > > Hash: SHA256 > > > > Hello. > > > > I'm trying to setup an multiboot USB, mainly to install Debian using > > netinst images. > > > > I've mounted one of iso image and copied to a directory in USB (made > >

Can USBstikA boot USBstkB ?!

2017 Jun 14

Can USBstikA boot USBstkB ?!

]You seem to be assuming that changing from a direct SATA and IDE ] connections to some USB adapter would have no impact on how the ] respective HDDs are recognized/detected by the BIOS. ] You seem to be ] assuming that the BIOS from 2 different PCs/Laptops would recognize all ] your devices in the same exact way and that there would be no changes ] in behavior or in supported features by the

similar to: help/hints/suggestions/tips please: how to give _generic_ compilation for a particular ISA a non-zero LoopMicroOpBufferSize?