thr3ads.net - similar to: "[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets"

Displaying 20 results from an estimated 30000 matches similar to: "[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets"

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

2019 Nov 13

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

On Wed, Nov 13, 2019 at 12:26 PM Joan Lluch via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > Hi All, > > In relation to the subject of this message I got my first round of patches successfully reviewed and committed. As a matter of reference, they are the following: > > https://reviews.llvm.org/D69116 > https://reviews.llvm.org/D69120 >

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

2019 Nov 13

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

As before, I'm not convinced that we want to allow target-based enable/disable in instcombine for performance. That undermines having a target-independent canonical form in the 1st place. It's not clear to me what the remaining motivating cases look like. If you could post those here or as bugs, I think you'd have a better chance of finding an answer. Let's take a minimal example

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

2019 Nov 14

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

For any of the examples shown below, if the logical equivalent using cmp + other IR instructions is no more than the number of IR instructions as the variant that uses shift, we should consider reversing the canonicalization. To make that happen, you would need to show that at least the minimal cases have codegen that is equal or better using the cmp form for at least a few in-tree targets. My

[cfe-dev] CFG simplification question, and preservation of branching in the original code

2019 Oct 03

[cfe-dev] CFG simplification question, and preservation of branching in the original code

Hi all, > On 2 Oct 2019, at 14:34, Sanjay Patel <spatel at rotateright.com> wrote > Providing target options/overrides to code that is supposed to be target-independent sounds self-defeating to me. I doubt that proposal would gain much support. > Of course, if you're customizing LLVM for your own out-of-trunk backend, you can do anything you'd like if you're willing to

[cfe-dev] CFG simplification question, and preservation of branching in the original code

2019 Oct 01

[cfe-dev] CFG simplification question, and preservation of branching in the original code

Hi Sanjay, Thanks for your reply. > So yes, the IR optimizer (instcombine is the specific pass) sometimes turns icmp (and select) sequences into ALU ops. Instcombine is almost entirely *target-independent* and should remain that way. The (sometimes unfortunate) decision to create shifts were made based on popular targets of the time (PowerPC and/or x86), and other targets may have suffered

[cfe-dev] CFG simplification question, and preservation of branching in the original code

2019 Sep 30

[cfe-dev] CFG simplification question, and preservation of branching in the original code

For the MSP430 example, I'm guess its InstCombiner::transformSExtICmp or InstCombiner::transformZExtICmp ~Craig On Mon, Sep 30, 2019 at 2:21 PM Support IMAP <support at sweetwilliamsl.com> wrote: > Hi all, > > Ok, I just found a much simpler example of the same issue. > > Consider the following code > > int cmpge32_0(long a) { > return a>=0; > } >

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

2019 Oct 26

[AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets

Hi, In relation to this subject, I have submitted bug reports, baseline tests, and patches that go towards this goal: https://bugs.llvm.org/show_bug.cgi?id=43542 <https://bugs.llvm.org/show_bug.cgi?id=43542> https://bugs.llvm.org/show_bug.cgi?id=43559 <https://bugs.llvm.org/show_bug.cgi?id=43559> https://reviews.llvm.org/D69099 <https://reviews.llvm.org/D69099> (baseline

[cfe-dev] CFG simplification question, and preservation of branching in the original code

2019 Oct 04

[cfe-dev] CFG simplification question, and preservation of branching in the original code

On 10/4/19 5:46 AM, Joan Lluch via llvm-dev wrote: Hi all, As a continuation of this thread, I was about to fill a bug report requesting the modification of DAGTypeLegalizer::ExpandIntRes_SIGN_EXTEND, in order to avoid the creation of Shifts for targets with no native support. For example by generating a ‘select' equivalent to a<0 ? -1 : 0 instead of an arithmetic shift right. For targets

[cfe-dev] CFG simplification question, and preservation of branching in the original code

2019 Sep 30

[cfe-dev] CFG simplification question, and preservation of branching in the original code

On Mon, Sep 30, 2019 at 11:52 AM Joan Lluch <joan.lluch at icloud.com> wrote: > > Hi Roman, > > Is "test" actually an implementation of a 64-bit-wide multiplication > compiler-rt builtin? > Then i'd think the main problem is that it is being optimized in the > first place, you could end up with endless recursion… > > > No, this is not a compiler-rt

[cfe-dev] CFG simplification question, and preservation of branching in the original code

2019 Sep 29

[cfe-dev] CFG simplification question, and preservation of branching in the original code

On Sun, Sep 29, 2019 at 3:35 PM Joan Lluch via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > Hi Sanjay, > > Actually, the CodeGenPrepare::optimizeSelectInst is not doing the best it could do in some circumstances: The case of “OptSize" for targets not supporting Select was already mentioned to be detrimental. > > For targets that actually have selects, but branches

[cfe-dev] CFG simplification question, and preservation of branching in the original code

2019 Sep 25

[cfe-dev] CFG simplification question, and preservation of branching in the original code

Changing the order of the checks in CodeGenPrepare::optimizeSelectInst() sounds good to me. But you may need to go further for optimum performance. For example, we may be canonicalizing math/logic IR patterns into 'select' such as in the recent: https://reviews.llvm.org/D67799 So if you want those to become ALU ops again rather than branches, then you need to do the transform later in

Bug: Library functions for ISD::SRA, ISD::SHL, and ISD::SRL

2019 Jun 11

Bug: Library functions for ISD::SRA, ISD::SHL, and ISD::SRL

Hi Eli, First of all, please I would appreciate that you try to not confuse my limited use of English with stupidity or lack or criteria in other subjects. I’m not English native, so please keep that in mind. You have been significantly helpful in the recent past so please keep on. Interestingly, you made a mention of a related but not identical issue. It is true that most (or all) processors

Bug: Library functions for ISD::SRA, ISD::SHL, and ISD::SRL

2019 Jun 10

Bug: Library functions for ISD::SRA, ISD::SHL, and ISD::SRL

Hi Eli, Thanks for pointing to the CTLZ_ZERO_UNDEF “LibCall” implementation. I have not it in the version that I am currently using, so it’s nice to know that it’s implemented now. Incidentally, the CTLZ… implementation is IDENTICAL to what I am proposing for the Shifts. This is not just adding support for “out-of-tree-targets”, but giving consistency to the fact that we have perfectly defined

[LLVMdev] InstCombine adds bit masks, confuses self, others

2012 Apr 17

[LLVMdev] InstCombine adds bit masks, confuses self, others

On Apr 16, 2012, at 3:30 PM, Chandler Carruth <chandlerc at google.com> wrote: > Does sinking these into the DAGCombine layer help? How much does it break? I tried disabling just the InstCombine transforms that hide shl instructions behind bitmasks. Even though DAGCombine has the same transforms, it causes some pretty bad regressions: External/SPEC/CINT95/147_vortex/147_vortex

[LLVMdev] InstCombine adds bit masks, confuses self, others

2012 Apr 17

[LLVMdev] InstCombine adds bit masks, confuses self, others

On Wed, Apr 18, 2012 at 12:22 AM, Jakob Stoklund Olesen <stoklund at 2pi.dk>wrote: > I tried disabling just the InstCombine transforms that hide shl > instructions behind bitmasks. Even though DAGCombine has the same > transforms, it causes some pretty bad regressions: > I wonder about your idea of still combining most of the shifts, but leaving the last bits uncombined...

[LLVMdev] [global-isel] Simplifying the simplifier

2013 Aug 08

[LLVMdev] [global-isel] Simplifying the simplifier

----- Original Message ----- [snip] > > DAGCombine is supposed to clean up legalization products, but today > it seems to be accumulating some transformations that probably > belong in InstCombine. Since it is running so late in the pass > pipeline, its canonicalizing approach is causing problems for > targets that are trying to arrange the IR optimally. Optimal and >

[LLVMdev] InstCombine adds bit masks, confuses self, others

2012 Apr 16

[LLVMdev] InstCombine adds bit masks, confuses self, others

Look at this silly function: $ cat small.c unsigned f(unsigned a, unsigned *p) { unsigned x = a/4; p[0] = x; p[1] = x+x; return p[1] - 2*p[0]; } GCC turns this into straightforward code and figures out the 0 return value: shrl $2, %edi movl %edi, (%rsi) addl %edi, %edi movl %edi, 4(%rsi) movl $0, %eax ret LLVM optimizes the code: $ clang -O -S -o- small.c -emit-llvm define i32

How to change CLang struct alignment behaviour?

2019 May 14

How to change CLang struct alignment behaviour?

Hi John, On Tue, 14 May 2019 at 17:51, Joan Lluch <joan.lluch at icloud.com> wrote: > This problem is also shared by the MSP430 target, and it’s very easy to reproduce by just compiling the code that I posted before. That's some good detective work; it definitely explains what you're seeing. Since MSP430 is affected it would probably be pretty easy to upstream an alignment-aware

How to change CLang struct alignment behaviour?

2019 May 13

How to change CLang struct alignment behaviour?

Hi Joan, On Mon, 13 May 2019 at 18:01, Joan Lluch <joan.lluch at icloud.com> wrote: > After looking at it a bit further, I think this is a Clang thing. Clang issues “align 2” if the struct has at least one int (2 bytes), but also if the entire struct size is multiple of 2. For example a struct with 4 char members. In these cases the LLVM backend correctly creates word sized load/stores

Bug: Library functions for ISD::SRA, ISD::SHL, and ISD::SRL

2019 Jun 10

Bug: Library functions for ISD::SRA, ISD::SHL, and ISD::SRL

LLVM appears to support Library functions for ISD::SRA ,ISD::SHL, and ISD::SRL, as they are properly defined in RuntimeLibCalls.def. The library functions defined in RuntimeLibCalls.def (among others) are these: HANDLE_LIBCALL(SRA_I16, "__ashrhi3") HANDLE_LIBCALL(SRA_I32, "__ashrsi3") HANDLE_LIBCALL(SRA_I64, "__ashrdi3") However, setting

similar to: [AVR] [MSP430] Code gen improvements for 8 bit and 16 bit targets