thr3ads.net - search: "tokenfactors"

Displaying 20 results from an estimated 80 matches for "tokenfactors".

Did you mean: tokenfactor

[LLVMdev] FoldingSetNodeID operations inefficiency

2008 Apr 28

[LLVMdev] FoldingSetNodeID operations inefficiency

...these TFi nodes as operands: _____TF______ / \ TF1 ... __TFk__ / \ / \ LD1.1..LD1.64 LDk.1...LDk.64 These changes (a) and (b) significantely reduce the compilation time on my pathological use-cases with huge TokenFactors. I attach the proposed patch to this mail for review. The only questions I still have are the following: - Which approach is better, b.1 or b.2? - If I have a huge number k of loads/nodes (i.e. nodes N1 ... Nk), to be put into a TokenFactor, what is a better order to put them into a TF? Is it...

[LLVMdev] FoldingSetNodeID operations inefficiency

2008 Apr 30

[LLVMdev] FoldingSetNodeID operations inefficiency

...\ > > TF1 ... __TFk__ > > / \ / \ > > LD1.1..LD1.64 LDk.1...LDk.64 > > > > > > These changes (a) and (b) significantely reduce the compilation time > > on my pathological use-cases with huge TokenFactors. > > > > I attach the proposed patch to this mail for review. > > > > The only questions I still have are the following: > > - Which approach is better, b.1 or b.2? > > I think b.2 is slightly better than b.1, because b.2 leaves all > the nodes at the same de...

[LLVMdev] How can I get rid of "OPFL_Chain" in myCPUGenInstrInfo.inc

2014 Apr 28

[LLVMdev] How can I get rid of "OPFL_Chain" in myCPUGenInstrInfo.inc

guys, 1)i made a mistake in my post. the said TF node was created when Selected() was called in ADDC node. 2) the source code under test short a,b; void test() { a+=b; } 3)the DAG after ADDC was seleced: Select node: 0x4977a20: ch,glue = <<Unknown Machine Node #65419>> 0x4972bd0, 0x49731d0, 0x4976c20, 0x49730d0<Mem:LD1[@a](align=2)> Result node: 0x4977a20: ch,glue =

how to type-legalize a dag

2016 Mar 15

how to type-legalize a dag

I have added new instructions to my target, unfortunately they are not being properly type legalized. The instructions I've added are a vector add, vector load and a vector store. Can anyone lend a hand on how to legalize them so that my target would be able to recognize them. Below is the output of llc with a -debug-only=isel. As you could see the output type for load, store, and add changes

[LLVMdev] How can I get rid of "OPFL_Chain" in myCPUGenInstrInfo.inc

2014 Apr 26

[LLVMdev] How can I get rid of "OPFL_Chain" in myCPUGenInstrInfo.inc

hi Tim,guys, it was regarding splitting 16-bit ADDC to two 8-bit ADDC+ADDE. the 8-bit ADDE instruction is defined as: let Constraints="$dst=$op0",mayStore=1, hasSideEffects=0,neverHasSideEffects=1 in def ADDErm: myInstr <0x0, (outs Intregs:$dst) (ins Intregs:$op0,MEMi:$op1), "", [set IntRegs:$dest (adde IntRegs:$op0, (load ADDRi:$op1))] > very unlucky, this

Ensuring chain dependencies with expansion to libcalls

2017 Feb 14

Ensuring chain dependencies with expansion to libcalls

Hi all, Our target does not have native support for 64-bit integers, so we rely on library calls for certain operations (like sdiv). We recently ran into a problem where these operations that are expanded to library calls aren't maintaining the proper ordering in relation to other chains in the DAG. The following snippet of a DAG demonstrates the problem. t0: ch = EntryToken t2:

[LLVMdev] FoldingSetNodeID operations inefficiency

2008 Apr 23

[LLVMdev] FoldingSetNodeID operations inefficiency

Hi, While profiling LLVM using my test-cases with huge MBBs, I noticed that FoldingSetNodeID operations (ComputeHash,insertion,etc) may become really inefficient for the nodes, which have very many operands. I can give you an example of what is meant by "very many". In my test-case (you can fetch it from here http://llvm.org/bugs/attachment.cgi?id=1275), which is just one HUGE MBB

[LLVMdev] Frame index arithmetic

2010 Jan 19

[LLVMdev] Frame index arithmetic

Hi Mark, >> Sounds like your load / store address selection routine isn't working >> like what you expected. >> > > Thanks for the reply. Unfortunately, this doesn't seem to be the problem. do you handle truncating stores and extending loads? Ciao, Duncan.

[LLVMdev] Complex load patterns and token factors

2012 Jun 23

[LLVMdev] Complex load patterns and token factors

Working on a target I added this pattern: def : Pat<(v4i64 (load xoaddr:$src)), (QVFCTIDb (QVLFDXb xoaddr:$src))>; which represents an actual load followed by a necessary conversion operation. The problem is that when this matches any TokenFactor that was attached to the load node gets attached, not to the inner load instruction, but the outer conversion operation. This is

[LLVMdev] Non-Chain Chains

2012 Jan 05

[LLVMdev] Non-Chain Chains

Following up on my call schedule posting from yesterday, I am now trying to add edges from the call to the instruction before it. This seemed easiest to do in SelectionDAGBuilder but it is troublesome. A couple of questions: - How do I get the previous instruction that was translated? prior(CS.getInstruction()) in visitCall returns something invalid. When I try to call getValue on the

rL296252 Made large integer operation codegen significantly worse.

2017 Feb 28

rL296252 Made large integer operation codegen significantly worse.

I see we're missing an isel pattern for add producing carry and doing a memory RMW. I'm going to see if adding that helps anything. ~Craig On Mon, Feb 27, 2017 at 8:47 PM, Nirav Davé via llvm-dev < llvm-dev at lists.llvm.org> wrote: > Yes. I'm seeing that as well. Not clear what's going on. > > In any case it looks to be unrelated to the alias analysis so barring

[LLVMdev] Frame index arithmetic

2010 Jan 19

[LLVMdev] Frame index arithmetic

>> I'm trying something cunning/crazy with the stack - implementing it in a type of memory that can only be addressed via immediates. >> >> I've got this mostly working. However, I came across a problem which I've been unable to work around: lowering the IR (even without any optimisations enabled) often requires the pattern: >> >> i32 = FrameIndex

[LLVMdev] FoldingSetNodeID operations inefficiency

2008 Apr 24

[LLVMdev] FoldingSetNodeID operations inefficiency

Hi Chris, This is a good idea and I started thinking in that direction already. But what I don't quite understand the TFs, how TFs are formed and which rules they should obey to. For example now: > PendingLoads created by the SelectionDAGLowering::getLoadFrom and then copied into the > TokenFactor node by SelectionDAGLowering::getRoot called from the >

Instruction selection problems due to SelectionDAGBuilder

2016 Aug 02

Instruction selection problems due to SelectionDAGBuilder

Hello. I'm having problems at instruction selection with my back end with the following basic-block due to a vector add with immediate constant vector (obtained by vectorizing a simple C program doing vector sum map): vector.ph: ; preds = %vector.memcheck50 %.splatinsert = insertelement <8 x i64> undef, i64 %i.07.unr, i32 0

[LLVMdev] Load serialisation during selection DAG building

2012 Aug 14

[LLVMdev] Load serialisation during selection DAG building

I looked into those patches but I don't think they will help in my situation because my problems occur during instruction selection rather than scheduling. A simple and concrete example is a pattern like: [(set GR:$dst (add GR:$src (nvload addr:$mem)))] where nvload matches a load provided that isVolatile() is false. If the selection DAG looks like: | | LD1 LD2 ^

[LLVMdev] Load serialisation during selection DAG building

2012 Aug 13

[LLVMdev] Load serialisation during selection DAG building

I've got a question about how SelectionDAGBuilder treats loads. The LLVM Language Reference Manual explicitly states that the order of volatile operations may be changed relative to non-volatile operations. However, when the SelectionDAGBuilder in LLVM 3.1 encounters a volatile load, it flushes all pending loads and then chains the volatile load onto them meaning that the volatile load must

Error in v64i32 type in x86 backend

2017 Jul 07

Error in v64i32 type in x86 backend

Have you read http://llvm.org/docs/WritingAnLLVMBackend.html and http://llvm.org/docs/CodeGenerator.html ? http://llvm.org/docs/WritingAnLLVMBackend.html#instruction-selector describes how to define a store instruction. -Eli On 7/6/2017 6:51 PM, hameeza ahmed via llvm-dev wrote: > Please correct me i m stuck at this point. > > On Jul 6, 2017 5:18 PM, "hameeza ahmed"

[LLVMdev] Load serialisation during selection DAG building

2012 Aug 13

[LLVMdev] Load serialisation during selection DAG building

Steve, I had created a patch last year that does something similar to what you describe for regular loads and stores using aliasing information. I think that the last message in the thread was: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120402/140299.html This approach has worked for me, but it is not the preferred solution going forward. The preferred solution is to keep the

[LLVMdev] Making Sense of ISel DAG Output

2008 Oct 03

[LLVMdev] Making Sense of ISel DAG Output

On Thursday 02 October 2008 19:32, Dan Gohman wrote: > Looking at your dump() output above, it looks like the pre-selection > loads have multiple uses, so even though you've managed to match a > larger pattern that incorporates them, they still need to exist to > satisfy some other users. Yes, I looked at that too. It looks like these other uses end up being chains to

Error in v64i32 type in x86 backend

2017 Jul 07

Error in v64i32 type in x86 backend

also i further run the following command; llc -debug filer-knl_o3.ll and its output is attached here. by looking at the output can we say that legalization runs fine and the error is due to instruction selection/ pattern matching which is not yet implemented? so do i need to worry and try to correct it at this stage or should i move forward to implement instruction selection/ pattern matching?

search for: tokenfactors