thr3ads.net - similar to: "What should a truncating store do?"

Displaying 20 results from an estimated 1200 matches similar to: "What should a truncating store do?"

2017 Sep 15

What should a truncating store do?

They are starting to look complicated. The patch linked is interesting, perhaps v1 vectors are special cased. It shouldn't be too onerous to work out what one or two in tree back ends do by experimentation. Thanks again, it's great to have context beyond the source. On Fri, Sep 15, 2017 at 9:41 PM, Friedman, Eli <efriedma at codeaurora.org> wrote: > On 9/15/2017 12:10 PM, Jon

What should a truncating store do?

2017 Sep 25

What should a truncating store do?

(Not sure if this exactly maps to “truncating store”, but I think it at least touches some of the subjects discussed in this thread) Our out-of-tree-target need several patches to get things working correctly for us. We have introduced i24 and i40 types in ValueTypes/MachineValueTypes (in addition to the normal pow-of-2 types). And we have vectors of those (v2i40, v4i40). And the byte size in our

Intel AMX programming model discussion.

2020 Aug 14

Intel AMX programming model discussion.

Hi, Intel Advanced Matrix Extensions (Intel AMX) is a new programming paradigm consisting of two components: a set of 2-dimensional registers (tiles) representing sub-arrays from a larger 2-dimensional memory image, and accelerators able to operate on tiles. Capability of Intel AMX implementation is enumerated by palettes. Two palettes are supported: palette 0 represents the initialized state and

Intel AMX programming model discussion.

2020 Aug 14

Intel AMX programming model discussion.

[Yuanke] AMX register is special. It needs to be configured before use and the config instruction is expensive. To avoid unnecessary tile configure, we collect the tile shape information as much as possible and combine them into one ldtilecfg instruction. The ldtilecfg instruction should dominate any AMX instruction that access tile register. On the other side, the ldtilecfg should post-dominated

Intel AMX programming model discussion.

2020 Aug 18

Intel AMX programming model discussion.

The AMX registers are complicated. The single configuration register (which is mostly used implicitly, similar to MXCSR for floating point) controls the shape of all the tile registers, and if you change the tile configuration every single tile register is cleared. In practice, if we have to change the the configuration while any of the tile registers are live, performance is going to be terrible.

Troubleshooting underidentification issues in structural equation modelling (SEM)

2013 Feb 09

Troubleshooting underidentification issues in structural equation modelling (SEM)

Hi all, hope someone can help me out with this. Background Introduction I have a data set consisting of data collected from a questionnaire that I wish to validate. I have chosen to use confirmatory factor analysis to analyse this data set. Instrument The instrument consists of 11 subscales. There is a total of 68 items in the 11 subscales. Each item is scored on an integer scale between 1 to 4.

Intel AMX programming model discussion.

2020 Aug 19

Intel AMX programming model discussion.

Hi Hal, There is 3 aspect to be solved. 1. The HW support max shape 16x16, so there are many register classes from 1x1 to 16x16. We need 256 register classes. 2. We want to support variable shape, so compiler don't know what register class to fit tile shape as it is only known in runtime. 3. The tile configure is to configure physical tile register, so we need to allocate

cannot boot any domU - case #2

2005 Aug 09

cannot boot any domU - case #2

Hi, I noticed the earlier thread on this started by Scott Koranda (sorry - was reading off gmane so I can''t reply to thread) I have exactly the same symptoms: machine hangs as soon as I attempt to start a single domU, just after emitting "using config file..." At that point the machine appears completely wedged. The sequence of 3 ^A on the console no longer does

Intel AMX programming model discussion.

2020 Aug 19

Intel AMX programming model discussion.

There is no problem to have 256 register classes. Just a lot of register classes to me. We don't assume the shape of each physical register be 16x16, it is defined by user. For variable shape, I mean the shape is known in runtime and in compile time the shape is unknown. Take below code as an example, the %row and %col are variable instead of constant. Compiler recognizes llvm.x86.tileloadd64

Intel AMX programming model discussion.

2020 Aug 19

Intel AMX programming model discussion.

> When the tile shape is unknown at compile time, how do you plan to do the register allocation of the tiles? My question is: do you do the allocation for this case in the same way as you would if you knew the size was 16x16 (i.e., conservatively assume the largest size)? I think what will happen is that the registers are allocated based on a number of runtime values that are assumed to be

Intel AMX programming model discussion.

2020 Aug 19

Intel AMX programming model discussion.

The width and height can be runtime values that we would just copy into 64 byte configuration block we pass to ldtilecfg. So the code doesn't need to be multiversioned. The user code would also use those values to update pointers in the loops they write using the tiles. If we can't determine that two tiles were defined with the same width and height we need to assume the shape is different

[LLVMdev] JVM Backend

2009 Nov 29

[LLVMdev] JVM Backend

> So it will stack overflow on tail calls At the moment, yes. But then again, so does java. Also, it looks like they're working on support for tail calls in the Da Vinci Machine[1]. > and break with run-time errors When I said it raises an assertion, I meant at compile-time. > on structs? No, structs are supported. The only unsupported types at the moment (as far as I am aware) are

Intel AMX programming model discussion.

2020 Aug 20

Intel AMX programming model discussion.

On 8/20/20 2:47 PM, Topper, Craig wrote: > > I think I’m still missing something here. The configuration is per > tile. The multiply instructions take a MxK tile and multiply it by a > KxN tile and accumulate into an MxN tile. So the configuration needs > to know how many of each size of tile it needs to avoid a spill. > Wouldn’t the register allocator then need to know which

[LLVMdev] JVM Backend

2009 Nov 29

[LLVMdev] JVM Backend

On Sunday 29 November 2009 02:06:04 you wrote: > > So it will stack overflow on tail calls > > At the moment, yes. But then again, so does java. Sure but a lot of people like me are using LLVM precisely because it offers these wonderful features. As long as your JVM backend does not handle these features correctly its utility is greatly diminished. > Also, it looks like

[LLVMdev] JVM Backend

2009 Nov 25

[LLVMdev] JVM Backend

Hello, David First of all, thanks for the backend submission. I let Chris comment about the procedure of adding it to the tree. :) I just did a quick look into the code. The comments are below > Indirect function calls don't work yet, and there's probably some > minor bugs in it, but it works well for the test cases that I've run > through it. Could you please provide some

Intel AMX programming model discussion.

2020 Aug 21

Intel AMX programming model discussion.

Hi Hal, The proposal is attractive to me, but there is something I still can't figure out. Let's take below MIR as an example. We assume we have 256 register classes (vtile1x1, vtile1x2, ..., tile16x16). 1. After instruction selection, the pseudo AMX instruction is generated. The name of pseudo instructions have 'P' prefix. Now all the AMX pseudo instruction take vtile as

Changes to 'ADJCALLSTACK*' and 'callseq_*' between LLVM v4.0 and v5.0

2017 Sep 15

Changes to 'ADJCALLSTACK*' and 'callseq_*' between LLVM v4.0 and v5.0

Hi LLVM-Devs, I have managed to complete updating our sources from LLVM v4.0 to v5.0, but I am getting selection errors for 'callseq_end'. I am aware that the 'ADJCALLSTACKUP' and 'ADJCALLSTACKDOWN' patterns have changed, and have added an additional argument to the TD descriptions for these. There are interactions with 'ISD::CALL' and 'ISD::RET_FLAG',

Changes to 'ADJCALLSTACK*' and 'callseq_*' between LLVM v4.0 and v5.0

2017 Sep 15

Changes to 'ADJCALLSTACK*' and 'callseq_*' between LLVM v4.0 and v5.0

Hi Martin, Pseudo CALLSEQ_START was changed in r302527, commit message contains details on the changes. However CALLSEQ_END was not modified. If your made changes to ADJCALLSTACKUP to add additional argument, that may result in error. Thanks, --Serge 2017-09-15 19:09 GMT+07:00 Martin J. O'Riordan via llvm-dev < llvm-dev at lists.llvm.org>: > Hi LLVM-Devs, > > I have managed

[LLVMdev] JVM Backend

2009 Nov 29

[LLVMdev] JVM Backend

Hi David, > No, structs are supported. The only unsupported types at the moment > (as far as I am aware) are things like i31 and f80. for funky sized integers, the most important operations to support are loads and stores, shifts and logical operations (and, or, xor). These are the ones that the optimizers like to introduce most. The logical operations are straightforward. Loads and

hierarchical confirmatory factor analysis with sem package

2013 Apr 28

hierarchical confirmatory factor analysis with sem package

Hi, I am doing a hierarchical CFA using the sem package. I have 20 items, and I have 2 factors (F3 and F4), and also F1 and F2 are nested within F3. Here is the code that I have, but it is giving me an error message "Warning message: In eval(expr, envir, enclos) : Negative parameter variances. Model may be underidentified." and a further error "Error in summary.objectiveML(cfa,

similar to: What should a truncating store do?