Displaying 20 results from an estimated 1000 matches similar to: "How to avoid register spills at wide integer addition?"
2012 Apr 05
0
[LLVMdev] Suboptimal code due to excessive spilling
I don't know much about this, but maybe -mllvm -unroll-count=1 can be used as a workaround?
/Patrik Hägglund
-----Original Message-----
From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu] On Behalf Of Brent Walker
Sent: den 28 mars 2012 03:18
To: llvmdev
Subject: [LLVMdev] Suboptimal code due to excessive spilling
Hi,
I have run into the following strange behavior
2012 Mar 28
2
[LLVMdev] Suboptimal code due to excessive spilling
Hi,
I have run into the following strange behavior and wanted to ask for
some advice. For the C program below, function sum() gets inlined in
foo() but the code generated looks very suboptimal (the code is an
extract from a larger program).
Below I show the 32-bit x86 assembly as produced by the demo page on
the llvm home page ("Output A"). As you can see from the assembly,
after
2010 Dec 15
1
pmnorm: probabilites don't sum up to 1
Dear list member,
I struggle with the problem, why the probabilities of choosing one of
three mutually exclusive alternatives don?t sum up to 1!
Let?s assume we have three alternatives X, Y, and Z. Let?s further
assume we know their respective utilities:
uX, uY, uZ. I?m interested in calculating the probability of choosing
X, Y, and Z.
Since I assume that the alternatives are mutually
2006 Aug 24
0
ca.po Pz test question
Hello,
I have a few questions about & Ouliaris Unit Root Test of type Pz
1) I noticed that critical values given in the article "Asymptotic properties of residual..." by Phillips and Ouliaris are different from those given by R. More presicely - they are swaped with each other. Could explain why?
2) that question is quite stupid. Can you explain what coefficients for test Pz mean?
2015 Feb 02
3
[LLVMdev] LLVM IR i128
Hi everyone!
Here, I have a question and am curious about i128. I want to know how the LLVM handle i128, because many compiler backend doesn't support i128 directly. So I am very curious and want to how the llvm handle this situation? Besides i128, such as i256, i512, even i24? Thanks.
Best Regards
Wu Zhao
-------------- next part --------------
An HTML attachment was scrubbed...
2014 Sep 04
2
[LLVMdev] How to deal with wider Integer type?
Hi,
I am currently working on an opencl project based on LLVM, the target
device is 32bit.
I met a problem that some llvm passes like GVN SROA will generate some IR
operating
on wide integer types like i128 or i512. But the device does not support
such kind of data type.
Is there any idea on how to lower this kind of IR to only operate on i32 or
vector of i32? Or is there any existing code handle
2016 Mar 24
0
summary( prcomp(*, tol = .) ) -- and 'rank.'
Martin, I fully agree. This becomes an issue when you have big matrices.
(Note that there are awesome methods for actually only computing a small
number of PCs (unlike your code which uses svn which gets all of them);
these are available in various CRAN packages).
Best,
Kasper
On Thu, Mar 24, 2016 at 1:09 PM, Martin Maechler <maechler at stat.math.ethz.ch
> wrote:
> Following from
2008 Mar 26
2
[LLVMdev] Checked arithmetic
Hi Chris,
> Why not define an "add with overflow" intrinsic that returns its value and
> overflow bit as an i1?
what's the point? We have this today with apint codegen (if you turn on
LegalizeTypes). For example, this function
define i1 @cc(i32 %x, i32 %y) {
%xx = zext i32 %x to i33
%yy = zext i32 %y to i33
%s = add i33 %xx, %yy
%tmp = lshr i33 %s, 32
%b = trunc
2012 Aug 11
2
IPv6 on Centos 6
We've been running ipv6 for a year or so now, but some of our newer
instances (all on an ESX cluster) are not working. It looks like it's
all of our Centos 6 instances. I'm hoping someone can point me in the
right direction...
tshark indicates that it's neighbor discovery that's failing:
<centos666.peak.org> [26] # cat ../network
NETWORKING=yes
2005 Oct 13
0
BIND help please
I am setting up a test domain to learn bind, and how it,s working.
The master domain is
bbb.ttt.domene 10.11.12.13
the child domain
ftp.bbb.ttt.domene 10.11.55.77
The thing I don't understand is, why don't the query report additional
info (ip address)?
Her is two queries(the first the one that troubles me), cut of
named.conf, test of zone file, the zone file and cut from log file.
2017 Mar 21
3
"isolinux.bin missing or corrupt" when booting USB flash drive in old PC
Thomas Schmitt wrote:
> My main point of interest is which block is really loaded as first one
> of isolinux.bin.
[Program snipped.]
> (It could be shorter if i did not insist in big endian words.)
Why insisting?
Anyway appended is a dumper version. I could put up a binery version
if needed. Let me know if so.
--
MartinS
/*
2016 Mar 24
3
summary( prcomp(*, tol = .) ) -- and 'rank.'
Following from the R-help thread of March 22 on "Memory usage in prcomp",
I've started looking into adding an optional 'rank.' argument
to prcomp allowing to more efficiently get only a few PCs
instead of the full p PCs, say when p = 1000 and you know you
only want 5 PCs.
(https://stat.ethz.ch/pipermail/r-help/2016-March/437228.html
As it was mentioned, we already
2016 Mar 25
0
summary( prcomp(*, tol = .) ) -- and 'rank.'
As I see it, the display showing the first p << n PCs adding up to 100% of the variance is plainly wrong.
I suspect it comes about via a mental short-circuit: If we try to control p using a tolerance, then that amounts to saying that the remaining PCs are effectively zero-variance, but that is (usually) not the intention at all.
The common case is that the remainder terms have a roughly
2016 Mar 25
0
summary( prcomp(*, tol = .) ) -- and 'rank.'
> On 25 Mar 2016, at 10:08 , Jari Oksanen <jari.oksanen at oulu.fi> wrote:
>
>>
>> On 25 Mar 2016, at 10:41 am, peter dalgaard <pdalgd at gmail.com> wrote:
>>
>> As I see it, the display showing the first p << n PCs adding up to 100% of the variance is plainly wrong.
>>
>> I suspect it comes about via a mental short-circuit: If we
2016 Nov 09
10
Is the correct behavior of getelementptr i192* for opt + llc -march=aarch64?
Hi all,
opt and opt + llc generate the difference aarch64 asm code for the following LLVM code.
Is it intended behavior?
I expected (A) because I cast %p from i192* to i64*.
The information is dropped by opt and 8-byte padding is inserted or I write a bad code?
% cat a.ll
define void @store0_to_p4(i192* %p)
{
%p1 = bitcast i192* %p to i64*
%p2 = getelementptr i64, i64* %p1, i64 3
%p3 =
2016 Mar 24
3
summary( prcomp(*, tol = .) ) -- and 'rank.'
I agree with Kasper, this is a 'big' issue. Does your method of taking only
n PCs reduce the load on memory?
The new addition to the summary looks like a good idea, but Proportion of
Variance as you describe it may be confusing to new users. Am I correct in
saying Proportion of variance describes the amount of variance with respect
to the number of components the user chooses to show? So
2007 Apr 18
0
[patch 9/9] Guest page hinting: full s390 support.
From: Martin Schwidefsky <schwidefsky@de.ibm.com>
From: Hubertus Franke <frankeh@watson.ibm.com>
From: Himanshu Raj <rhim@cc.gatech.edu>
[patch 9/9] Guest page hinting: full s390 support.
s390 uses the milli-coded ESSA instruction to set the page state. The
page state is formed by four guest page states called block usage states
and three host page states called block content
2007 Apr 18
0
[patch 9/9] Guest page hinting: full s390 support.
From: Martin Schwidefsky <schwidefsky@de.ibm.com>
From: Hubertus Franke <frankeh@watson.ibm.com>
From: Himanshu Raj <rhim@cc.gatech.edu>
[patch 9/9] Guest page hinting: full s390 support.
s390 uses the milli-coded ESSA instruction to set the page state. The
page state is formed by four guest page states called block usage states
and three host page states called block content
2008 Mar 26
0
[LLVMdev] Checked arithmetic
On Wed, 26 Mar 2008, Duncan Sands wrote:
> Hi Chris,
>
>> Why not define an "add with overflow" intrinsic that returns its value and
>> overflow bit as an i1?
>
> what's the point? We have this today with apint codegen (if you turn on
> LegalizeTypes). For example, this function
The desired code is something like:
foo:
addl %eax, %ecx
jo
2016 Mar 25
2
summary( prcomp(*, tol = .) ) -- and 'rank.'
> On 25 Mar 2016, at 10:41 am, peter dalgaard <pdalgd at gmail.com> wrote:
>
> As I see it, the display showing the first p << n PCs adding up to 100% of the variance is plainly wrong.
>
> I suspect it comes about via a mental short-circuit: If we try to control p using a tolerance, then that amounts to saying that the remaining PCs are effectively zero-variance, but