thr3ads.net - similar to: "lean and mean lm/glm?"

Displaying 20 results from an estimated 20000 matches similar to: "lean and mean lm/glm?"

2012 Apr 26

How does .Fortran "dqrls" work?

Hi, all. I want to write some functions like glm() so i studied it. In glm.fit(), it calls a fortran subroutine named "dqrfit" to compute least squares solutions to the system x * b = y To learn how "dqrfit" works, I just follow how glm() calls "dqrfit" by my own example, my codes are given below: > qr <- >

Help: Where can I find the code for 'C_Cdqrls'?

2013 Apr 23

Help: Where can I find the code for 'C_Cdqrls'?

Dear all, I’m not sure if it is O.K. to ask this question here. But where can I find the code for the function ‘C_Cdqrls’ which is called by the R function ‘lsfit‘. Thank you all. Sorry for being naïve if so. -------------------- Ziqiang Zhao 2013-04-23 [[alternative HTML version deleted]]

R-users

2007 Dec 18

R-users

R-users E-mail: r-help@r-project.org I have a quenstion on "gam()" in "gam" package. The help of gam() says: 'gam' uses the _backfitting algorithm_ to combine different smoothing or fitting methods. On the other hand, lm.wfit(), which is a routine of gam.fit() contains: z <- .Fortran("dqrls", qr = x * wts, n = n, p = p, y = y *

Bug in glm.fit? (PR#1331)

2002 Feb 27

Bug in glm.fit? (PR#1331)

G'day all, I had a look at the GLM code of R (1.4.1) and I believe that there are problems with the function "glm.fit" that may bite in rare circumstances. Note, I have no data set with which I ran into trouble. This report is solely based on having a look at the code. Below I append a listing of the glm.fit function as produced by my system. I have added line numbers so that I

"gam()" in "gam" package

2007 Dec 18

"gam()" in "gam" package

lm.fit peak memory usage

2003 Jan 03

lm.fit peak memory usage

Hi, I've been running out of memory while using the lm.fit function - have solved the problem and thought there might be interest in incorporating some of the changes. Looked at the source and changed the following lines storage.mode(x) <- "double" storage.mode(y) <- "double" z <- .Fortran("dqrls", qr = x, n = n, p = p, y = y, ny = ny,

bigglm() results different from glm()

2009 Jul 03

bigglm() results different from glm()

Hi Sir, Thanks for making package available to us. I am facing few problems if you can give some hints: Problem-1: The model summary and residual deviance matched (in the mail below) but I didn't understand why AIC is still different. > AIC(m1) [1] 532965 > AIC(m1big_longer) [1] 101442.9 Problem-2: chunksize argument is there in bigglm but not in biglm, consequently,

how to use large data set ?

2006 Jul 19

how to use large data set ?

Un texte encapsul? et encod? dans un jeu de caract?res inconnu a ?t? nettoy?... Nom : non disponible Url : https://stat.ethz.ch/pipermail/r-help/attachments/20060719/a0844373/attachment.pl

bigglm() results different from glm()

2009 Mar 17

bigglm() results different from glm()

Dear all, I am using the bigglm package to fit a few GLM's to a large dataset (3 million rows, 6 columns). While trying to fit a Poisson GLM I noticed that the coefficient estimates were very different from what I obtained when estimating the model on a smaller dataset using glm(), I wrote a very basic toy example to compare the results of bigglm() against a glm() call. Consider the

How to Extract Information from SIMEX Output

2011 Apr 19

How to Extract Information from SIMEX Output

Below is a SIMEX object that was generated with the "simex" function from the "simex" package applied to a logistic regression fit. From this mountain of information I would like to extract all of the values summarized in this line: .. ..$ variance.jackknife: num [1:5, 1:4] 1.684 1.144 0.85 0.624 0.519 ... Can someone suggest how to go about doing this? I can extract the

big data

2010 Sep 08

big data

Hello, I searched the internet but i didn't find the answer for the next problem: I want to do a glm on a csv file consisting of 25 columns and 4 mln rows. Not all the columns are relevant. My problem is to read the data into R. Manipulate the data and then do a glm. I've tried with: dd<-scan("myfile.csv",colClasses=classes) dat<-as.data.frame(dd) My question is: what

Comparison: glm() vs. bigglm()

2007 Jun 29

Comparison: glm() vs. bigglm()

Hi, Until now, I thought that the results of glm() and bigglm() would coincide. Probably a naive assumption? Anyways, I've been using bigglm() on some datasets I have available. One of the sets has >15M observations. I have 3 continuous predictors (A, B, C) and a binary outcome (Y). And tried the following: m1 <- bigglm(Y~A+B+C, family=binomial(), data=dataset1, chunksize=10e6)

ff usage for glm

2012 Mar 30

ff usage for glm

Greetings useRs, Can anyone provide an example how to use ff to feed a very large data frame to glm? The data.frame cannot be loaded in R using conventional read.csv as it is too big. glm(...,data=ff.file) ?? Thank you Stephen B

Question on the idiom: start <- coef; start[fit$pivot] <- coef

1999 Apr 30

Question on the idiom: start <- coef; start[fit$pivot] <- coef

I wonder if someone could explain how the following R idiom works (it's used in glm.fit). start <- coef start[fit$pivot] <- coef coef is a vector of coefficients, set by .Fortran("dqrls", ...). fit$pivot is a vector of integer indexes (indicating how dqrls permuted the columns of x). If coef has n elements, fit$pivot is a permutation of seq(1,5). start[fit$pivot]

NaN when using dffits, stemming from lm.influence call

2006 Aug 31

NaN when using dffits, stemming from lm.influence call

Hi all I'm getting a NaN returned on using dffits, as explained below. To me, there seems no obvious (or non-obvious reason for that matter) reason why a NaN appears. Before I start digging further, can anyone see why dffits might be failing? Is there a problem with the data? Consider: # Load data dep <-

R-1.7.0 build feedback: NetBSD 1.6 (PR#2837)

2003 Apr 24

R-1.7.0 build feedback: NetBSD 1.6 (PR#2837)

R-1.7.0 built on NetBSD 1.6, but the validation test suite failed: Machinetype: Intel Pentium III (600 MHz); NetBSD 1.6 (GENERIC) Remote gcc version: gcc (GCC) 3.2.2 Remote g++ version: g++ (GCC) 3.2.2 Configure environment: CC=gcc CXX=g++ LDFLAGS=-Wl,-rpath,/usr/local/lib make[5]: Entering directory `/local/build/R-1.7.0/src/library' >>> Building/Updating

Problem in matrix definition?

2009 Aug 31

Problem in matrix definition?

I'm implementing a function to compute the moore-penrose inverse, using a code from the article: Fast Computation of Moore-Penrose Inverse Matrices. Neural Information Processing - Letters and Reviews. Vol.8, No.2, August 2005 However, the R presents an error message when I use the geninv. The odd thing is that the error occurs for some arrays, however they have the same size. And the R

Understanding svd usage and its necessity in generalized inverse calculation

2012 Dec 05

Understanding svd usage and its necessity in generalized inverse calculation

Dear R-devel: I could use some advice about matrix calculations and steps that might make for faster computation of generalized inverses. It appears in some projects there is a bottleneck at the use of svd in calculation of generalized inverses. Here's some Rprof output I need to understand. > summaryRprof("Amelia.out") $by.self self.time self.pct

standard errors for glm

2008 Aug 27

standard errors for glm

Hi, I'm currently using biglm package to compute GLM outputs on a very large dataset. But no function computes standard erros of predictions. I look in what is done in R, namely in the function predict.glm.R in stats package. In this function, we call predict.lm to compute the standard errors (line 51). The code of predict.lm (in lm.R) is very hard to understand. I wonder if there is any

glm-poisson fitting 400.000 records

2011 Oct 21

glm-poisson fitting 400.000 records

Hi, I am trying to fi a glm-poisson model to 400.000 records. I have tried biglm and glmulti but i have problems... can it really be the case that 400.000 are too many records??? I am thinking of using random samples of my dataset..... Many thanks, -- View this message in context: http://r.789695.n4.nabble.com/glm-poisson-fitting-400-000-records-tp3925100p3925100.html Sent from the R help

similar to: lean and mean lm/glm?