I am just working through Faraway's excellent tutorial "Practical Regression and ANOVA using R".

On page 24 he makes the x matrix:

x <- cbind(1,gala[,-c(1,2)])

How can I understand this gala[,-c(1,2)]? I couldn't find an explanation of such "c-like" abbreviations anywhere. Thanks for a hint.

Another problem: I couldn't load the faraway library using the library() command, even though I specified the path. Do I need to put library files into a certain directory? I always got an error:

Error in library(faraway) : There is no package called `faraway'

Thanks a lot

christoph

--
Christoph Lehmann
Department of Psychiatric Neurophysiology
University Hospital of Clinical Psychiatry
Waldau
CH-3000 Bern 60
Switzerland
Phone: ++41 31 930 93 83
Mobile: ++41 31 570 28 00
Fax: ++41 31 930 99 61
Email: lehmann at puk.unibe.ch
Web: http://www.puk.unibe.ch/cl/pn_ni_cv_cl.html
ripley@stats.ox.ac.uk
2003-Feb-22  18:28 UTC
[R] faraway tutorial: cryptic command to newbie
On 22 Feb 2003, Christoph Lehmann wrote:

> I am just about working through Faraway's excellent tutorial "practical
> regression and ANOVA using R"

I assume this is a reference to the PDF version available via CRAN. I am afraid that is *not* a good discussion of how to do regression, especially not using R. That page is seriously misleading: good ways to compute regressions are QR decompositions with pivoting (which R uses) or an SVD. Solving the normal equations is well known to square the condition number, and is close to the worst possible way. (If you must use normal equations, do at least centre the columns, and preferably do some scaling.)

> on page 24 he makes the x matrix:
> x <- cbind(1,gala[,-c(1,2)])
>
> how can I understand this gala[,-c(1,2)])... I couldn't find an
> explanation of such "c-like" abbreviations anywhere.

Well, it is in all good books (as they say) including `An Introduction to R'. (It's even on page 210 of that book!)

-c(1,2) is (try it)

    > -c(1,2)
    [1] -1 -2

so this drops columns 1 and 2. It then adds in front a column made up of ones, which is usually a sign of someone not really understanding how R's linear models work.

> another problem: I couldn't load the faraway library, using the
> library() command, even though I specified the path, .. do I need to put
> library files into a certain directory? I always got an error: Error in
> library(faraway) : There is no package called `faraway'

Did you *install* the *package*? Is it a valid R package which has passed R CMD check?

--
Brian D. Ripley,                  ripley at stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595
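The indexing point above can be tried on a small example. The following is a minimal sketch, not the tutorial's own code: the data frame `d` and its numbers are made up here as a stand-in for `gala`, so that nothing beyond base R is needed. It shows that `-c(1,2)` drops the first two columns, and that the hand-built column of ones matches the intercept column that the formula interface creates by itself.

```r
# Hypothetical stand-in for the gala data (made-up numbers):
# a response, one column to be dropped, and two predictors.
d <- data.frame(Species   = c(10, 20, 30, 45),
                Endemics  = c(3, 6, 8, 12),
                Area      = c(1.5, 2.0, 0.5, 4.0),
                Elevation = c(100, 250, 80, 400))

idx <- -c(1, 2)        # unary minus is applied elementwise
print(idx)             # [1] -1 -2

preds <- d[, idx]      # negative indices drop columns 1 and 2
print(names(preds))    # "Area" "Elevation"

# The tutorial's construction: prepend a column of ones by hand.
x <- as.matrix(cbind(1, preds))

# The formula interface adds the intercept column automatically.
mm <- model.matrix(Species ~ Area + Elevation, data = d)

# Same numbers in both matrices (dimnames and attributes ignored).
same <- isTRUE(all.equal(unname(x), unname(mm), check.attributes = FALSE))
print(same)
```

In ordinary use one would simply call `lm(Species ~ Area + Elevation, data = d)` and never build the matrix by hand.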
Adaikalavan Ramasamy
2003-Feb-24  06:31 UTC
[R] faraway tutorial: cryptic command to newbie
c = concatenate / combine

Try help(c), or see the 'Vectors and assignment' section of An Introduction to R.

You have to put the folder 'faraway' into c:/.../R/rw1061/library/ where ... is your path to R, and rw1061 could be rw1062 etc., depending on your version.
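As a small illustration of both points, here is a hedged sketch assuming a current R installation rather than the rw1061-era Windows layout described above. It shows what `c()` returns and where `library()` looks for packages. The `install.packages()` call is only printed as a suggestion, on the assumption that the machine can reach CRAN, where the faraway package is distributed.

```r
v <- c(1, 2)               # c() combines its arguments into one vector
w <- c(v, 10, c(3, 4))     # nested vectors are flattened
print(w)                   # [1]  1  2 10  3  4

# Directories that library() searches for installed packages.
libs <- .libPaths()
print(libs)

# Check for the package without raising an error if it is absent.
have_faraway <- requireNamespace("faraway", quietly = TRUE)
if (!have_faraway) {
  message("faraway is not installed; try install.packages(\"faraway\")")
}
```

Once the package is installed into one of the directories listed by `.libPaths()`, `library(faraway)` should find it without any path being specified.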
That's an unfair criticism. That discussion was never intended as a recommendation for how to compute a regression. Of course, SVD or QR decompositions are the preferred method but many newbies don't want to digest all that right from the start. These are just obscure details to the beginner.

One of the strengths of R in teaching is that students can directly implement the formulae from the theory. This reinforces the connection between theory and practice. Implementing the normal equations directly is a quick early illustration of this connection. Explaining the precise details of how to fit a regression model is something that can be deferred.

Julian Faraway

>I assume this is a reference to the PDF version available via CRAN. I am
>afraid that is *not* a good discussion of how to do regression, especially
>not using R. That page is seriously misleading: good ways to compute
>regressions are QR decompositions with pivoting (which R uses) or an SVD.
>Solving the normal equations is well known to square the condition number,
>and is close to the worse possible way. (If you must use normal
>equations, do at least centre the columns, and preferably do some
>scaling.)
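The teaching use described here, implementing the formula from the theory and comparing it with the built-in fit, might look like the sketch below. It is an illustration on simulated data (the variable names and coefficients are made up for this example), not code from the tutorial. On a well-conditioned problem like this one the two sets of estimates should agree closely.

```r
set.seed(1)
n  <- 50
x1 <- rnorm(n)
x2 <- rnorm(n)
y  <- 1 + 2 * x1 - 0.5 * x2 + rnorm(n, sd = 0.1)

# Design matrix with an explicit column of ones for the intercept.
X <- cbind(1, x1, x2)

# Normal equations: solve (X'X) b = X'y for b.
b_ne <- drop(solve(crossprod(X), crossprod(X, y)))

# Built-in fit; lm() works from a QR decomposition of the design matrix.
b_lm <- coef(lm(y ~ x1 + x2))

print(cbind(normal_eq = b_ne, lm = b_lm))
```

Using `solve(A, b)` rather than forming an explicit inverse keeps the sketch close to the formula while avoiding one unnecessary source of rounding error.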
I'm no expert in these matters, but I'll toss in my $0.02 anyway.

My recollection from reading Golub & Van Loan a few years ago is that there's quite a bit of controversy as to the "best" approach to least squares. Just recently I've read Monahan's "Numerical Methods in Statistics", which has three relevant chapters (including one titled "Regression Computations"). In it, several approaches were presented: QR-Householder, QR-Givens, SVD, MCD, sweep, etc. The conclusion drawn was that no single method is the best for all problems, and the task of writing a regression routine is best avoided unless the workhorse routines in stat packages are not satisfactory (in terms of speed, storage requirements, etc.).

My impression is that, with the glaring exception of SAS (which uses sweep, if I'm not mistaken), most stat packages use QR, as a good compromise between stability and speed.

Andy

> -----Original Message-----
> From: Chong Gu [mailto:chong at stat.purdue.edu]
> Sent: Monday, February 24, 2003 1:51 PM
> To: Julian Faraway
> Cc: r-help at stat.math.ethz.ch
> Subject: Re: [R] faraway tutorial: cryptic command to newbie
>
> Not only is it unfair criticism, it's probably also imprecise
> information.
>
> For a detailed discussion of the precision of regression estimates
> through QR decomposition and normal equations, one may consult Golub
> and Van Loan's book on Matrix Computations (1989, Section 5.3.9 on page
> 230). QR takes twice as much computation, requires more memory, but
> does NOT necessarily provide better precision.
>
> The above said, I am not questioning the adequacy of the QR approach
> to regression calculation as implemented in R.
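The two numerical claims made earlier in the thread, that forming X'X squares the condition number and that centring the columns helps, can be checked on a toy design matrix. The sketch below is only an illustration under made-up assumptions: the helper `cond2` and the predictor values are invented for this example. It does not settle the question of which method gives better precision on a given problem.

```r
# 2-norm condition number: ratio of largest to smallest singular value.
# (cond2 is a helper defined for this example, not a base R function.)
cond2 <- function(m) {
  d <- svd(m)$d
  max(d) / min(d)
}

# A predictor far from zero with a small spread, plus an intercept column.
x <- seq(100, 101, length.out = 20)
X <- cbind(1, x)

k_X   <- cond2(X)              # condition number of the design matrix
k_XtX <- cond2(crossprod(X))   # condition number of X'X

# Centring the predictor makes it orthogonal to the intercept column.
Xc   <- cbind(1, x - mean(x))
k_Xc <- cond2(Xc)

print(c(X = k_X, XtX = k_XtX, X_squared = k_X^2, centred = k_Xc))
```

The printed values should show `XtX` matching `X_squared`, and the centred matrix having a much smaller condition number than the raw one.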