Tal Galili
2010-May-05 11:19 UTC
[R] OLS Regression diagnostic measures check list - what to consider?
Hello dear R help list, I wish to compile a check-list for diagnostic measures for OLS regression. My question: Can you offer more (or newer) tests/measures for the validity of a linear model then what is given here: http://www.statmethods.net/stats/rdiagnostics.html This resource gives a list of measures to test for: OUTLIERS, INFLUENTIAL OBSERVATIONS, NON-NORMALITY, NON-CONSTANT ERROR VARIANCE, MULTI-COLLINEARITY, NONLINEARITY, NON-INDEPENDENCE OF ERRORS and some global validation. I came across it after searching online for ways to validate a regression model. Although this is a great list, I am wondering if there is any newer methods that are overlooked, or important consideration to take into account that are not described in that page. Thanks, Tal ----------------Contact Details:------------------------------------------------------- Contact me: Tal.Galili@gmail.com | 972-52-7275845 Read me: www.talgalili.com (Hebrew) | www.biostatistics.co.il (Hebrew) | www.r-statistics.com (English) ---------------------------------------------------------------------------------------------- [[alternative HTML version deleted]]
Greg Snow
2010-May-05 18:29 UTC
[R] OLS Regression diagnostic measures check list - what to consider?
First a note, while that is a nice list, I think it needs a disclaimer about
only running tests that answer a meaningful question for the data/problem being
studied. If all those tests are run on datasets, I would be most suspicious of
those datasets which passed all the tests. Also, failing some of those tests
does not mean that there is a problem with the regression model or its
inferences.
This leads to what I think needs to be included on such lists (or replace such
lists): The methods described in the paper:
Buja, A., Cook, D. Hofmann, H., Lawrence, M. Lee, E.-K., Swayne,
D.F and Wickham, H. (2009) Statistical Inference for exploratory
data analysis and model diagnostics Phil. Trans. R. Soc. A 2009
367, 4361-4383 doi: 10.1098/rsta.2009.0120
Which in short says to create several plots, one is the residual (or other) plot
from the real data, the rest are based on simulated data that fulfills all the
assumptions. If you cannot tell which plot is "real", then any
violations of the assumptions are not practically significant.
The vis.test function in the TeachingDemos package implements a version of this
test.
--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
greg.snow at imail.org
801.408.8111
> -----Original Message-----
> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-
> project.org] On Behalf Of Tal Galili
> Sent: Wednesday, May 05, 2010 5:20 AM
> To: r-help at r-project.org
> Subject: [R] OLS Regression diagnostic measures check list - what to
> consider?
>
> Hello dear R help list,
>
> I wish to compile a check-list for diagnostic measures for OLS
> regression.
>
> My question:
> Can you offer more (or newer) tests/measures for the validity of a
> linear
> model then what is given here:
> http://www.statmethods.net/stats/rdiagnostics.html
>
> This resource gives a list of measures to test for:
> OUTLIERS, INFLUENTIAL OBSERVATIONS, NON-NORMALITY, NON-CONSTANT ERROR
> VARIANCE, MULTI-COLLINEARITY, NONLINEARITY, NON-INDEPENDENCE OF ERRORS
> and
> some global validation.
>
> I came across it after searching online for ways to validate a
> regression
> model.
> Although this is a great list, I am wondering if there is any newer
> methods
> that are overlooked, or important consideration to take into account
> that
> are not described in that page.
>
>
> Thanks,
> Tal
>
>
>
> ----------------Contact
> Details:-------------------------------------------------------
> Contact me: Tal.Galili at gmail.com | 972-52-7275845
> Read me: www.talgalili.com (Hebrew) | www.biostatistics.co.il (Hebrew)
> |
> www.r-statistics.com (English)
> -----------------------------------------------------------------------
> -----------------------
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-
> guide.html
> and provide commented, minimal, self-contained, reproducible code.
Liviu Andronic
2010-May-05 19:55 UTC
[R] OLS Regression diagnostic measures check list - what to consider?
On 5/5/10, Tal Galili <tal.galili at gmail.com> wrote:> Although this is a great list, I am wondering if there is any newer methods > that are overlooked, or important consideration to take into account that > are not described in that page. >Two on-line resources would be "REGRESSION DIAGNOSTICS" by John Fox [1] and ?Practical Regression and Anova using R? by Julian Faraway [2]. Regards Liviu [1] http://socserv.socsci.mcmaster.ca/jfox/Courses/Brazil-2009/index.html [2] http://cran.r-project.org/doc/contrib/Faraway-PRA.pdf