similar to: Collinearity in Linear Multiple Regression

Displaying 20 results from an estimated 7000 matches similar to: "Collinearity in Linear Multiple Regression"

2003 Jul 23
6
Condition indexes and variance inflation factors
Has anyone programmed condition indexes in R? I know that there is a function for variance inflation factors available in the car package; however, Belsley (1991) Conditioning Diagnostics (Wiley) notes that there are several weaknesses of VIFs: e.g. 1) High VIFs are sufficient but not necessary conditions for collinearity 2) VIFs don't diagnose the number of collinearities and 3) No one has
2002 Nov 05
2
Which columns give rise to linear dependency?
Short version If I have a data frame X and I suspect that there is a dependency between the columns how do I confirm that, and how do I tell which subset of columns is involved? ================================== Long version A colleague had been trying to use the SPSS RELIABILITY procedure. It told her that the determinant of the matrix was small. She asked me what that meant and I told her
2012 Apr 03
1
how to use condition indexes to test multi-collinearity
Dear Users, I try to calculate condition indexes and variance decomposition proportions in order to test for collinearity using colldiag() in perturb package, I got a large index and two variables with large variance decomposition proportions,but one of them is constant item.I also checked the VIF for that variable, the value is small.The result is as follows: Index intercept V1
2005 Apr 11
2
dealing with multicollinearity
I have a linear model y~x1+x2 of some data where the coefficient for x1 is higher than I would have expected from theory (0.7 vs 0.88) I wondered whether this would be an artifact due to x1 and x2 being correlated despite that the variance inflation factor is not too high (1.065): I used perturbation analysis to evaluate collinearity library(perturb)
2010 Mar 31
2
interpretation of p values for highly correlated logistic analysis
Dear list, I want to perform a logistic regression analysis with multiple categorical predictors (i.e., a logit) on some data where there is a very definite relationship between one predicator and the response/independent variable. The problem I have is that in such a case the p value goes very high (while I as a naive newbie would expect it to crash towards 0). I'll illustrate my problem
2008 Nov 20
1
Checking collinearity using lmer
I am running a logistic regression model with a random effect using lmer. I am uncertain how to check for collinearity between my parameters. I have already run cor() and linear regression for each combination of parameters, and all Rsqr values were <0.8….but I am analyzing ecological data so a 0.8 cutoff may be unrealistic. -is there a way to check variance inflation factors or tolerance
2005 Sep 13
3
Collineariy Diagnostics
Hi, and thanks for your help in order to do collinearity analysis I downloaded the perturb package. I run a lm (regression) and on that the ??calldiag?? commad to get condition numbers but i get the following message: the variable XY with modus ??numeric?? was not found (it does the same with all predictors despite all variables are numeric and exists). Can anyone tell me how can I go arround
2013 Feb 06
1
how to extract test for collinearity and constantcy used in lda
Hi everyone, I'm trying to vectorize an application of lda to each 2D slice of a 3D array, but am running into trouble: It seems there are quite a few 2D slices that trigger either the "variables are collinear" warning, or worse, trigger a "variable appears to be constant within groups" error and fails (i.e., ceases computation rather than skips bad slice). There are
2002 Jul 15
2
meaning of error message about collinearity
You are using a method that needs to estimate the covariance matrix of all the variables. If you have 80 variables, there are (80+1)*80/2 = 3240 variances and covariances to estimate. How many data points do you think you need to do that? Some people assume the covariance matrix is diagonal (i.e., assuming all the variables are uncorrelated). Even then you still have 80 variances to estimate.
2005 Jun 30
2
Finding out collinearity in regression
Hi, I am trying to find out a collinearity in explanatory variables with alias(). I creat a dataframe: dat <- ds[,sapply(ds,nlevels)>=2] dat$Y <- Response Explanatory variables are factor and response is continuous random variable. When I run a regression, I have the following error: fit <- aov( Y ~ . , data = dat) Error in "contrasts<-"(`*tmp*`, value =
2004 Jun 11
4
Regression query
Hi I have a set of data with both quantitative and categorical predictors. After scaling of response variable, i looked for multicollinearity (VIF values) among the predictors and removed the predictors who were hinding some of the other significant predictors. I'm curious to know whether the predictors (who are not significant) while doing simple 'lm' will be involved in
2009 Aug 16
1
How to deal with multicollinearity in mixed models (with lmer)?
Dear R users, I have a problem with multicollinearity in mixed models and I am using lmer in package lme4. From previous mailing list, I learn of a reply "http://www.mail-archive.com/r-help at stat.math.ethz.ch/msg38537.html" which states that if not for interpretation but just for prediction, multicollinearity does not matter much. However, I am using mixed model to interpret something,
2006 Jul 05
2
Colinearity Function in R
Is there a colinearty function implemented in R? I have tried help.search("colinearity") and help.search("collinearity") and have searched for "colinearity" and "collinearity" on http://www.rpad.org/Rpad/Rpad-refcard.pdf but with no success. Many thanks in advance, Peter Lauren.
2012 Jun 01
4
regsubsets (Leaps)
Hi i need to create a model from 250 + variables with high collinearity, and only 17 data points (p = 250, n = 750). I would prefer to use Cp, AIC, and/or BIC to narrow down the number of variables, and then use VIF to choose a model without collinearity (if possible). I realize that having a huge p and small n is going to give me extreme linear dependency problems, but I *think* these model
2010 Dec 03
2
difference between linear model & scatterplot matrix
Dear R-users, I'm studing a DB, structured like this (just a little part of my dataset): _____________________________________________________________________________________________________________ Site Latitude Longitude Year Tot-Prod Total_Density dmp Dendoudi-1 15.441964 -13.540179 2005 3271.16 1007 16993.25 Dendoudi-2 15.397321 -13.611607
2005 Mar 31
0
perturb package for evaluating collinearity
I've uploaded the R package "perturb" to CRAN. Perturb contains two programs for evaluating collinearity. "Colldiag" calculates condition indexes and variance decomposition proportions to detect and track down collinear sets of variables. "Perturb" takes a different approach. It re-estimates the model a specified number of times, adding random noise
2005 Mar 31
0
perturb package for evaluating collinearity
I've uploaded the R package "perturb" to CRAN. Perturb contains two programs for evaluating collinearity. "Colldiag" calculates condition indexes and variance decomposition proportions to detect and track down collinear sets of variables. "Perturb" takes a different approach. It re-estimates the model a specified number of times, adding random noise
2008 Oct 19
2
definition of "dffits"
R-users E-mail: r-help@r-project.org Hi! R-users. I am just wondering what the definition of "dffits" in R language is. Let me show you an simple example. function() { library(MASS) xx <- c(1,2,3,4,5) yy <- c(1,3,4,2,4) data1 <- data.frame(x=xx, y=yy) lm.out <- lm(y~., data=data1, x=T) lev1 <- lm.influence(lm.out)$hat sig1 <-
2013 Feb 05
3
Non linear programming: choose R that minimizes corr(y, x^R)
I am looking for a package that will allow me to choose R (a real number) that minimizes the correlation of y and x^R, i.e. find R such that corr(y,x^R) is minimized. Any suggestions for packages I might look at would be helpful. Thanks, John John David Sorkin M.D., Ph.D. Chief, Biostatistics and Informatics University of Maryland School of Medicine Division of Gerontology Baltimore VA
2003 Jun 30
1
Novice Questions
I'm writing a program to perform linear regressions to estimate the number of bank teller transactions per hour of various types based upon day of week, time of day, week of month and several prices. I've got about 25,000 records in my dataset, 85 columns of transaction counts (used 1 at a time), about 50 columns of binary indicators (day, week, pay period, hour, branch), and a half dozen