Displaying 20 results from an estimated 20000 matches similar to: "aggregating variables with pca"
2006 Jan 25
1
combining variables with PCA
hello R_team
having perfomed a PCA on my fitted model with the function:
data<- na.omit(dataset)
data.pca<-prcomp(data,scale =TRUE),
I´ve decided to aggregate two variables that are highly correlated.
My first question is:
How can I combine the two variables into one new predictor?
and secondly:
How can I predict with the newly created variable in a new dataset?
Guess I need the
2011 Jul 29
1
Limited number of principal components in PCA
Hi all,
I am attempting to run PCA on a matrix (nrow=66, ncol=84) using 'prcomp'
(stats package). My data (referred to as 'Q' in the code below) are
separate river streamflow gaging stations (columns) and peak instantaneous
discharge (rows). I am attempting to use PCA to identify regions of that
vary together.
I am entering the following command:
2009 Nov 04
2
PCA with tow response variables
Hi all,
I'm new to PCA in R, so this might be a basical thing, but I cannot find anything on the net about it.
I need to make a PCA plot with two response variables (df$resp1 and df$resp2) against eight metabolites (df$met1, df$met2, ...) and I don't have a clue how to do... and I've only used the simplest PCAs before, like this:
pcaObj=prcomp(t(df[idx, c(40:47)]))
2008 Jan 04
1
PCA error: svd(x, nu=0) infinite or missing values
Hi,
I am trying to do a PCA on my data but I keep getting the error message
svd(x, nu=0) infinite or missing values
>From the messages posted on the subject, I understand that the NAs in my
data might be the problem, but I thought na.omit would take care of that.
Less than 5% of my cells are missing data. However, the NAs are not
regularly distributed across my matrix: certain cases and
2011 Sep 28
0
PCA: prcomp rotations
Hi all,
I think I may be confused by different people/programs using the word
rotation differently.
Does prcomp not perform rotations by default?
If I understand it correctly retx=TRUE returns ordinated data, that I can
plot for individual samples (prcomp()$x: which is the scaled and centered
(rotated?) data multiplied by loadings).
What does it mean that the data is rotated from the
2006 Nov 16
1
Problems with principal components analysis PCA with prcomp
Dear friends,
I am beginning to use R software in my academic research and I'm having some
problems regarding the use of PCA.
I have a table with 24445 rows and 9 columns, and I used the function
prcomp() to do the analysis.
Working with an example?:
x<-read.table("test.txt", header=T)
row.names(x)<-x[,1]
x<-x[,-1]
require(stats)
pca<-prcomp(x, scale=T)
names(pca)
##
2010 Jan 25
1
PCA: Showing file datalabels on biplot
The script below successfully produces a biplot of the data but the 'site
names' (rows) and the names of the 'response variables' (columns) are shown
as simple numerals (rather than the column and row names). How might I
'enforce' the use of the row/column names used in the datafile (section of
datafile shown below)?
Can anyone help, please?
Section of datafile
sample a b
2011 Jan 26
1
Factor rotation (e.g., oblimin, varimax) and PCA
A bit of a newbee to R and factor rotation I am trying to understand
factor rotations and their implementation in R, particularly the
GPArotation library.
I have tried to reproduce some of the examples that I have found, e.g., I
have taken the values from Jacksons example in "Oblimin Rotation",
Encyclopedia of Biostatistics
2002 Oct 29
0
PCA with n << p (was R-1.6.0 crashing on RedHat6.3)
[Moderator's Note: This message needed manual interaction by me,
since the attachment originally was declared as
``application/octet-stream'' even though it was only plain
text. We do not allow octet-stream (aka binary!)
attachments on our mailing list -- for virus/spam filtering
reasons. -- MM]
We have also encountered the problem Douglas
2009 Nov 07
1
after PCA, the pc values are so large, wrong?
rm(list=ls())
yx.df<-read.csv("c:/MK-2-72.csv",sep=',',header=T,dec='.')
dim(yx.df)
#get X matrix
y<-yx.df[,1]
x<-yx.df[,2:643]
#conver to matrix
mat<-as.matrix(x)
#get row number
rownum<-nrow(mat)
#remove the constant parameters
mat1<-mat[,apply(mat,2,function(.col)!(all(.col[1]==.col[2:rownum])))]
dim(yx.df)
dim(mat1)
#remove columns with numbers of
2007 Nov 29
0
Doing PCA
Hi Fellow R enthusiasts
I have managed to reshape my data using a much shorter script than before. Woohoo
However now I have new problems. The code is below. There are no problems with the create matrix section.
The problem code is highlighted in bold. I am trying to do PCA on the data. Here are the errors.
Error1
code : OGSscaled = rangescale(OGS)
error message : Error in dim(newX) <-
2011 Jan 03
0
Using PCA to correct p-values from snpMatrix
Hi R-help folks,
I have been doing some single SNP association work using snpMatrix. This works
well, but produces a lot of false positives, because of population structure in
my data. I would like to correct the p-values (which snpMatrix gives me) for
population structure, possibly using principle component analysis (PCA).
My data is complicated, so here's a simple example of what
2010 Feb 04
0
pca in R: Problem Fixed
Good day all.
This is to thank all those who have helped in fixing this problem. Starting
with a text book was indeed a problem, however, that gave me a clue of what
I was looking for. This, with your contributions added to other materials I
got on the net, put me on the right track. Thank you so much.
Warmest regards
Ogbos
On 31 January 2010 14:07, S Ellison <S.Ellison@lgc.co.uk> wrote:
2009 Mar 08
2
prcomp(X,center=F) ??
I do not understand, from a PCA point of view, the option center=F
of prcomp()
According to the help page, the calculation in prcomp() "is done by a
singular value decomposition of the (centered and possibly scaled) data
matrix, not by using eigen on the covariance matrix" (as it's done by
princomp()) .
"This is generally the preferred method for numerical accuracy"
2007 Jun 27
1
Condensed PCA Results
Hello all,
I'm currently using R to do PCA Analysis, and was wondering if anyone knew the
specific R Code that could limit the output of the PCA Analysis so that you
only get the Principal Component features as your output and none of the
extraneous words or numbers that you don't want.
If that was unclear, let me use linear regression as an example:
"lm(y~x)" is the normal
2008 May 14
1
PCA in Microarrays
Dear useRs:
I'm not sure if it's the correct place to ask but I'll try it out. I've been
reading about how to perform Principal Component Analysis (PCA) in
microarrays (see [1]) and there's something that I don't get it. Basically
it's related with performing PCA over data sets which number of variables is
greater than the number of samples. For example in the paper
2011 May 13
2
biplots for PCA
Hi all
I have produced a biplot for a PCA (see attached pdf) that I ran however the
names of the variables which are placed at the end of the arrows overlap and
are thus unreadable. Similarly some of the numbered points overlap. I was
wondering if there was a way to edit the biplot to move the label names and
if not what the best alternative is.
Thanks
Anna
pca<-biodata[,3:10]
2004 May 10
1
environmental data as vector in PCA plots
Hi,
I want to include a vector representing the sites - environmental data
correlation in a PCA.
I currently use prcomp (no scaling) to perform the PCA, and envfit to
retrieve the coordinates of the environmental data vector. However, the
vector length is different from the one obtained in CAnoco when performing
a species - environmental biplot (scaling -2). How can I scale the vector
in order to
2008 Feb 14
1
Principal component analysis PCA
Hi,
I am trying to run PCA on a set of data with dimension 115*300,000. The
columns represnt the snps and the row represent the individuals. so this is
what i did.
#load the data
code<-read.table("code.txt", sep='\t', header=F, nrows=300000)
# do PCA #
pr<-prcomp(code, retx=T, center=T)
I am getting the following error message
"Error: cannot allocate vector of
2008 Dec 11
2
Principal Component Analysis - Selecting components? + right choice?
Dear R gurus,
I have some climatic data for a region of the world. They are monthly averages
1950 -2000 of precipitation (12 months), minimum temperature (12 months),
maximum temperature (12 months). I have scaled them to 2 km x 2km cells, and
I have around 75,000 cells.
I need to feed them into a statistical model as co-variates, to use them to
predict a response variable.
The climatic