thr3ads.net - similar to: "aggregating variables with pca"

Displaying 20 results from an estimated 20000 matches similar to: "aggregating variables with pca"

2006 Jan 25

combining variables with PCA

hello R_team having perfomed a PCA on my fitted model with the function: data<- na.omit(dataset) data.pca<-prcomp(data,scale =TRUE), I´ve decided to aggregate two variables that are highly correlated. My first question is: How can I combine the two variables into one new predictor? and secondly: How can I predict with the newly created variable in a new dataset? Guess I need the

Limited number of principal components in PCA

2011 Jul 29

Limited number of principal components in PCA

Hi all, I am attempting to run PCA on a matrix (nrow=66, ncol=84) using 'prcomp' (stats package). My data (referred to as 'Q' in the code below) are separate river streamflow gaging stations (columns) and peak instantaneous discharge (rows). I am attempting to use PCA to identify regions of that vary together. I am entering the following command:

PCA with tow response variables

2009 Nov 04

PCA with tow response variables

Hi all, I'm new to PCA in R, so this might be a basical thing, but I cannot find anything on the net about it. I need to make a PCA plot with two response variables (df$resp1 and df$resp2) against eight metabolites (df$met1, df$met2, ...) and I don't have a clue how to do... and I've only used the simplest PCAs before, like this: pcaObj=prcomp(t(df[idx, c(40:47)]))

PCA error: svd(x, nu=0) infinite or missing values

2008 Jan 04

PCA error: svd(x, nu=0) infinite or missing values

Hi, I am trying to do a PCA on my data but I keep getting the error message svd(x, nu=0) infinite or missing values >From the messages posted on the subject, I understand that the NAs in my data might be the problem, but I thought na.omit would take care of that. Less than 5% of my cells are missing data. However, the NAs are not regularly distributed across my matrix: certain cases and

PCA: prcomp rotations

2011 Sep 28

PCA: prcomp rotations

Hi all, I think I may be confused by different people/programs using the word rotation differently. Does prcomp not perform rotations by default? If I understand it correctly retx=TRUE returns ordinated data, that I can plot for individual samples (prcomp()$x: which is the scaled and centered (rotated?) data multiplied by loadings). What does it mean that the data is rotated from the

Problems with principal components analysis PCA with prcomp

2006 Nov 16

Problems with principal components analysis PCA with prcomp

Dear friends, I am beginning to use R software in my academic research and I'm having some problems regarding the use of PCA. I have a table with 24445 rows and 9 columns, and I used the function prcomp() to do the analysis. Working with an example?: x<-read.table("test.txt", header=T) row.names(x)<-x[,1] x<-x[,-1] require(stats) pca<-prcomp(x, scale=T) names(pca) ##

PCA: Showing file datalabels on biplot

2010 Jan 25

PCA: Showing file datalabels on biplot

The script below successfully produces a biplot of the data but the 'site names' (rows) and the names of the 'response variables' (columns) are shown as simple numerals (rather than the column and row names). How might I 'enforce' the use of the row/column names used in the datafile (section of datafile shown below)? Can anyone help, please? Section of datafile sample a b

Factor rotation (e.g., oblimin, varimax) and PCA

2011 Jan 26

Factor rotation (e.g., oblimin, varimax) and PCA

A bit of a newbee to R and factor rotation I am trying to understand factor rotations and their implementation in R, particularly the GPArotation library. I have tried to reproduce some of the examples that I have found, e.g., I have taken the values from Jacksons example in "Oblimin Rotation", Encyclopedia of Biostatistics

PCA with n << p (was R-1.6.0 crashing on RedHat6.3)

2002 Oct 29

PCA with n << p (was R-1.6.0 crashing on RedHat6.3)

[Moderator's Note: This message needed manual interaction by me, since the attachment originally was declared as ``application/octet-stream'' even though it was only plain text. We do not allow octet-stream (aka binary!) attachments on our mailing list -- for virus/spam filtering reasons. -- MM] We have also encountered the problem Douglas

after PCA, the pc values are so large, wrong?

2009 Nov 07

after PCA, the pc values are so large, wrong?

rm(list=ls()) yx.df<-read.csv("c:/MK-2-72.csv",sep=',',header=T,dec='.') dim(yx.df) #get X matrix y<-yx.df[,1] x<-yx.df[,2:643] #conver to matrix mat<-as.matrix(x) #get row number rownum<-nrow(mat) #remove the constant parameters mat1<-mat[,apply(mat,2,function(.col)!(all(.col[1]==.col[2:rownum])))] dim(yx.df) dim(mat1) #remove columns with numbers of

Doing PCA

2007 Nov 29

Doing PCA

Hi Fellow R enthusiasts I have managed to reshape my data using a much shorter script than before. Woohoo However now I have new problems. The code is below. There are no problems with the create matrix section. The problem code is highlighted in bold. I am trying to do PCA on the data. Here are the errors. Error1 code : OGSscaled = rangescale(OGS) error message : Error in dim(newX) <-

Using PCA to correct p-values from snpMatrix

2011 Jan 03

Using PCA to correct p-values from snpMatrix

Hi R-help folks, I have been doing some single SNP association work using snpMatrix. This works well, but produces a lot of false positives, because of population structure in my data. I would like to correct the p-values (which snpMatrix gives me) for population structure, possibly using principle component analysis (PCA). My data is complicated, so here's a simple example of what

pca in R: Problem Fixed

2010 Feb 04

pca in R: Problem Fixed

Good day all. This is to thank all those who have helped in fixing this problem. Starting with a text book was indeed a problem, however, that gave me a clue of what I was looking for. This, with your contributions added to other materials I got on the net, put me on the right track. Thank you so much. Warmest regards Ogbos On 31 January 2010 14:07, S Ellison <S.Ellison@lgc.co.uk> wrote:

prcomp(X,center=F) ??

2009 Mar 08

prcomp(X,center=F) ??

I do not understand, from a PCA point of view, the option center=F of prcomp() According to the help page, the calculation in prcomp() "is done by a singular value decomposition of the (centered and possibly scaled) data matrix, not by using eigen on the covariance matrix" (as it's done by princomp()) . "This is generally the preferred method for numerical accuracy"

Condensed PCA Results

2007 Jun 27

Condensed PCA Results

Hello all, I'm currently using R to do PCA Analysis, and was wondering if anyone knew the specific R Code that could limit the output of the PCA Analysis so that you only get the Principal Component features as your output and none of the extraneous words or numbers that you don't want. If that was unclear, let me use linear regression as an example: "lm(y~x)" is the normal

PCA in Microarrays

2008 May 14

PCA in Microarrays

Dear useRs: I'm not sure if it's the correct place to ask but I'll try it out. I've been reading about how to perform Principal Component Analysis (PCA) in microarrays (see [1]) and there's something that I don't get it. Basically it's related with performing PCA over data sets which number of variables is greater than the number of samples. For example in the paper

biplots for PCA

2011 May 13

biplots for PCA

Hi all I have produced a biplot for a PCA (see attached pdf) that I ran however the names of the variables which are placed at the end of the arrows overlap and are thus unreadable. Similarly some of the numbered points overlap. I was wondering if there was a way to edit the biplot to move the label names and if not what the best alternative is. Thanks Anna pca<-biodata[,3:10]

environmental data as vector in PCA plots

2004 May 10

environmental data as vector in PCA plots

Hi, I want to include a vector representing the sites - environmental data correlation in a PCA. I currently use prcomp (no scaling) to perform the PCA, and envfit to retrieve the coordinates of the environmental data vector. However, the vector length is different from the one obtained in CAnoco when performing a species - environmental biplot (scaling -2). How can I scale the vector in order to

Principal component analysis PCA

2008 Feb 14

Principal component analysis PCA

Hi, I am trying to run PCA on a set of data with dimension 115*300,000. The columns represnt the snps and the row represent the individuals. so this is what i did. #load the data code<-read.table("code.txt", sep='\t', header=F, nrows=300000) # do PCA # pr<-prcomp(code, retx=T, center=T) I am getting the following error message "Error: cannot allocate vector of

Principal Component Analysis - Selecting components? + right choice?

2008 Dec 11

Principal Component Analysis - Selecting components? + right choice?

Dear R gurus, I have some climatic data for a region of the world. They are monthly averages 1950 -2000 of precipitation (12 months), minimum temperature (12 months), maximum temperature (12 months). I have scaled them to 2 km x 2km cells, and I have around 75,000 cells. I need to feed them into a statistical model as co-variates, to use them to predict a response variable. The climatic

similar to: aggregating variables with pca