thr3ads.net - similar to: "tapply"

Displaying 20 results from an estimated 1000 matches similar to: "tapply"

2007 Jun 26

aggregating daily values

Hi, I swear I have read almost all the posted messages about this issue, but it's evident I couldn't find an answer (surely esay) to my problem. What I want is the following: Make 8 days aggregates from a daily series like this (dput output): structure(c(6.91777181625366, 0.79051125049591, 9.00625133514404, 9.86966037750244, 14.4326181411743, 3.70155477523804, 9.67768573760986,

intersect more than two sets

2007 Apr 24

intersect more than two sets

Hi, I searched the archives and did not find a good solution to that. assume I have 10 sets and I want to have the common character elements of them. how could i do that? -- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..." ---Matrix III

network package in R

2011 May 27

network package in R

Hi there, I need a network builder and it can change the node size and color; I am not sure if network package in R can do this or not. The other functions I wanted have been found in that package. BTW, if there is another package in R relating to this, please suggest too. Thanks, Weiwei -- Weiwei Shi, Ph.D Research Scientist "Did you always know?" "No, I did not. But I

how to reverse a list

2007 Apr 11

how to reverse a list

Hi, there: I am wondering if there is a quick way to "reverse" a list like this: t0 <- list(a=1, b=1, c=2, d=1) reverst t0 to t1 > t1 $`1` [1] "a" "b" "d" $`2` [1] "c" thanks. -- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..." ---Matrix III

cluster in R

2006 Oct 17

cluster in R

hi, is there some good summary on clustering methods in R? It seems there are many packages involving it. And I have two questions on clustering here: 1. Is there a way of evaluate the effecitives (or seperation) of clustering (rather than by visualization)? 2. Is there a search method (like genetic search) which can help find the best subset of attributes which gives best seperation? Thanks,

a string to enviroment or function

2007 Jun 25

a string to enviroment or function

Hi, I am wondering how to make a function Fun to make the following work: t0 <- (paste("hgu133a", "ENTREZID", sep="")) xx <- as.list(Fun(t0)) # make it work like xx<-as.list(hgu133aENTREZID) thanks, -- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..." ---Matrix III

how to split data.frame by row?

2007 Oct 29

how to split data.frame by row?

hi, if I have 20 x 3 data.frame, how to split it into 10 x 6 (moving the lower part of 10x3 to column) or 5 x 12 thanks -- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..." ---Matrix III

how to keep row name if there is only one row selected from a data frame

2009 Jul 13

how to keep row name if there is only one row selected from a data frame

Hi, there: Assume I have a dataframe with rownames like A with rownames like a to e, > A [,1] [,2] a 1 6 b 2 7 c 3 8 d 4 9 e 5 10 when I use A[1,], I lost the rowname for it, like below. How could I keep it? Is there an easy way instead that I have to modify by myself after I used A[1,] manually. > A[1,] [1] 1 6 Thanks, W. -- Weiwei Shi, Ph.D Research

2008 Aug 24

similarity between two gene lists with varied length

Dear listers, a little off-topic: I am looking for and compare algorithms which can calculate "distance" or "similarity" between two gene lists with different lengths. Any paper, any implementation in R and any suggestion is welcome! Thanks, -- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..."

how to plot this?

2007 Jun 25

how to plot this?

Hi, there: Suppose I have a couple of data.frames and each one has five columns (one for x-axis, two for y-axis and two for std of y's.) There is another dimensions (besides x and y) which is continuous. My question is, how to plot such series of data frames in one plot (thus, 3-dimensional plot) AND multiple 2-D plots. I am not familar with R's plotting utilities. Thanks. -- Weiwei

memory problem in handling large dataset

2005 Oct 27

memory problem in handling large dataset

Dear Listers: I have a question on handling large dataset. I searched R-Search and I hope I can get more information as to my specific case. First, my dataset has 1.7 billion observations and 350 variables, among which, 300 are float and 50 are integers. My system has 8 G memory, 64bit CPU, linux box. (currently, we don't plan to buy more memory). > R.version _ platform

read.table

2005 Jul 13

read.table

Hi, I have a question on read.table. I have a dataset with 273,000 lines and 195 columns. I used the read.table to load the data into R: trn<-read.table('train1.dat', header=F, sep='|', na.strings='.') I found it takes forever. then I run 1/10 of the data (test) using read.table again. And this time it finished quickly. So, there might be something wrong in my data

have to point it out again: a distribution question

2005 Apr 28

have to point it out again: a distribution question

Stock returns and other financial data have often found to be heavy-tailed. Even Cauchy distributions (without even a first absolute moment) have been entertained as models. Your qq function subtracts numbers on the scale of a normal (0,1) distribution from the input data. When the input data are scaled so that they are insignificant compared to 1, say, then you get essentially the

Random Forest

2007 Apr 23

Random Forest

Hi, I am trying to print out my confusion matrix after having created my random forest. I have put in this command: fit<-randomForest(MMS_ENABLED_HANDSET~.,data=dat,ntree=500,mtry=14, na.action=na.omit,confusion=TRUE) but I can't get it to give me the confusion matrix, anyone know how this works? Thansk! Ruben [[alternative HTML version deleted]]

dlda{supclust} 's output

2007 May 01

dlda{supclust} 's output

Hi, I am using dlda algorithm from supclust package and I am wondering if the output can be a continuous probability instead of discrete class label (zero or one) since it puts some restriction on convariance matrix, compared with lda, while the latter can. thanks, -- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..."

question on write.table

2005 Dec 15

question on write.table

Hi, I have a question on write.table: I have a data.frame called t7 as below: > dim(t7) [1] 14015184 6 > t7[1:5,] uci uce par line graphical.forms stems 1 0 0 0 0 active activ 2 0 0 0 0 policy polici 3 0 0 0 0 wc PC 4 0 0 0 0 eff elf 5 0 0 0 0 icn ICC I want to write the

a problem in random forest

2005 Oct 11

a problem in random forest

Hi, there: I spent some time on this but I think I really cannot figure it out, maybe I missed something here: my data looks like this: > dim(trn3) [1] 7361 209 > dim(val3) [1] 7427 209 > mg.rf2<-randomForest(x=trn3[,1:208], y=trn3[,209], data=trn3, xtest=val3[, 1:208], ytest=val3[,209], importance=T) my test data has 7427 observations but after prediction, > dim(mg.rf2$votes)

generalized linear model and missing handling

2005 Oct 04

generalized linear model and missing handling

Hi, I have a dataset and want to build a generalized linear model on it. Unfortunately, complete.cases(df) returns null, which means I have to find a way to "fill" those missings. One way is following my previous post to use median to replace(or use most freq. of level to replace for catergorical case), but I am wondering if there are other ways, when glm or something like it is

pretty report

2007 Jun 12

pretty report

Dear Listers: I have a couple of data frames to report and each corresponds to different condtions, e.g. conditions=c(10, 15, 20, 25). In this examples, four data frames need to be exported in a "pretty" report. I knew Perl has some module for exporting data to Excel and after googling, I found R does not. So I am wondering if there is a package in R for generating good reports. I

source a specific function

2007 Jun 18

source a specific function

Dear Listers: For example, if I have a .R source file which has more than one function, and I want to just load only one of the functions, how could I do that? (removing the rest after sourcing is not what I intend b/c in my workspace, I might have some of the rest and I don't want to change my workspace: i.e., I only change my workspace by adding one function from a R source file). Thanks,

similar to: tapply