thr3ads.net - similar to: "function to include factors in summary data frame"

Displaying 20 results from an estimated 20000 matches similar to: "function to include factors in summary data frame"

modify and append new rows to a data.frame using ddply

2013 Jun 10

Hi, I have a data.frame that contains a variable act which records the duration (in seconds) of two states (wet-dry) for several individuals (identified by Ring) over a period of time. Since I want to work with daytime (i.e. from sunrise till sunset) and night time (i.e. from sunset till next sunrise), I have to split act from time[i] till sunset and from sunset until time[i+1], and from time[k]

data.frame operations

2011 Feb 25

Hi All, I've been doing circles with this all morning, although I'm sure there's a simple solution. I have a data.frame "pop" as follows:

> pop
   ind xloc yloc gonad
1    1    5    2    12
2    1    5    4    25
3    1    3    3    30
4    1    5    4    10
5    1    1    4    08
.    .    .    .     .
10   1    1    4    11

And I need the following steps: 1. Sum

extract data for specific levels factor

2011 Oct 25

Dear all, I'm trying to analyze data with the following structure: ind cat tx age 40.2 por fol peq vh 35 41.9 por fol med vh 35 68.9 por fol preov vh 35 71.5 por fol peq ser 37 67.5 por fol med ser 37 76.9 por fol preov ser 37 78.7 por fol peq otr 37 78.3 por fol med otr 37 82.1 por fol

using ddply but preserving some of the outside data

2009 Aug 05

I have a bit of a quandary. I'm working with a data set for which I have sampled sites at a variety of dates. I want to use this data to get a running average of the sampled values for the current and previous date. I originally thought something like ddply would be ideal for this; however, I cannot break up my data by date, and then apply a function that requires information

MASS fitdistr with plyr or data.table?

2011 Apr 27

I am trying to extract the shape and scale parameters of a wind speed distribution for different sites. I can do this in a clunky way, but I was hoping to find a way using data.table or plyr. However, when I try I am met with the following:

set.seed(144)
weib.dist<-rweibull(10000,shape=3,scale=8)
weib.test<-data.table(cbind(1:10,weib.dist))

extracting the last row of each group in a data frame

2009 Nov 16

Hi, I would like to extract the last row of each group in a data frame. The data frame is as follows:

Name Value
A    1
A    2
A    3
B    4
B    8
C    2
D    3

I would like to get a data frame as:

Name Value
A    3
B    8
C    2
D    3

Thank you for your suggestions in advance.
Jeff

Using by() and stacking back sub-data frames to one data frame

2009 Jun 25

Dear all, I have code where I subset a data frame to match entries within levels of a factor (actually, the full script uses three different factors to do that). I'm very happy with the precision with which I can work with R, but since I loop over factor levels, and the data frame is big, the process is slow. So I've been trying to speed up the process using by(), but I got stuck at

Subset according to groups NA proportion within specific variables

2011 Feb 21

Dear R-List, I have a dataframe with one grouping variable (x) and three response variables (y,z,w).

df<-data.frame(x=c(rep(1,3),rep(2,4),rep(3,5)),y=rnorm(12),z=c(3,4,5,NA,NA,NA,NA,1,2,1,2,1),w=c(1,2,3,3,4,3,5,NA,5,NA,7,8))

> df
x           y z w
1  0.29306106 3 1
1  0.54797780 4 2
1 -1.38365548 5 3
2 -0.20407986

standardizing one variable by dividing each value by the mean - but within levels of a factor

2010 Jan 20

Hello! I have a data frame with a factor and a numeric variable:

x<-data.frame(factor=c("b","b","d","d","e","e"),values=c(1,2,10,20,100,200))

For each level of "factor", I would like to divide each value of "values" by the mean of "values" that corresponds to that level of "factor". In other

Aggregate certain rows in a matrix

2010 Sep 06

Hi, I have a matrix that looks like this:

a <- c(1,1,1,1,2,2,3,3,3,3)
b <- c(2,2,2,3,4,4,4,5,5,6)
c <- c(1,2,3,4,5,6,7,8,9,10)
M <- matrix(nr=10,nc=3)
M[,1] <- a
M[,2] <- b
M[,3] <- c

> M
     [,1] [,2] [,3]
[1,]    1    2    1
[2,]    1    2    2
[3,]    1    2    3
[4,]    1    3    4
[5,]    2    4    5
[6,]    2    4    6
[7,]    3    4    7

converting the string columns in a data.frame to factors?

2011 Feb 28

Dear All, I'm not sure if I understand the parameter stringsAsFactors correctly. I'm trying to convert the string columns in aframe1 to factors, but it seems stringsAsFactors=T in as.data.frame() doesn't do anything. Could anybody let me know the correct way to convert strings to factors?

> aframe1=data.frame(x=LETTERS[1:10], y=LETTERS[1:10], stringsAsFactors=F)
>

Apply a function according to factor levels.

2012 Aug 17

Dear all, I have a question on applying a function to the data according to factor levels. For example, for the data below, what is the best way to apply a function to "values" according to different levels of "samples" (1,2,3,4,5)?

   values ind sample
1 0.03325   1      1
2 0.03305   1      1
3 0.03185   1      1
4 0.03515   1      1
5 0.03375   1      1
6 0.01180

data frames; matching/merging

2010 Feb 08

Hi all, I'm feeling a little guilty to ask this question, since I've written a solution using a rather clunky for loop that gets the job done. But I'm convinced there must be a faster (and probably more elegant) way to accomplish what I'm looking to do (perhaps using the "merge" function?). I figured somebody out there might've already figured this out: I have

Change case of factor in data frame

2012 Dec 02

I am trying to write a function to change the case of all of the text in a data frame to lower case. I do not have foreknowledge of the data frame names or the data types of each column. It seems that if one references the data frame by index, then it returns class "data.frame" but if it is referenced by name, it returns class "factor" or whatever the column actually is:

nested model with random factors

2009 Jul 08

I am really having trouble with getting the right syntax for my model. Here is a truncated version of my data:

> data
    Ind Treatment Order      Date        PC1
1 PER14        SC   3rd  4-May-09  0.5704611
2 PER14        SH   1st 26-Apr-09  0.5329025
3 PER14        AC   2nd 29-Apr-09  2.1392279
4 PER25        SC   2nd 29-Apr-09 -0.2083382
5 PER25        SH   3rd  3-May-09  3.7818356
6

Assignment problem

2006 Jun 16

Hello list, I have a very simple question about matrix assignment. I did this:

res<-1:30
dim(res)<-c(5,6)
ind<-1:6

Now I want to assign the value of the variable ind to the first column of the matrix res, like

res[,1]<-ind

but this code gives an error. Actually, I have a for loop and the value of ind changes every time (6 times in total); I want to assign every value

Extract complete rows by group and maximum

2011 Apr 28

Hi, I'm trying to extract complete rows from a dataframe by group, based on the maximum in a column within that group. Thus I have a dataframe:

cvd_basestudy ... es_time ...
study1        ... 0.3091667
study2        ... 0.3091667
study2        ... 0.2625000
study3        ... 0.3033333
study3        ... 0.2625000
etc

I can extract the basestudy and the max(es_time)

struggling with "split" function

2009 Sep 06

I am very sorry for such a simple question, but I am struggling with "split". I have the following data frame: x<-data.frame(A=c(NA,NA,NA,NA,"split",NA,NA,NA,NA,"split",NA,NA,NA,NA,"split",NA,NA,NA,NA),

sum specific rows in a data frame

2010 Apr 14

I have a data frame called "pose":

  DESCRIPTION        QUANITY CLOSING.PRICE
1 WHEAT May/10             1        467.75
2 WHEAT May/10             2        467.75
3 WHEAT May/10             1        467.75
4 WHEAT May/10             1        467.75
5 COTTON NO.2 May/10       1         78.13
6 COTTON NO.2 May/10       3         78.13
7 COTTON NO.2 May/10       1         78.13

Writing a summary file in R

2011 Jul 27

Hello, I have an input file, testOut.txt (http://r.789695.n4.nabble.com/file/n3700031/testOut.txt), where column 1 is the chromosome, column 2 is the start of the region, column 3 is the end of the region, columns 4 and 5 are base positions, column 6 is total reads, column 7 is methylation data, and column 8 is the strand. I would like a summary output file such as:
