thr3ads.net - similar to: "Assigning cases to groupings based on the values of several variables"

Displaying 20 results from an estimated 10000 matches similar to: "Assigning cases to groupings based on the values of several variables"

formatting a 6 million row data set; creating a censoring variable

2011 Aug 31

formatting a 6 million row data set; creating a censoring variable

List, Consider the following data. gender mygroup id 1 F A 1 2 F B 2 3 F B 2 4 F B 2 5 F C 2 6 F C 2 7 F C 2 8 F D 2 9 F D 2 10 F D 2 11 F D 2 12 F D 2 13 F D 2 14 M A 3 15 M A 3 16 M A 3 17

data manipulation and summaries with few million rows

2011 Aug 24

data manipulation and summaries with few million rows

I have a data set with about 6 million rows and 50 columns. It is a mixture of dates, factors, and numerics. What I am trying to accomplish can be seen with the following simplified data, which is given as dput output below. > head(myData) mydate gender mygroup id 1 2012-03-25 F A 1 2 2005-05-23 F B 2 3 2005-09-08 F B 2 4 2005-12-07 F B 2

code works in R desktop but not iin RWeb - How do I modify to get it working in RWeb, please?

2008 Oct 21

code works in R desktop but not iin RWeb - How do I modify to get it working in RWeb, please?

Hi, How are you? I have a quick question.... I have code that works perfectly with R desktop, but does not work with RWeb. Could you please tell me how to modify the code below so it will work with RWeb? #Read in txt file happyguys<-read.table("c:/test8.txt", header=TRUE, row.names=1) #Subset the txt file to only include certain values test<-subset(happyguys, GRADE == 7

F values from a Repeated Measures aov

2008 Apr 28

F values from a Repeated Measures aov

Hi Folks, I have repeated measures for data on association time (under 2 acoustic condtions) in male and female frogs as they grow to adulthood (6 timepoints). Thus, two within-subject variables (Acoustic Condition: 2 levels, Timepoint: 6 levels) and one between-subject variable (Sex:male or female). I am pretty sure my distributions depart from normality but I would first like to simply run a

how to plot two histograms overlapped in the same plane coordinate

2010 Jul 09

how to plot two histograms overlapped in the same plane coordinate

Dear R-help listers, I am new. I just want to get helps on how to plot two histograms overlapped in the same plane coordinate. What I did is very ugly. Could you please help me to improve it? I want to got a plot with semi- transparent overlapping region. And, I want to know how to specify the filled colors of the different histograms. I also prefer other solutions other than ggplot2. Many

lrm in Design package--missing value where TRUE/FALSE needed

2009 Sep 04

lrm in Design package--missing value where TRUE/FALSE needed

Hi, A error message arose while I was trying to fit a ordinal model with lrm() I am using R 2.8 with Design package. Here is a small set of mydata: RC RS Sex CovA CovB CovC CovD CovE 2 1 0 1 1 0 -0.005575280 2 2 1 0 1 0 1 -0.001959580 2 3 0 0 0 1 0 -0.004725880 2 0 0 0 1 0 0 -0.005504850 2 2 1 1 0 0 0 -0.003880170 1 2 1 0 0 1 0 -0.006074230 2 2 1 0 0 1 1 -0.003963920 2 2 1 0 0 1 0

Evaluation puzzle

2010 Nov 11

Evaluation puzzle

The survexp function can fail when called from another function. The "why" of this has me baffled, however. Here is a simple test case, using a very stripped down version of survexp: survexp.test <- function(formula, data, weights, subset, na.action, rmap, times, cohort=TRUE, conditional=FALSE, ratetable=survexp.us, scale=1, npoints, se.fit,

code works in R desktop but not iin RWeb - I got it working

2008 Oct 23

code works in R desktop but not iin RWeb - I got it working

Hi, I got this working. i had to get rid of an underscore in one of my variable names to work on RWeb. Thanks for your help though!! Natalie __________________ I think you have to be either honest or careful, since the code you submitted to Rweb is different with your former code! Why the condition was changed from "test<-subset(X, GRADE == 7 & Y_Q10A < 9)" to

score counts in an aggregate function

2010 Apr 16

score counts in an aggregate function

Dear R-Users, I have a big data set "mydata" with repeated observation and some missing values. It looks like the format below: userid sex item score1 score2 1 0 1 1 1 1 0 2 0 1 1 0 3 NA 1 1 0 4 1 0 2 1 1 0 1 2 1 2 NA 1 2 1 3 1

Semi Parametric Bootstrap

2013 Jan 10

Semi Parametric Bootstrap

Greetings to you all, I am performing a semi parametric bootstrap in R on a Gamma Distributed data and a Binomial distributed data. The main challenge am facing is the fact that the residual variance depends on the mean (if I am correct). I strongly feel that the script below may be wrong due to mean-variance relationship #####R code####### fit1s

subsetting within a function

2004 Dec 30

subsetting within a function

hi I am trying to write a function around a glm or similar function, in which I can pass the subsetting constraint as an argument to the function, but I am having trouble. the following commands (if I wanted SEX==0 to be my subset) in the global environment work fine: subexpr <- expression(SEX==0) subtest <- with(mydata, eval(subexpr)) test.glm <- glm(y~x1+x2, data=mydata,

error in nlme

2001 Sep 12

error in nlme

I'm getting an error from nlme that has me stymied. I have a data set ,'mydata', with variables: AChE, Dose, sex, set, and mrid; 'set' and 'mrid' indicate two levels of nesting, with 'set' nested within 'mrid'. I want to fit the model: mod <- nlme(AChE ~ Cexp(Dose, A, B, m), data=mydata, fixed = A+B+M~sex, random=A+B+m~sex | mrid/set,

multilevel basic lme question

2008 Jun 15

multilevel basic lme question

Hi R users I want to use the lme package for a multilevel analysis on the following example: > math<-c(2, 3,2, 5, 6 ,7 , 7) > sex<-c(1, 2, 1, 2, 2, 2, 1) > school_A<-c(1,1,1,2,2,2,2) > school_B<-c(10,10,10,20,20,20,20) > mydata<-data.frame(math, sex, school_A, school_B) > mydata School_A and school_B are two different school characteristics, math is an

Within-group correlation confidence intervals

2009 Sep 18

Within-group correlation confidence intervals

I'm trying to obtain within-group correlations on a subset of variables. I first selected my variables using the following command: mydata$x<-mydata[c("iq","education","achievement")] I'd like to look at correlations among those variables separately for men and women. My gender variable in mydata is coded 1 (women) and 0 (men). I have successfully used

How to label percentage values inside stacked bar plot using R-base

2013 Feb 18

How to label percentage values inside stacked bar plot using R-base

Hello, I am new to R. I would like others to explain to me how to add absolute values inside the individual stacked bars in a consistent way using the basic R plotting function (R base). I tried to plot a stacked bar graph using R base but the values appear in an inconsistent/illogical way in such a way that its supposed to be 100% for each village but they don't sum up to 100%. Here is the

a problem

2010 Jul 31

a problem

dear: when I read a Excel file(exp-11),The R project give me a error ,Just like this: ??'datafile' > write.foreign("exp-11.xls") ???list(df = df, datafile = datafile, codefile = codefile) : ??'datafile' > write.foreign("exp-11.xls") ???list(df = df, datafile = datafile, codefile = codefile) : ??'datafile' >

Problem with data conversion

2003 Dec 14

Problem with data conversion

Hi All: I came across the following problem while working with a dataset, and wondered if there could be a solution I sought here. My dataset consists of information on 402 individuals with the followng five variables (age,sex, status = a binary variable with levels "case" or "control", mma, dma). During data check, I found that in the raw data, the data entry

merging 2 frames while keeping all the entries from the "reference" frame

2011 Apr 04

merging 2 frames while keeping all the entries from the "reference" frame

Hello! I have my data frame "mydata" (below) and data frame "reference" - that contains all the dates I would like to be present in the final data frame. I am trying to merge them so that the the result data frame contains all 8 dates in both subgroups (i.e., Group1 should have 8 rows and Group2 too). But when I merge it it's not coming out this way. Any hint would be

further notes on model.frame issue

2009 Jan 19

further notes on model.frame issue

This is a follow-up on my note of Saturday. Let me start with two important clarifications - I think this would be a nice addition, but I've had exactly one use for it in the 15+ years of developing the survival package. - I have a work around for the current case. Prioritize accordingly. The ideal would be to change survexp as follows: fit <- survexp( ~ gender,

Reading in tab (and space) delimited data within a script XXXX

2012 Jan 19

Reading in tab (and space) delimited data within a script XXXX

Hello everyone, I use Bob Muenchen's approach for reading in "in-stream" (to use SAS parlance) delimited data within a script. This works great: mystring <- "id,workshop,gender,q1,q2,q3,q4 1,1,f,1,1,5,1 2,2,f,2,1,4,1 3,1,f,2,2,4,3 4,2, ,3,1, ,3 5,1,m,4,5,2,4 6,2,m,5,4,5,5 7,1,m,5,3,4,4 8,2,m,4,5,5,5" mydata <- read.table( textConnection(mystring),

similar to: Assigning cases to groupings based on the values of several variables