Manish Dalwani
2012-Apr-29 21:46 UTC
[R] need help with avg.surv (Direct Adjusted Survival Curve)
Hello R users, I am trying to obtain a direct adjusted survival curve. I am sending my whole code (see below). It's basically the larynx cancer data with Stage 1-4. I am using the cox model using coxph option, see the fit3 coxph. When I use the avg.surv option on fit3, I get the following error: "fits<-avg.surv(fit3, var.name="stage.fac", var.values=c(1,2,3,4), data=larynx) Error in `contrasts<-`(`*tmp*`, value = contrasts.arg[[nn]]) : contrasts apply only to factors" If try using var.values = c ("1","2","3","4") option, I get the following error: "Error in xmat %*% cfit$coef : non-conformable arguments In addition: Warning message: In model.matrix.default(Terms, mf, contrasts = contrast.arg) : variable 'stage.fac' converted to a factor" Please advise me on what I am doing wrong! Regards, Manish Dalwani University of Colorado library(survival) library(ggplot2) setwd("/Users/manishdalwani/surv_6646/") #file.show("Larynx.txt") #Stage of disease (1=stage 1, 2=stage2, 3=stage 3, 4=stage 4) #Time to death or on-study time, months #Age at diagnosis of larynx cancer #Year of diagnosis of larynx cancer #Death indicator (0=alive, 1=dead) larynx <-read.table(file ="larynx.dat",col.names =c("stage","time","age","year","death"),colClasses =c("factor","numeric","numeric","numeric","numeric")) #the death indicator needs to be numeric for the survfit function to work # splitting age into three intervals: <55, 50-70, >70 age.larynx <- rep(0, length(larynx[,1])) age.larynx [larynx$age < 55] <- 1 age.larynx [larynx$age >= 55 & larynx$age <= 70] <- 2 age.larynx [larynx$age >70] <- 3 larynx <- cbind(larynx, age.larynx) larynx$age.larynx <- factor(larynx$age.larynx, labels=c("1", "2", "3")) larynx$stage.fac<-as.factor(larynx$stage) year.larynx <- rep(0, length(larynx[,1])) year.larynx [larynx$year >= 74] <- 1 year.larynx [larynx$year < 74] <- 2 larynx <- cbind(larynx, year.larynx) head(larynx) summary(larynx) kapmeier.fit <- survfit( Surv(time, death) ~ stage, data = larynx, type = "kaplan-meier") sumkap <- summary(kapmeier.fit) attributes(kapmeier.fit) attributes(sumkap) plot(kapmeier.fit, lty=2:3, fun="cumhaz", xlab="Months",ylab="Cumulative Hazard of death") legend(4,.4,c("stage1","stage2","stage3","stage4"),lty=1:2) plot(kapmeier.fit, lty=2:3, xlab="Months",ylab="Survival Function") #construct a data frame for the plots plotdata <- data.frame(time = sumkap$time, surv = sumkap$surv, strata = sumkap$strata) fact.stage<-factor(larynx$stage) fit1<-coxph(Surv(time, death) ~ fact.stage + age.larynx + factor(year>=74), data=larynx, method=c("efron")) summary(fit1) avg.surv <- function(cfit, var.name, var.values, data, weights) { if(missing(data)){ if(!is.null(cfit$model)) mframe <- cfit$model elsemframe <-model.frame(cfit,sys.parent()) } else mframe <- model.frame(cfit, data) var.num <- match(var.name, names(mframe)) data.patterns <- apply(data.matrix(mframe[, - c(1, var.num)]), 1, paste, collapse = ",") data.patterns <- factor(data.patterns,levels=unique(data.patterns)) if(missing(weights)) weights <- table(data.patterns) else weights <- tapply(weights, data.patterns, sum) kp <- !duplicated(data.patterns) mframe <- mframe[kp,] obs.var <- mframe[,var.num] lps <- (cfit$linear.predictor)[kp] tframe <- mframe[rep(1,length(var.values)),] tframe[,var.num] <- var.values xmat <- model.matrix(cfit,data=tframe)[,-1] tlps <- as.vector(xmat%*%cfit$coef) names(tlps) <- var.values obs.off <- tlps[as.character(obs.var)] explp.off <- exp(outer(lps - obs.off ,tlps,"+")) bfit <- survfit.coxph(cfit, se.fit = F) fits <- outer(bfit$surv,explp.off,function(x,y) x^y) avg.fits <- apply(sweep(fits,2,weights,"*"),c(1,3),sum)/sum(weights) dimnames(avg.fits) <- list(NULL,var.values) list(time=bfit$time,fits=avg.fits) } fits<-avg.surv(fit1, var.name="fact.stage", data=larynx, var.values=c("0","1","2","3")) matlines(fits$time,fits$fits) fit2<-coxph(Surv(time, death) ~ stage + age.larynx + factor(year>=74), data=larynx, method=c("efron")) summary(fit2) fit3<-coxph(Surv(time, death) ~ stage.fac + age.larynx + year.larynx), data=larynx, method=c("efron")) summary(fit3) fit4<-coxph(Surv(time,death)~factor(stage)+factor(age>=74)+factor(year>=74),data=larynx,method=c("efron")) summary(fit4) lines(survexp(~stage+ratetable(stage,age.larynx,year>=74),data=larynx,ratetable=fit2,cohort=TRUE),col="purple") fits<-avg.surv(fit3, var.name="stage.fac", var.values=c(1,2,3,4), data=larynx) matlines(fits$time,fits$fits) [[alternative HTML version deleted]]
Maybe Matching Threads
- need help with avg.surv (Direct Adjusted Survival Curve), Message-ID:
- ow to apply a panel function to each of several data series plotted on the same graph in lattice
- prediction intervals (alpha and beta) for model average estimates from binomial glm and model.avg (library=dRedging)
- Passing a string variable to Surv
- ConstrOptim Function (Related to Constraint Matrix/ui/ci error)