Michel Boutsen
2009-May-18 13:07 UTC
[R] discrepancies between stata and r for a cox regression
Hello,

I would like to start using R more. While trying R, and more particularly the Cox model, I was surprised by a discrepancy between the results from Stata and R for the same Cox model. With the same database, R gives a hazard ratio (4.82) that differs from the one obtained with Stata (4.52).

You will find attached the file leukemia.dta I used (Stata format).

Here is the code for R:

    library(foreign)
    leukemia <- read.dta("leukemia.dta")
    library(survival)
    res <- coxph(Surv(TIME, STATUS) ~ TREAT, data = leukemia)
    summary(res)

and here is the code for Stata:

    use "leukemia.dta", clear
    stset TIME, failure(STATUS==1)
    stcox TREAT

SPSS and EpiInfo give the same HR as Stata. I tried another database and the results agreed without any problem.

What could the problem be? I changed PC and tried two versions of R (2.8.1 and 2.9.0) without any change. The means are the same in the two packages. I saw a few posts about discrepancies, but none with an explanation.

Thanks in advance

Michel Boutsen
Brussel's University
Department of Biostatistics
Michel Boutsen
2009-May-18 15:50 UTC
[R] discrepancies between stata and r for a cox regression
Hello,

I would like to start using R more. While trying R, and more particularly the Cox model, I was surprised by a discrepancy between the results from Stata and R for the same Cox model. With the same database, R gives a hazard ratio (4.82) that differs from the one obtained with Stata (4.52).

You will find attached the file leukemia.dta I used (Stata).

Here is the code for R:

    leukemia <- read.fwf(file = "leukem.txt", widths = c(4, 2, 3, 2, 5, 2),
                         col.names = c("id", "TREAT", "TIME", "STATUS", "LOGWBC", "GENDER"))
    library(survival)
    res <- coxph(Surv(TIME, STATUS) ~ TREAT, data = leukemia)
    summary(res)

and here is the code for Stata:

    infix ID 2-3 TREAT 6 TIME 8-9 STATUS 11 LOGWBC 12-16 GENDER 18 using "g:rleukem.txt", clear
    stset TIME, failure(STATUS==1)
    stcox TREAT

SPSS and EpiInfo give the same HR as Stata. I tried another database and the results agreed without any problem.

What could the problem be? I changed PC and tried two versions of R (2.8.1 and 2.9.0) without any change. The means are the same in the two packages. I saw a few posts about discrepancies, but not with the same database.

Thanks in advance

Michel Boutsen
Brussel's University
Department of Biostatistics

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: leukem.txt
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20090518/bdadb8ea/attachment-0002.txt>
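
One thing that may be worth checking, as a sketch rather than a confirmed diagnosis: coxph() in the survival package handles tied event times with the Efron approximation by default, whereas Stata's stcox defaults to the Breslow approximation. When a data set has many tied times, the two approximations can give noticeably different hazard ratios. The attached file is not reproduced here, so the example below uses the gehan leukemia remission data shipped with the MASS package as a stand-in (an assumption: it may or may not be the same data as leukem.txt, and its column names time, cens and treat differ from TIME, STATUS and TREAT). It uses the ties argument of current survival versions; older releases called this argument method.

```r
library(survival)
library(MASS)   # provides the 'gehan' remission data used as a stand-in

# Same model fitted twice; only the handling of tied event times differs.
fit_efron   <- coxph(Surv(time, cens) ~ treat, data = gehan, ties = "efron")
fit_breslow <- coxph(Surv(time, cens) ~ treat, data = gehan, ties = "breslow")

# Hazard ratios (control vs 6-MP) under each approximation
hr <- c(efron   = unname(exp(coef(fit_efron))),
        breslow = unname(exp(coef(fit_breslow))))
print(round(hr, 2))
```

If the two hazard ratios differ in the same way as the R and Stata results above, refitting the original model with ties = "breslow" in R, or running stcox TREAT, efron in Stata, should show whether tie handling accounts for the gap.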