TASK:   problem with "glm" with binomial errors
STATUS: Open
FROM:   p.dalgaard@kubism.ku.dk
        in glm(,binomial) it's possible that loss of significant
        digits make expected values 0 or 1 even though there's no
        divergence of the fit.  (Happened to me with menarche data,
        infants and grown-ups included)
        [ Need the example data.  Glm needs a complete overhaul. ]
OK, here's a way to fake some data (simulated as if menarche occurs
uniformly between 10 an 15 years of age) There's no real problem with
fitting a logistic regression to this, but the logistic curve gets too
close to 0 and 1 for the very young and the very old.
BTW: rbinom gives NA for p=0 and p=1, which causes some problems with
the simulations. 
BTW2: Doesn't "occurred" take a double r?
> fake.age<-runif(500,0,50)
> fake.menarche<-rbinom(500,1,.99999 *(fake.age > 15) + .99999 *
(fake.age > 10 & fake.age <= 15)/5 * (fake.age - 10)+.000005)
> glm(fake.menarche~fake.age,binomial)
Error: fitted probabilities of 0 or 1 occured> fake2.age<-fake.age[fake.age<25&fake.age>5]
> fake2.menarche<-fake.menarche[fake.age<25&fake.age>5]
> glm(fake2.menarche~fake2.age,binomial)
Call:  glm(fake2.menarche ~ fake2.age, binomial) 
Coefficients:
(Intercept)    fake2.age  
   -15.1061       1.2011  
Degrees of Freedom: 189 Total; 187 Residual
Null Deviance: 253.9 
Residual Deviance: 48.73 > 
-- 
   O__  ---- Peter Dalgaard             Blegdamsvej 3  
  c/ /'_ --- Dept. of Biostatistics     2200 Cph. N   
 (*) \(*) -- University of Copenhagen   Denmark      Ph: (+45) 35327918
~~~~~~~~~~ - (p.dalgaard@biostat.ku.dk)             FAX: (+45) 35327907
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
r-devel mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To:
r-devel-request@stat.math.ethz.ch
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-