Patrick Giraudoux
2012-Mar-21 09:56 UTC
[R] AIC models are not all fitted to the same number of observation
Hi,
Using lme from the package nlme 3.1-103, I meet a strange warning. I am
trying to compare to models with:
library(nlme)
lmez6=lme(lepus~vulpes,random=~1|troncon/an,data=ika_z6_test)
lmez60=lme(lepus~1,random=~1|troncon/an,data=ika_z6_test)
Both have the same number of observations and groups:
lmez6
Linear mixed-effects model fit by REML
Data: ika_z6_test
Log-restricted-likelihood: -2267.756
Fixed: lepus ~ vulpes
(Intercept) vulpes
1.35017117 0.04722338
Random effects:
Formula: ~1 | troncon
(Intercept)
StdDev: 0.8080261
Formula: ~1 | an %in% troncon
(Intercept) Residual
StdDev: 1.086611 0.4440076
Number of Observations: 1350
Number of Groups:
troncon an %in% troncon
169 1350
> lmez60
Linear mixed-effects model fit by REML
Data: ika_z6_test
Log-restricted-likelihood: -2266.869
Fixed: lepus ~ 1
(Intercept)
1.435569
Random effects:
Formula: ~1 | troncon
(Intercept)
StdDev: 0.8139646
Formula: ~1 | an %in% troncon
(Intercept) Residual
StdDev: 1.086843 0.4445815
Number of Observations: 1350
Number of Groups:
troncon an %in% troncon
169 1350
...but when I want to compare their AIC, I get:
AIC(lmez6,lmez60)
df AIC
lmez6 5 4545.511
lmez60 4 4541.737
Warning message:
In AIC.default(lmez6, lmez60) :
models are not all fitted to the same number of observations
Has anybody an explanation about this strange warning ? To what extent
this warning may limit the conclusions that could be drawn from AIC
comparison ?
Thanks in advance,
Patrick
Patrick Giraudoux
2012-Mar-21 14:55 UTC
[R] AIC models are not all fitted to the same number of observation
Le 21/03/2012 10:56, Patrick Giraudoux a ?crit :> Hi, > > Using lme from the package nlme 3.1-103, I meet a strange warning. I > am trying to compare to models with: > > library(nlme) > lmez6=lme(lepus~vulpes,random=~1|troncon/an,data=ika_z6_test) > lmez60=lme(lepus~1,random=~1|troncon/an,data=ika_z6_test) > > Both have the same number of observations and groups: > > lmez6 > Linear mixed-effects model fit by REML > Data: ika_z6_test > Log-restricted-likelihood: -2267.756 > Fixed: lepus ~ vulpes > (Intercept) vulpes > 1.35017117 0.04722338 > > Random effects: > Formula: ~1 | troncon > (Intercept) > StdDev: 0.8080261 > > Formula: ~1 | an %in% troncon > (Intercept) Residual > StdDev: 1.086611 0.4440076 > > Number of Observations: 1350 > Number of Groups: > troncon an %in% troncon > 169 1350 > > > > lmez60 > Linear mixed-effects model fit by REML > Data: ika_z6_test > Log-restricted-likelihood: -2266.869 > Fixed: lepus ~ 1 > (Intercept) > 1.435569 > > Random effects: > Formula: ~1 | troncon > (Intercept) > StdDev: 0.8139646 > > Formula: ~1 | an %in% troncon > (Intercept) Residual > StdDev: 1.086843 0.4445815 > > Number of Observations: 1350 > Number of Groups: > troncon an %in% troncon > 169 1350 > > ...but when I want to compare their AIC, I get: > > AIC(lmez6,lmez60) > df AIC > lmez6 5 4545.511 > lmez60 4 4541.737 > Warning message: > In AIC.default(lmez6, lmez60) : > models are not all fitted to the same number of observations > > > Has anybody an explanation about this strange warning ? To what extent > this warning may limit the conclusions that could be drawn from AIC > comparison ? > > Thanks in advance, > > Patrick > > >Sorry to go on on the thread, I have created, but the trouble I meet is above my level in stats... Actually, not using AIC but an anova approach, I get a more informative message: anova(lmez6, lmez60) Model df AIC BIC logLik Test L.Ratio p-value lmez6 1 5 4545.511 4571.543 -2267.756 lmez60 2 4 4541.737 4562.566 -2266.869 1 vs 2 1.774036 0.1829 Warning message: In anova.lme(lmez6, lmez60) : Fitted objects with different fixed effects. REML comparisons are not meaningful. And fubbling a bit more, I disclosed that this was an effect of fitting the model using REML. If fitted using ML, things are going (apparently) smoothly: lmez6=lme(lepus~vulpes,random=~1|troncon/an,data=ika_z6_test,method="ML") > lmez60=lme(lepus~1,random=~1|troncon/an,data=ika_z6_test,method="ML") > anova(lmez6, lmez60) Model df AIC BIC logLik Test L.Ratio p-value lmez6 1 5 4536.406 4562.445 -2263.203 lmez60 2 4 4538.262 4559.093 -2265.131 1 vs 2 3.856102 0.0496 > AIC(lmez6,lmez60) df AIC lmez6 5 4536.406 lmez60 4 4538.262 Now I have the following problem. What I understood from Pinheiro and Bates's book and some forums, is that ML estimations are biased to some extent tending to underestimate variance parameters. So probably not to recommend however results looks consistent here. Thus, I am lost. The two models looks to me clearly embedded (one is just a null model with the only intercept to estimate and the other with intercept + one independent variable (numeric), both have the same random effects, the same response variable and the same number of observations). Warnings, from this point of view sounds inconsistent. They are probably not, but beyond my understanding... Any idea ? Patrick