Luis Reino
2008-Jul-29 16:11 UTC
[R] Bootstraping GAMs for confidence intervales calculation
Dear R-Users, I am resending this message just to reminder my question regarding the calculation of a bootstrap confidence intervals for a GAM plot. I am trying to apply a bootstrap to a GAM in order to calculate the 95% confidence intervals for a smooth curve obtained by the ?plot.gam? function of the mgcv package. Nonetheless, I am getting some difficulties in transposing the results for the graphs. I used the following commands in R, ?mgcv? and ?boot? packages: *> attach(bbvc_11Jul08)* *> model.boot<-function(data,indices){* *+ sub.data<-data[indices,]* *+ model<-gam(asin(sqrt(Cov0_30))~s(age,k=4,bs="cr"),family=gaussian,data=sub.data)* *+ coef(model)}* *> gam.boot<-boot(bbvc_11Jul08,model.boot,R=1000)* * * *> gam.boot* ORDINARY NONPARAMETRIC BOOTSTRAP Call: boot(data = bbvc_11Jul08, statistic = model.boot, R = 1000) Bootstrap Statistics : original bias std. error t1* 0.40370112 0.0002762674 0.02017378 t2* 0.04715501 -0.0144132280 0.07648348 t3* 0.14590936 -0.0135820501 0.05161244 t4* 0.13520098 -0.0096405733 0.04470793 I obtained 4 indexes in the ?bootstrap? output. I know that for a lm function, with an independent variable, t1 is the index of the intercept and t2 is the index of the variable, but for GAMs I don't find what each indexes really mean! Moreover, to calculate the confidence intervals of 95% I proceed as follows: * * *>boot.ci(gam.boot,conf=0.95) * BOOTSTRAP CONFIDENCE INTERVAL CALCULATIONS Based on 1000 bootstrap replicates CALL : boot.ci(boot.out = gam.boot, conf = 0.95) Intervals : Level Normal Basic Studentized 95% ( 0.3639, 0.4430 ) ( 0.3635, 0.4434 ) ( 0.3583, 0.4507 ) Level Percentile BCa 95% ( 0.3640, 0.4439 ) ( 0.3623, 0.4434 ) Calculations and Intervals on Original Scale Warning message: In sqrt(tv[, 2]) : NaNs produced My first doubt is when I make "boot.ci", which of the four should I choose? By default, R calculates "index=1:min(2,length(boot.out$t0)" but I don't know if this is the correct form to do it. Finally, I don?t know how to introduce the 95% confidence intervals in the "plot.gam". I hope you can help me. Thanks in advance. Lu?s Reino -- Luis Reino, PhD Student Centro de Estudos Florestais Departamento de Engenharia Florestal Instituto Superior de Agronomia Universidade T?cnica de Lisboa 1349-017 Lisboa Portugal