thr3ads.net - R help - [R] Resampling to find Confidence intervals [Jan 2011]

Ben Ward

2011-Jan-04 00:03 UTC

[R] Resampling to find Confidence intervals

Hi, I'm doing some modelling (lm) for my 3rd-year dissertation and I
want to do some resampling. I'm working with microbes, getting them to
evolve resistance to antimicrobial compounds, and after each exposure I
measure the minimum concentration required to kill them (which I expect
to rise over time, i.e. with the number of exposures). I have 5 lineages
per cleaner and I'm using 2 cleaners of different chemical origin; it's
the difference in concentration results between these two origins that
I'm interested in. The amount of data I get is small, hence my desire to
resample. But that's not so important.

I have used help from Kaplan's book, Statistical Modeling: A Fresh
Approach, to write the following code for my project:

samps = do(500) * coef(lm(MIC. ~ 1 + Challenge + Cleaner + Replicate,
                          data=resample(ecoli)))
sd(samps)

But the "resample" and "do" operators are functions specific to a
workspace that comes with the book, not a normal R setup. So I was
thinking of ways I could achieve the same result, or the same sort of
result, since each resample should be different. I think the following
would work to the same effect:

resampled_ecoli = sample(ecoli, 500, replace=T)
coefs = (coef(lm(MIC. ~ 1 + Challenge + Cleaner + Replicate, 
data=resampled_ecoli)))
sd(coefs)

And then I can work out confidence intervals by multiplying the standard 
errors by 2.

I'm not used to doing this sort of operation in R, though, so I don't
want to do the wrong thing.
If anyone could tell me whether that would work, or what I need to do
instead, I'd be eternally grateful.

Thanks,
Ben Ward.
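
A note on the sample() call in the post above: a data frame is a list of columns, so sample(ecoli, 500, replace=T) draws columns, not rows. The sketch below illustrates this with a made-up stand-in for ecoli (its values and layout are assumptions; only the column names come from the post) and shows resampling rows by index instead.

```r
# Hypothetical stand-in for the ecoli data; only the column names are
# taken from the post, the values and layout are invented.
set.seed(1)
ecoli <- data.frame(
  MIC.      = runif(20, 1, 10),
  Challenge = rep(1:10, times = 2),
  Cleaner   = rep(c("A", "B"), each = 10),
  Replicate = rep(1:5, times = 4)
)

# sample() on a data frame draws from its columns, so this returns
# a data frame with 500 columns rather than 500 rows.
by_column <- sample(ecoli, 500, replace = TRUE)
dim(by_column)   # 20 500

# Drawing row indices with replacement gives one bootstrap resample
# with the same number of rows and the same columns as the original.
by_row <- ecoli[sample(nrow(ecoli), replace = TRUE), ]
dim(by_row)      # 20 4
```

The row-indexed version still produces only a single resample; it has to be repeated many times to estimate a standard error.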

Dieter Menne

2011-Jan-04 16:24 UTC

Axolotl9250 wrote:
> ...
> resampled_ecoli = sample(ecoli, 500, replace=T)
> coefs = (coef(lm(MIC. ~ 1 + Challenge + Cleaner + Replicate,
> data=resampled_ecoli)))
> sd(coefs)
> ...

Below is a simplified, self-contained version of your code, with some
changes.

Dieter

# resample
d = data.frame(x=rnorm(10))
d$y = d$x*3+rnorm(10,0.01)

# if you do this, you only get ONE bootstrap sample
d1 = d[sample(1:nrow(d),10,TRUE),]
d1.coef = coef(lm(y~x,data=d1))
d1.coef
# No error below, but the result is wrong: this is the sd across the two
# coefficients (intercept and slope), not the bootstrap sd of each one.
sd(d1.coef)

# We have to do this over and over
# Check ?replicate for a more R-ish approach....
nsamples = 1000
allboot = NULL
for (i in seq_len(nsamples)) {
  d1 = d[sample(1:nrow(d),10,TRUE),]
  d1.coef = coef(lm(y~x,data=d1))
  allboot = rbind(allboot,d1.coef) # Not very efficient, preallocate!
}
head(allboot) # display the first few of the nsamples rows
apply(allboot,2,mean) # Compute mean
apply(allboot,2,sd) # compute sd
# After you are sure you understood the above, you might try package boot.
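
For readers following the two pointers in the code above (?replicate and package boot), here is a minimal sketch of how they might look on the same toy data. The seed, the noise level, the helper names boot_coef and stat, and the choice of a percentile interval are assumptions made for illustration, not part of the original reply.

```r
set.seed(42)                          # assumed seed, only for reproducibility
n <- 10
d <- data.frame(x = rnorm(n))
d$y <- d$x * 3 + rnorm(n, sd = 0.5)   # assumed noise level for illustration

# One bootstrap replicate: resample rows with replacement, refit the
# model, and return its coefficients. (boot_coef is a hypothetical helper.)
boot_coef <- function(data) {
  idx <- sample(nrow(data), replace = TRUE)
  coef(lm(y ~ x, data = data[idx, ]))
}

nsamples <- 1000
# replicate() returns a 2 x nsamples matrix; transpose it so that each
# row holds the coefficients from one resample.
allboot <- t(replicate(nsamples, boot_coef(d)))

boot_se <- apply(allboot, 2, sd)      # bootstrap standard errors
est <- coef(lm(y ~ x, data = d))      # estimates from the original data

# Rough 95% interval: estimate plus or minus 2 bootstrap standard errors
ci_2se <- cbind(lower = est - 2 * boot_se, upper = est + 2 * boot_se)

# Percentile interval: 2.5% and 97.5% quantiles of the bootstrap coefficients
ci_perc <- t(apply(allboot, 2, quantile, probs = c(0.025, 0.975)))

ci_2se
ci_perc

# The same idea with package boot, if it is installed. The statistic
# function receives the data and a vector of resampled row indices.
if (requireNamespace("boot", quietly = TRUE)) {
  stat <- function(data, indices) coef(lm(y ~ x, data = data[indices, ]))
  b <- boot::boot(d, statistic = stat, R = nsamples)
  boot::boot.ci(b, type = "perc", index = 2)   # percentile CI for the slope
}
```

Note that a confidence interval is the estimate plus or minus roughly 2 standard errors, not the standard error times 2 on its own. With a sample this small, the 2-SE and percentile intervals can differ noticeably.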

-- 
View this message in context:
http://r.789695.n4.nabble.com/Resampling-to-find-Confidence-intervals-tp3172867p3173846.html
Sent from the R help mailing list archive at Nabble.com.
