Hi David,
On Fri, Nov 21, 2008 at 12:01:52PM -0800, dschruth
wrote:> I'm a programmer in a biology lab who is starting to use R to automate
> some of our statistical analysis of growth rate determination. But I'm
> running into some problems as I re-code.
>
> 1) Hypotheses concerning Slope similarity/difference:
> I'm using R's anova(lm()) methods to analyse a model which looks
> like this:
> growth.metric ~ time * test.tube
> I understand that testing the the interaction between time and tube
> (time:test.tube) will tell us if the growth rates (for the last three
> test tubes) are significantly different from one another (Ho=slopes
> are the same). The purpose of doing this test is so that we can be
> certain our cultures have fully acclimated to the treatment and aren't
> going to change much if we stop measuring. This is an important cost
> saving practice too as measurements can go on for years. Yet I'm
> worried that our null and alternative hypotheses should be swapped so
> that our test is more conservative (Ho=slopes are different ... ie
> still acclimating.)
Good thinking.
> Is there a way to specify my model that flips these hypotheses?
> Should I be using a different method? Is this even appropriate?
You could think about equivalence tests. See e.g. references in the
equivalence package.
> 2) Growth Rate is confounded with Variance of Growth Rate
> I'm also worried about the fact that rates for cultures with faster
> growth are calculated using fewer data points (assuming similar
> sampling times between treatments) . The result is that growth ~ var
> (growth). Not only does this put a wrinkle in my analysis between
> treatments, but it also biases the growth acclimation determining
> ANCOVA test above. Faster growing cultures will usually pass the "no
> significant difference between slopes test" more easily because there
> are fewer points from which to be certain about rejecting Ho.
>
> Is there a way to control for this?
> Perhaps I could include the number of points in my model?
Depending on the model that you apply, you might be able to explicitly
model the variance to allow for this possibility. I would guess that
it's not necessarily only the fewer data points contributing to the
greater variation. Faster-growing cultures might also be inherently
more variable.
> 3) Statistical validity of using subsets of growth.metric measurements
> within a test tube
> There are some lab members who insist that we can throw out the
> beginning and end of our log transformed growth.metric measurements
> because they are outliers in determining maximum growth. I've
> proposed looping through all possible combinations of 3 or more points
> within the growth curve and using the highest or best fitting (best R-
> squared) slope. But this idea has been rejected by our PI as not be a
> valid thing to do.
>
> Ideas here?
I'm feeling very cautious about giving advice on this question as I
don't know enough about the area. Sorry.
I hope that this helps, otherwise.
Andrew
--
Andrew Robinson
Department of Mathematics and Statistics Tel: +61-3-8344-6410
University of Melbourne, VIC 3010 Australia Fax: +61-3-8344-4599
http://www.ms.unimelb.edu.au/~andrewpr
http://blogs.mbs.edu/fishing-in-the-bay/