Displaying 20 results from an estimated 10000 matches similar to: "Overdispersion in count data"
2009 Feb 16
1
Overdispersion with binomial distribution
I am attempting to run a glm with a binomial model to analyze proportion
data.
I have been following Crawley's book closely and am wondering if there is
an accepted standard for how much is too much overdispersion? (e.g. change
in AIC has an accepted standard of 2).
In the example, he fits several models, binomial and quasibinomial and then
accepts the quasibinomial.
The output for residual
2011 Jun 13
1
glm with binomial errors - problem with overdispersion
Dear all,
I am new to R and my question may be trivial to you...
I am doing a GLM with binomial errors to compare proportions of species in
different categories of seed sizes (4 categories) between 2 sites.
In the model summary the residual deviance is much higher than the degree
of freedom (Residual deviance: 153.74 on 4 degrees of freedom) and even
after correcting for overdispersion by
2009 Feb 23
1
Follow-up to Reply: Overdispersion with binomial distribution
THANKS so very much for your help (previous and future!). I have a two
follow-up questions.
1) You say that dispersion = 1 by definition ....dispersion changes from 1
to 13.5 when I go from binomial to quasibinomial....does this suggest that
I should use the binomial? i.e., is the dispersion factor more important
that the
2) Is there a cutoff for too much overdispersion - mine seems to be
2011 Apr 21
1
Accounting for overdispersion in a mixed-effect model with a proportion response variable and categorical explanatory variables.
Dear R-help-list,
I have a problem in which the explanatory variables are categorical,
the response variable is a proportion, and experiment contains
technical replicates (pseudoreplicates) as well as biological
replicated. I am new to both generalized linear models and mixed-
effects models and would greatly appreciate the advice of experienced
analysts in this matter.
I analyzed the
2007 Jan 11
2
overdispersion
How can I eliminate the overdispersion for binary data apart the use of the quasibinomial?
help me
Eva Iannario
------------------------------------------------------
Passa a Infostrada. ADSL e Telefono senza limiti e senza canone Telecom
http://click.libero.it/infostrada11gen07
2009 Mar 02
2
Unrealistic dispersion parameter for quasibinomial
I am running a binomial glm with response variable the no of mites of two
species y->cbind(mitea,miteb) against two continuous variables (temperature
and predatory mites) - see below. My model shows overdispersion as the
residual deviance is 48.81 on 5 degrees of freedom. If I use quasibinomial
to account for overdispersion the dispersion parameter estimate is 2501139,
which seems
2012 Feb 07
1
binomial vs quasibinomial
After looking at 48 glm binomial models I decided to try the quasibinomial
with the top model 25 (lowest AIC). To try to account for overdispersion
(residual deviance 2679.7/68 d.f.) After doing so the dispersion factor is
the same for the quasibinomial and less sectors of the beach were
significant by p-value. While the p-values in the binomial were more
significant for each section of the
2008 May 07
2
Estimating QAIC using glm with the quasibinomial family
Hello R-list. I am a "long time listener - first time caller" who has
been using R in research and graduate teaching for over 5 years. I
hope that my question is simple but not too foolish. I've looked
through the FAQ and searched the R site mail list with some close hits
but no direct answers, so...
I would like to estimate QAIC (and QAICc) for a glm fit using the
2005 Oct 20
3
different F test in drop1 and anova
Hi,
I was wondering why anova() and drop1() give different tail
probabilities for F tests.
I guess overdispersion is calculated differently in the following
example, but why?
Thanks for any advice,
Tom
For example:
> x<-c(2,3,4,5,6)
> y<-c(0,1,0,0,1)
> b1<-glm(y~x,binomial)
> b2<-glm(y~1,binomial)
> drop1(b1,test="F")
Single term deletions
Model:
y ~
2003 Feb 18
4
glm and overdispersion
Hi,
I am performing glm with binomial family and my data show slight
overdispersion (HF<1.5). Nevertheless, in order to take into account for
this heterogeneity though weak, I use F-test rather than Chi-square
(Krackow & Tkadlec, 2001). But surprisingly, outputs of this two tests
are exactly similar. What is the reason and how can I scale the output
by overdispersion ??
Thank you,
2008 Sep 16
1
Using quasibinomial family in lmer
Dear R-Users,
I can't understand the behaviour of quasibinomial in lmer. It doesn't
appear to be calculating a scaling parameter, and looks to be reducing the
standard errors of fixed effects estimates when overdispersion is present
(and when it is not present also)! A simple demo of what I'm seeing is
given below. Comments appreciated?
Thanks,
Russell Millar
Dept of Stat
U.
2010 Nov 19
2
Question on overdispersion
I have a few questions relating to overdispersion in a sex ratio data set
that I am working with (note that I already have an analysis with GLMMs for
fixed effects, this is just to estimate dispersion). The response variable
is binomial because nestlings can only be male or female. I have samples of
1-5 nestlings from each nest (individuals within a nest are not independent,
so the response
2008 May 01
2
zero variance in part of a glm (PR#11355)
In this real example (below), all four of the replicates in one
treatment combination had zero failures, and this produced a very high
standard error in the summary.lm.
=20
Just adding one failure to one of the replicates produced a well-behaved
standard error.
=20
I don't know if this is a bug, but it is certainly hard for users to
understand.
=20
I would value your comments=20
=20
Thanks
=20
2015 Jun 25
1
Estimating overdispersion when using glm for count and binomial data
Dear All
I recently proposed a simple modification to Wedderburn's 1974 estimate
of overdispersion for count and binomial data, which is used in glm for
the quasipoisson and quasibinomial families (see the reference below).
Although my motivation for the modification arose from considering
sparse data, it will be almost identical to Wedderburn's estimate when
the data are not sparse.
2000 Apr 19
1
scale factors/overdispersion in GLM: possible bug?
I've been poking around with GLMs (on which I am *not* an expert) on
behalf of a student, particularly binomial (standard logit link) nested
models with overdispersion.
I have one possible bug to report (but I'm not confident enough to be
*sure* it's a bug); one comment on the general inconsistency that seems to
afflict the various functions for dealing with overdispersion in GLMs
2010 Feb 18
0
Appropriate test for overdispersion in binomial data
Dear R users,
Overdispersion is often a problem in binomial data. I attempt to model a
binary response (sex-ratio) with three categorical explanatory variables,
using GLM, which could assume the form:
y<-cbind(sexf, sample-sexf)
model<-glm(y ~ age+month+year, binomial)
summary(model)
Output:
(Dispersion parameter for binomial family taken to be 1)
Null deviance: 8956.7 on 582
2011 Aug 27
1
hopelessly overdispersed?
dear list!
i am running an anlysis on proportion data using binomial (quasibinomial
family) error structure. My data comprises of two continuous vars, body
size and range size, as well as of feeding guild, nest placement, nest
type and foragig strata as factors. I hope to model with these variables
the preference of primary forests (#successes) by certain bird species.
My code therefore looks
2003 Jul 03
1
How to use quasibinomial?
Dear all,
I've got some questions, probably due to misunderstandings on my behalf, related
to fitting overdispersed binomial data using glm().
1. I can't seem to get the correct p-values from anova.glm() for the F-tests when
supplying the dispersion argument and having fitted the model using
family=quasibinomial. Actually the p-values for the F-tests seems identical to the
p-values for
2005 Oct 10
1
interpretation output glmmPQL
Hi !
We study the effect of several variables on fruit set for 44 individuals
(plants). For each individual, we have the number of fruits, the number
of flowers and a value for each variable.
Here is our first model in R :
y <- cbind(indnbfruits,indnbflowers);
model1
<-glm(y~red*yellow+I(red^2)+I(yellow^2)+densite8+I(densite8^2)+freq8_4+I
(freq8_4^2), quasibinomial);
- We have
2007 Feb 25
0
Overdispersion in a GLM binomial model
Hello,
The share of concurring votes (i.e. yes-yes and no-no) in total votes
between a pair of voters is a function of their ideological distance (index
continuous on [1,2]).
I show by other means that the votes typically are highly positively
correlated (with an average c=0.6). This is because voters sit together and
discuss the issue before taking a vote, but also because they share common