Kai Ying
2012-Jul-09 02:21 UTC
[R] classification using zero-inflated negative binomial mixture model
Hi, I want using zero-inflated negative binomial regression model to classify data(a vector of data), that is I want know each observed value is more likely belong to the "zero" or "count" distribution(better with relative probability). My data is some like: count site samp 12909 1 1 602 1 2 50 1 3 1218 1 4 91291 1 5 while "count" is the data with a mixture of "zero" and "non-zero" distribution I want know, and "site", "samp" are two prediction valuables with additive effect(but I am not interested in it). I have tried the zeroinfl function of pscl package to fit zero-inflated negative binomial regression. But it can not give you the classification result of "count". Can anyone help with some indication of how to do it or other tools that can do this job ?? -- Kai Ying Iowa State University ISU, Ames IA 50010 Email: yingk@iastate.edu [[alternative HTML version deleted]]
Ben Bolker
2012-Jul-09 16:00 UTC
[R] classification using zero-inflated negative binomial mixture model
Kai Ying <yingk <at> iastate.edu> writes:> > Hi, > I want using zero-inflated negative binomial regression model to > classify data(a vector of data), that is I want know each observed value is > more likely belong to the "zero" or "count" distribution(better with > relative probability). My data is some like: > > count site samp > > 12909 1 1 > > 602 1 2 > > 50 1 3 > > 1218 1 4 > > 91291 1 5 > > while "count" is the data with a mixture of "zero" and "non-zero" > distribution I want know, and "site", "samp" are two prediction valuables > with additive effect(but I am not interested in it). > > I have tried the zeroinfl function of pscl package to fit zero-inflated > negative binomial regression. But it can not give you the classification > result of "count". Can anyone help with some indication of how to do it or > other tools that can do this job ??Not sure, but you may be able to do this by hand. For a predicted mean value mu, overdispersion parameter k, and zero-inflation probability p, the probability p_z of a structural zero is p, while the probability of a sampling zero p_s is (k/(mu+k))^k ; therefore the probability that an observed zero is a structural zero is p_z/(p_s+p_z) ...