Raghu Naik
2009-Apr-07 16:53 UTC
[R] Comparing Sample / Population Geographic Distributions
I am trying to confirm that the sample is not different from the population with respect to geographical respresentation for the following data: region population sample n_total s_total 1 north 1244 232 9077 1032 2 south 4333 100 9077 1032 3 east 1500 200 9077 1032 4 west 2000 500 9077 1032 For example the first line of this data shows that our sample consisted of 232 individuals from the north that had a population of 1244. The table also shows that in total, we chose 1032 individuals (s_total) from a population size of 9077 (n_total). I thought that one way of doing that would be to run a logistic regression of sample / population proportion on region (north, south, east, west) and source (sample, population) and show that the source factor is insignificant. I am not exactly satisfied with the results with a huge residual deviance and a sample factor p value of 1.00. Therefore, I was wondering if there was a better approach of making such comparison. I would appreciate any suggestions. Raghu The data / logistic regression code is as below : ____________________________________________________ library(sqldf) mystring <- ("region, population, sample north, 1244, 232 south, 4333, 100 east, 1500, 200 west, 2000, 500") one <- read.table(textConnection(mystring),header=TRUE,sep=",") n_total <- sum(one$population) s_total <- sum(one$sample) one <- cbind(one,n_total,s_total) ; one two <- sqldf(" select population as count, n_total as total , 'population' as source, region from one union select sample as count, s_total as total, 'sample' as source, region from one " ) attach(two) prop <- count/total model1 <- glm(prop ~ source + region, weights=total, quasibinomial) [[alternative HTML version deleted]]
Seemingly Similar Threads
- Undefined columns selected
- Graphically show population density for a specific geographic area using R
- ''$'' placeholder naming can confuse your runner
- getting all pairwise combinations of elements in a character string
- [klibc:update-dash] mystring: fix "Illegal number" on FreeBSD & macOS for x=; echo $((x))