Steven Gorlé
2007-Jan-03 21:29 UTC
[R] stratified sampling from known population datafile
Dear R-wizards,
I have a population from which I want to draw a stratified sample by region.
In Venables and Ripley "Modern Applied statistics with S" I found
some
great procedures for Simple Random Sampling (with and without replacement)
and for Systematic sampling and it works!
For stratified sampling I referred to the manual of the survey package.Are
there any other papers available on this subject?
Is the output correct? And how can I draw a random (stratified by region)
sample from my (population) datafile bmi?
dstrat<-svydesign(id=~1,strata=~REGIONCH, data=bmi)
Warning in svydesign(id = ~1, strata = ~REGIONCH, data = bmi) :
No weights or probabilities supplied, assuming equal
probability> summary(dstrat)
Stratified Independent Sampling design (with replacement)
svydesign(id = ~1, strata = ~REGIONCH, data = bmi)
Probabilities:
Min. 1st Qu. Median Mean 3rd Qu. Max.
1 1 1 1 1 1
Stratum Sizes:
Brussels Flanders Walloonia
obs 2571 2987 3006
design.PSU 2571 2987 3006
actual.PSU 2571 2987 3006
Data variables:
[1] "ID" "WFIN" "HH"
"REGION" "EDU3" "FA3"
[7] "TA2" "AGE7" "SEX"
"VOEG" "BMI" "LNBMI"
[13] "LNVOEG" "FLA" "BRU"
"WAL" "AGEGR1" "AGEGR2"
[19] "AGEGR3" "AGEGR4" "AGEGR5"
"AGEGR6" "AGEGR7" "EDUPRIM"
[25] "EDUSEC" "EDUHIGH" "INCLOW"
"INCMED" "INCHIG" "REGIONCH"
[31] "PROVINCE" "SGP" "GHQ12"
"GHQBIN"> svymean(~BMI, dstrat)
mean SE
BMI NA NA
Thanks in advance!!
Kind regards,
Steven Gorle