I am simulating sickness among a group of families. Part of the task is
to randomly draw who in the family will be sick, randomly drawing from
family ID's where Dad =1, Mom = 2, Kid1 = 3, Kid2 = 4., etc. My census of
Dads is of the form shown below.
Dad_ID Spouse (Y=1;N=0) #Kids #People_Becoming_Sick
1 1 0 1
2 0 2 2
3 1 0 2
4 1 3 3
...
The end output needed is if 3 people in a family are to be sick, was it
the dad and two kids, with random family ID's = {1,3,4}, or the mom, dad,
and one kid, with random family ID's = {2,1,4}, etc.. The complication
is that length of the family ID's to choose from and the associated
sampling probabilities -- changes with each family. I could loop through
the Dads, from i in 1:nrow(census), but is there a way I could vectorize
sample() to get at the same objective?
My attempts to use the apply-based functions have dead ended. Other ideas
to vectorize this problem are warmly welcomed.
Regards,
Stephen Collins, MPP | Analyst
Health & Benefits | Aon Consulting
200 East Randolph, Suite 900, Chicago, IL
Tel: 312-381-2578 | Fax: 312-381-0136
Email: stephen_collins@aon.com
Aon Consulting selected by the readers of Business Insurance as the “Best
Employee Benefit Consulting Firm” in 2006, 2007, and 2008
NOTE: The information contained in this transmission, including any
attachment(s) is only for the use of the intended individual(s) or entity,
and may contain information that is privileged and confidential. If the
reader of this message is not an intended recipient, you are hereby
notified that any dissemination, distribution, disclosure, or copying of
this information is unauthorized and strictly prohibited. If you have
received this communication in error, please contact the sender
immediately by reply email and destroy all copies of the original message.
[[alternative HTML version deleted]]
On 11/7/2008 12:00 PM, Stephen Collins wrote:> I am simulating sickness among a group of families. Part of the task is > to randomly draw who in the family will be sick, randomly drawing from > family ID's where Dad =1, Mom = 2, Kid1 = 3, Kid2 = 4., etc. My census of > Dads is of the form shown below. > > Dad_ID Spouse (Y=1;N=0) #Kids #People_Becoming_Sick > 1 1 0 1 > 2 0 2 2 > 3 1 0 2 > 4 1 3 3 > ... > > The end output needed is if 3 people in a family are to be sick, was it > the dad and two kids, with random family ID's = {1,3,4}, or the mom, dad, > and one kid, with random family ID's = {2,1,4}, etc.. The complication > is that length of the family ID's to choose from and the associated > sampling probabilities -- changes with each family. I could loop through > the Dads, from i in 1:nrow(census), but is there a way I could vectorize > sample() to get at the same objective? > > My attempts to use the apply-based functions have dead ended. Other ideas > to vectorize this problem are warmly welcomed.You might want to transform runif instead of using sample(). For example, if you want to generate M integers from 1:n_i, where n_i varies from sample to sample, you could use gen <- ceiling(runif(M, 0, n)) (where n is a vector of length M giving the upper limits). Duncan Murdoch> > > > Regards, > > Stephen Collins, MPP | Analyst > Health & Benefits | Aon Consulting > 200 East Randolph, Suite 900, Chicago, IL > Tel: 312-381-2578 | Fax: 312-381-0136 > Email: stephen_collins at aon.com > > Aon Consulting selected by the readers of Business Insurance as the ???Best > Employee Benefit Consulting Firm??? in 2006, 2007, and 2008 > > NOTE: The information contained in this transmission, including any > attachment(s) is only for the use of the intended individual(s) or entity, > and may contain information that is privileged and confidential. If the > reader of this message is not an intended recipient, you are hereby > notified that any dissemination, distribution, disclosure, or copying of > this information is unauthorized and strictly prohibited. If you have > received this communication in error, please contact the sender > immediately by reply email and destroy all copies of the original message. > > > [[alternative HTML version deleted]] > > > > ------------------------------------------------------------------------ > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.
Hi,
I'm not quite sure I understood everything but is this something close?
d <- read.table(textConnection("Dad_ID SpouseYN NKids NSick
1 1 0 1
2 0 2 2
3 1 0 2
4 1 3 3"), header=TRUE)
mapply(sample, d$NKids+d$SpouseYN+1, d$NSick)
# d$NKids+d$SpouseYN+1 is supposed to be family size
Best regards,
Kenn
On Fri, Nov 7, 2008 at 7:00 PM, Stephen Collins
<Stephen_Collins@aon.com>wrote:
> I am simulating sickness among a group of families. Part of the task is
> to randomly draw who in the family will be sick, randomly drawing from
> family ID's where Dad =1, Mom = 2, Kid1 = 3, Kid2 = 4., etc. My census
of
> Dads is of the form shown below.
>
> Dad_ID Spouse (Y=1;N=0) #Kids #People_Becoming_Sick
> 1 1 0 1
> 2 0 2 2
> 3 1 0 2
> 4 1 3 3
> ...
>
> The end output needed is if 3 people in a family are to be sick, was it
> the dad and two kids, with random family ID's = {1,3,4}, or the mom,
dad,
> and one kid, with random family ID's = {2,1,4}, etc.. The
complication
> is that length of the family ID's to choose from and the associated
> sampling probabilities -- changes with each family. I could loop through
> the Dads, from i in 1:nrow(census), but is there a way I could vectorize
> sample() to get at the same objective?
>
> My attempts to use the apply-based functions have dead ended. Other ideas
> to vectorize this problem are warmly welcomed.
>
>
>
> Regards,
>
> Stephen Collins, MPP | Analyst
> Health & Benefits | Aon Consulting
> 200 East Randolph, Suite 900, Chicago, IL
> Tel: 312-381-2578 | Fax: 312-381-0136
> Email: stephen_collins@aon.com
>
> Aon Consulting selected by the readers of Business Insurance as the “Best
> Employee Benefit Consulting Firm†in 2006, 2007, and 2008
>
> NOTE: The information contained in this transmission, including any
> attachment(s) is only for the use of the intended individual(s) or entity,
> and may contain information that is privileged and confidential. If the
> reader of this message is not an intended recipient, you are hereby
> notified that any dissemination, distribution, disclosure, or copying of
> this information is unauthorized and strictly prohibited. If you have
> received this communication in error, please contact the sender
> immediately by reply email and destroy all copies of the original message.
>
>
> [[alternative HTML version deleted]]
>
>
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
>
[[alternative HTML version deleted]]