Hello, I am looking at a two-way ANOVA dataset, and would like to count the rows in the dataframe with the same level of the first factor ("Gender") and the second factor ("Dosage"). In other words, I am interested in the number of observations per each "cell" in a (not necessarily balanced) two-way layout. What is the simplest way to do this? Thanks a lot, Asaf Observation Gender Dosage Alertness 1 1 m a 8 2 2 m a 12 3 3 m a 13 4 4 m a 12 5 5 m b 6 6 6 m b 7 7 7 m b 23 8 8 m b 14 9 9 f a 15 10 10 f a 12 11 11 f a 22 12 12 f a 14 13 13 f b 15 14 14 f b 12 15 15 f b 18 16 16 f b 22 -- View this message in context: http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html Sent from the R help mailing list archive at Nabble.com.
Hi> -----Original Message----- > From: r-help-bounces at r-project.org [mailto:r-help-bounces at r- > project.org] On Behalf Of asafwe > Sent: Monday, October 22, 2012 4:02 AM > To: r-help at r-project.org > Subject: [R] Counting duplicates in a dataframe > > Hello, > > I am looking at a two-way ANOVA dataset, and would like to count the > rows in the dataframe with the same level of the first factor > ("Gender") and the second factor ("Dosage"). In other words, I am > interested in the number of observations per each "cell" in a (not > necessarily balanced) two-way layout.How is it realated with duplicates? Do you want something like that? xtabs(~Gender+Dosage, data=some.data.frame) Regards Petr> > What is the simplest way to do this? > > Thanks a lot, > > Asaf > > Observation Gender Dosage Alertness > 1 1 m a 8 > 2 2 m a 12 > 3 3 m a 13 > 4 4 m a 12 > 5 5 m b 6 > 6 6 m b 7 > 7 7 m b 23 > 8 8 m b 14 > 9 9 f a 15 > 10 10 f a 12 > 11 11 f a 22 > 12 12 f a 14 > 13 13 f b 15 > 14 14 f b 12 > 15 15 f b 18 > 16 16 f b 22 > > > > -- > View this message in context: http://r.789695.n4.nabble.com/Counting- > duplicates-in-a-dataframe-tp4646954.html > Sent from the R help mailing list archive at Nabble.com. > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting- > guide.html > and provide commented, minimal, self-contained, reproducible code.
Hi, try this: dat1<-read.table(text=" Observation Gender Dosage Alertness 1??????????? 1????? m????? a??????? 8 2??????????? 2????? m????? a??????? 12 3??????????? 3????? m????? a??????? 13 4??????????? 4????? m????? a??????? 12 5??????????? 5????? m????? b??????? 6 6??????????? 6????? m????? b??????? 7 7??????????? 7????? m????? b??????? 23 8??????????? 8????? m????? b??????? 14 9??????????? 9????? f????? a??????? 15 10????????? 10????? f????? a??????? 12 11????????? 11????? f????? a??????? 22 12????????? 12????? f????? a??????? 14 13????????? 13????? f????? b??????? 15 14????????? 14????? f????? b??????? 12 15????????? 15????? f????? b??????? 18 16????????? 16????? f????? b??????? 22 ",sep="",header=TRUE,stringsAsFactors=FALSE) library(reshape) ?cast(dat1,Gender~Dosage,length) #? Gender a b #1????? f 4 4 #2????? m 4 4 A.K. ----- Original Message ----- From: asafwe <asafw at wharton.upenn.edu> To: r-help at r-project.org Cc: Sent: Sunday, October 21, 2012 10:02 PM Subject: [R] Counting duplicates in a dataframe Hello, I am looking at a two-way ANOVA dataset, and would like to count the rows in the dataframe with the same level of the first factor ("Gender") and the second factor ("Dosage"). In other words, I am interested in the number of observations per each "cell" in a (not necessarily balanced) two-way layout. What is the simplest way to do this? Thanks a lot, Asaf ? Observation Gender Dosage Alertness 1? ? ? ? ? ? 1? ? ? m? ? ? a? ? ? ? 8 2? ? ? ? ? ? 2? ? ? m? ? ? a? ? ? ? 12 3? ? ? ? ? ? 3? ? ? m? ? ? a? ? ? ? 13 4? ? ? ? ? ? 4? ? ? m? ? ? a? ? ? ? 12 5? ? ? ? ? ? 5? ? ? m? ? ? b? ? ? ? 6 6? ? ? ? ? ? 6? ? ? m? ? ? b? ? ? ? 7 7? ? ? ? ? ? 7? ? ? m? ? ? b? ? ? ? 23 8? ? ? ? ? ? 8? ? ? m? ? ? b? ? ? ? 14 9? ? ? ? ? ? 9? ? ? f? ? ? a? ? ? ? 15 10? ? ? ? ? 10? ? ? f? ? ? a? ? ? ? 12 11? ? ? ? ? 11? ? ? f? ? ? a? ? ? ? 22 12? ? ? ? ? 12? ? ? f? ? ? a? ? ? ? 14 13? ? ? ? ? 13? ? ? f? ? ? b? ? ? ? 15 14? ? ? ? ? 14? ? ? f? ? ? b? ? ? ? 12 15? ? ? ? ? 15? ? ? f? ? ? b? ? ? ? 18 16? ? ? ? ? 16? ? ? f? ? ? b? ? ? ? 22 -- View this message in context: http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html Sent from the R help mailing list archive at Nabble.com. ______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
HI, Another way: dat1<-read.table(text=" Observation Gender Dosage Alertness 1??????????? 1????? m????? a??????? 8 2??????????? 2????? m????? a??????? 12 3??????????? 3????? m????? a??????? 13 4??????????? 4????? m????? a??????? 12 5??????????? 5????? m????? b??????? 6 6??????????? 6????? m????? b??????? 7 7??????????? 7????? m????? b??????? 23 8??????????? 8????? m????? b??????? 14 9??????????? 9????? f????? a??????? 15 10????????? 10????? f????? a??????? 12 11????????? 11????? f????? a??????? 22 12????????? 12????? f????? a??????? 14 13????????? 13????? f????? b??????? 15 14????????? 14????? f????? b??????? 12 15????????? 15????? f????? b??????? 18 16????????? 16????? f????? b??????? 22 ",sep="",header=TRUE,stringsAsFactors=FALSE) tapply(dat1$Observation,list(dat1$Gender,dat1$Dosage),length) #? a b #f 4 4 #m 4 4 A.K. ----- Original Message ----- From: asafwe <asafw at wharton.upenn.edu> To: r-help at r-project.org Cc: Sent: Sunday, October 21, 2012 10:02 PM Subject: [R] Counting duplicates in a dataframe Hello, I am looking at a two-way ANOVA dataset, and would like to count the rows in the dataframe with the same level of the first factor ("Gender") and the second factor ("Dosage"). In other words, I am interested in the number of observations per each "cell" in a (not necessarily balanced) two-way layout. What is the simplest way to do this? Thanks a lot, Asaf ? Observation Gender Dosage Alertness 1? ? ? ? ? ? 1? ? ? m? ? ? a? ? ? ? 8 2? ? ? ? ? ? 2? ? ? m? ? ? a? ? ? ? 12 3? ? ? ? ? ? 3? ? ? m? ? ? a? ? ? ? 13 4? ? ? ? ? ? 4? ? ? m? ? ? a? ? ? ? 12 5? ? ? ? ? ? 5? ? ? m? ? ? b? ? ? ? 6 6? ? ? ? ? ? 6? ? ? m? ? ? b? ? ? ? 7 7? ? ? ? ? ? 7? ? ? m? ? ? b? ? ? ? 23 8? ? ? ? ? ? 8? ? ? m? ? ? b? ? ? ? 14 9? ? ? ? ? ? 9? ? ? f? ? ? a? ? ? ? 15 10? ? ? ? ? 10? ? ? f? ? ? a? ? ? ? 12 11? ? ? ? ? 11? ? ? f? ? ? a? ? ? ? 22 12? ? ? ? ? 12? ? ? f? ? ? a? ? ? ? 14 13? ? ? ? ? 13? ? ? f? ? ? b? ? ? ? 15 14? ? ? ? ? 14? ? ? f? ? ? b? ? ? ? 12 15? ? ? ? ? 15? ? ? f? ? ? b? ? ? ? 18 16? ? ? ? ? 16? ? ? f? ? ? b? ? ? ? 22 -- View this message in context: http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html Sent from the R help mailing list archive at Nabble.com. ______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
On Oct 22, 2012, at 7:48 AM, arun wrote:> HI, > Another way: > dat1<-read.table(text=" > Observation Gender Dosage Alertness > 1 1 m a 8 > 2 2 m a 12 > 3 3 m a 13 > 4 4 m a 12 > 5 5 m b 6 > 6 6 m b 7 > 7 7 m b 23 > 8 8 m b 14 > 9 9 f a 15 > 10 10 f a 12 > 11 11 f a 22 > 12 12 f a 14 > 13 13 f b 15 > 14 14 f b 12 > 15 15 f b 18 > 16 16 f b 22 > ",sep="",header=TRUE,stringsAsFactors=FALSE) > > > tapply(dat1$Observation,list(dat1$Gender,dat1$Dosage),length) > # a b > #f 4 4 > #m 4 4How is that different than: table(dat1$Gender, dat1$Dosage) -- David.> A.K. > > ----- Original Message ----- > From: asafwe <asafw at wharton.upenn.edu> > To: r-help at r-project.org > Cc: > Sent: Sunday, October 21, 2012 10:02 PM > Subject: [R] Counting duplicates in a dataframe > > Hello, > > I am looking at a two-way ANOVA dataset, and would like to count the rows in > the dataframe with the same level of the first factor ("Gender") and the > second factor ("Dosage"). In other words, I am interested in the number of > observations per each "cell" in a (not necessarily balanced) two-way layout. > > What is the simplest way to do this? > > Thanks a lot, > > Asaf > > Observation Gender Dosage Alertness > 1 1 m a 8 > 2 2 m a 12 > 3 3 m a 13 > 4 4 m a 12 > 5 5 m b 6 > 6 6 m b 7 > 7 7 m b 23 > 8 8 m b 14 > 9 9 f a 15 > 10 10 f a 12 > 11 11 f a 22 > 12 12 f a 14 > 13 13 f b 15 > 14 14 f b 12 > 15 15 f b 18 > 16 16 f b 22 > > > > -- > View this message in context: http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html > Sent from the R help mailing list archive at Nabble.com. > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.David Winsemius, MD Alameda, CA, USA
Thank you all; David -- this is, in fact, exactly what I need! Asaf -- View this message in context: http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954p4647075.html Sent from the R help mailing list archive at Nabble.com.
Hi, ?ctab() also gets the same result. library(catspec) dat2<-within(dat1,{Gender<-factor(Gender);Dosage<-factor(Dosage)}) ?ctab(dat2$Gender,dat2$Dosage) #?? a b #????? #f? 4 4 #m? 4 4 A.K. ----- Original Message ----- From: David Winsemius <dwinsemius at comcast.net> To: arun <smartpink111 at yahoo.com> Cc: asafwe <asafw at wharton.upenn.edu>; R help <r-help at r-project.org> Sent: Monday, October 22, 2012 5:26 PM Subject: Re: [R] Counting duplicates in a dataframe On Oct 22, 2012, at 7:48 AM, arun wrote:> HI, > Another way: > dat1<-read.table(text=" > Observation Gender Dosage Alertness > 1? ? ? ? ? ? 1? ? ? m? ? ? a? ? ? ? 8 > 2? ? ? ? ? ? 2? ? ? m? ? ? a? ? ? ? 12 > 3? ? ? ? ? ? 3? ? ? m? ? ? a? ? ? ? 13 > 4? ? ? ? ? ? 4? ? ? m? ? ? a? ? ? ? 12 > 5? ? ? ? ? ? 5? ? ? m? ? ? b? ? ? ? 6 > 6? ? ? ? ? ? 6? ? ? m? ? ? b? ? ? ? 7 > 7? ? ? ? ? ? 7? ? ? m? ? ? b? ? ? ? 23 > 8? ? ? ? ? ? 8? ? ? m? ? ? b? ? ? ? 14 > 9? ? ? ? ? ? 9? ? ? f? ? ? a? ? ? ? 15 > 10? ? ? ? ? 10? ? ? f? ? ? a? ? ? ? 12 > 11? ? ? ? ? 11? ? ? f? ? ? a? ? ? ? 22 > 12? ? ? ? ? 12? ? ? f? ? ? a? ? ? ? 14 > 13? ? ? ? ? 13? ? ? f? ? ? b? ? ? ? 15 > 14? ? ? ? ? 14? ? ? f? ? ? b? ? ? ? 12 > 15? ? ? ? ? 15? ? ? f? ? ? b? ? ? ? 18 > 16? ? ? ? ? 16? ? ? f? ? ? b? ? ? ? 22 > ",sep="",header=TRUE,stringsAsFactors=FALSE) > > > tapply(dat1$Observation,list(dat1$Gender,dat1$Dosage),length) > #? a b > #f 4 4 > #m 4 4How is that different than: table(dat1$Gender, dat1$Dosage) -- David.> A.K. > > ----- Original Message ----- > From: asafwe <asafw at wharton.upenn.edu> > To: r-help at r-project.org > Cc: > Sent: Sunday, October 21, 2012 10:02 PM > Subject: [R] Counting duplicates in a dataframe > > Hello, > > I am looking at a two-way ANOVA dataset, and would like to count the rows in > the dataframe with the same level of the first factor ("Gender") and the > second factor ("Dosage"). In other words, I am interested in the number of > observations per each "cell" in a (not necessarily balanced) two-way layout. > > What is the simplest way to do this? > > Thanks a lot, > > Asaf > >? ? Observation Gender Dosage Alertness > 1? ? ? ? ? ? 1? ? ? m? ? ? a? ? ? ? 8 > 2? ? ? ? ? ? 2? ? ? m? ? ? a? ? ? ? 12 > 3? ? ? ? ? ? 3? ? ? m? ? ? a? ? ? ? 13 > 4? ? ? ? ? ? 4? ? ? m? ? ? a? ? ? ? 12 > 5? ? ? ? ? ? 5? ? ? m? ? ? b? ? ? ? 6 > 6? ? ? ? ? ? 6? ? ? m? ? ? b? ? ? ? 7 > 7? ? ? ? ? ? 7? ? ? m? ? ? b? ? ? ? 23 > 8? ? ? ? ? ? 8? ? ? m? ? ? b? ? ? ? 14 > 9? ? ? ? ? ? 9? ? ? f? ? ? a? ? ? ? 15 > 10? ? ? ? ? 10? ? ? f? ? ? a? ? ? ? 12 > 11? ? ? ? ? 11? ? ? f? ? ? a? ? ? ? 22 > 12? ? ? ? ? 12? ? ? f? ? ? a? ? ? ? 14 > 13? ? ? ? ? 13? ? ? f? ? ? b? ? ? ? 15 > 14? ? ? ? ? 14? ? ? f? ? ? b? ? ? ? 12 > 15? ? ? ? ? 15? ? ? f? ? ? b? ? ? ? 18 > 16? ? ? ? ? 16? ? ? f? ? ? b? ? ? ? 22 > > > > -- > View this message in context: http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html > Sent from the R help mailing list archive at Nabble.com. > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.David Winsemius, MD Alameda, CA, USA