thr3ads.net - R help - [R] Counting duplicates in a dataframe [Oct 2012]

If this information is useful, please help other people find it:
Share via:

asafwe

2012-Oct-22 02:02 UTC

[R] Counting duplicates in a dataframe

Hello,

I am looking at a two-way ANOVA dataset, and would like to count the rows in
the dataframe with the same level of the first factor ("Gender") and
the
second factor ("Dosage"). In other words, I am interested in the
number of
observations per each "cell" in a (not necessarily balanced) two-way
layout.

What is the simplest way to do this?

Thanks a lot,

Asaf

   Observation Gender Dosage Alertness
1            1      m      a         8
2            2      m      a        12
3            3      m      a        13
4            4      m      a        12
5            5      m      b         6
6            6      m      b         7
7            7      m      b        23
8            8      m      b        14
9            9      f      a        15
10          10      f      a        12
11          11      f      a        22
12          12      f      a        14
13          13      f      b        15
14          14      f      b        12
15          15      f      b        18
16          16      f      b        22



--
View this message in context:
http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html
Sent from the R help mailing list archive at Nabble.com.

PIKAL Petr

2012-Oct-22 08:55 UTC

head link

[R] Counting duplicates in a dataframe

Hi
> -----Original Message-----
> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-
> project.org] On Behalf Of asafwe
> Sent: Monday, October 22, 2012 4:02 AM
> To: r-help at r-project.org
> Subject: [R] Counting duplicates in a dataframe
> 
> Hello,
> 
> I am looking at a two-way ANOVA dataset, and would like to count the
> rows in the dataframe with the same level of the first factor
> ("Gender") and the second factor ("Dosage"). In other
words, I am
> interested in the number of observations per each "cell" in a
(not
> necessarily balanced) two-way layout.
How is it realated with duplicates?

Do you want something like that?
xtabs(~Gender+Dosage, data=some.data.frame)

Regards
Petr
> 
> What is the simplest way to do this?
> 
> Thanks a lot,
> 
> Asaf
> 
>    Observation Gender Dosage Alertness
> 1            1      m      a         8
> 2            2      m      a        12
> 3            3      m      a        13
> 4            4      m      a        12
> 5            5      m      b         6
> 6            6      m      b         7
> 7            7      m      b        23
> 8            8      m      b        14
> 9            9      f      a        15
> 10          10      f      a        12
> 11          11      f      a        22
> 12          12      f      a        14
> 13          13      f      b        15
> 14          14      f      b        12
> 15          15      f      b        18
> 16          16      f      b        22
> 
> 
> 
> --
> View this message in context: http://r.789695.n4.nabble.com/Counting-
> duplicates-in-a-dataframe-tp4646954.html
> Sent from the R help mailing list archive at Nabble.com.
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-
> guide.html
> and provide commented, minimal, self-contained, reproducible code.

arun

2012-Oct-22 12:33 UTC

head link

[R] Counting duplicates in a dataframe

Hi,
try this:
dat1<-read.table(text="
Observation Gender Dosage Alertness
1??????????? 1????? m????? a??????? 8
2??????????? 2????? m????? a??????? 12
3??????????? 3????? m????? a??????? 13
4??????????? 4????? m????? a??????? 12
5??????????? 5????? m????? b??????? 6
6??????????? 6????? m????? b??????? 7
7??????????? 7????? m????? b??????? 23
8??????????? 8????? m????? b??????? 14
9??????????? 9????? f????? a??????? 15
10????????? 10????? f????? a??????? 12
11????????? 11????? f????? a??????? 22
12????????? 12????? f????? a??????? 14
13????????? 13????? f????? b??????? 15
14????????? 14????? f????? b??????? 12
15????????? 15????? f????? b??????? 18
16????????? 16????? f????? b??????? 22
",sep="",header=TRUE,stringsAsFactors=FALSE)

library(reshape)
?cast(dat1,Gender~Dosage,length)
#? Gender a b
#1????? f 4 4
#2????? m 4 4
A.K.





----- Original Message -----
From: asafwe <asafw at wharton.upenn.edu>
To: r-help at r-project.org
Cc: 
Sent: Sunday, October 21, 2012 10:02 PM
Subject: [R] Counting duplicates in a dataframe

Hello,

I am looking at a two-way ANOVA dataset, and would like to count the rows in
the dataframe with the same level of the first factor ("Gender") and
the
second factor ("Dosage"). In other words, I am interested in the
number of
observations per each "cell" in a (not necessarily balanced) two-way
layout.

What is the simplest way to do this?

Thanks a lot,

Asaf

?  Observation Gender Dosage Alertness
1? ? ? ? ? ? 1? ? ? m? ? ? a? ? ? ?  8
2? ? ? ? ? ? 2? ? ? m? ? ? a? ? ? ? 12
3? ? ? ? ? ? 3? ? ? m? ? ? a? ? ? ? 13
4? ? ? ? ? ? 4? ? ? m? ? ? a? ? ? ? 12
5? ? ? ? ? ? 5? ? ? m? ? ? b? ? ? ?  6
6? ? ? ? ? ? 6? ? ? m? ? ? b? ? ? ?  7
7? ? ? ? ? ? 7? ? ? m? ? ? b? ? ? ? 23
8? ? ? ? ? ? 8? ? ? m? ? ? b? ? ? ? 14
9? ? ? ? ? ? 9? ? ? f? ? ? a? ? ? ? 15
10? ? ? ? ? 10? ? ? f? ? ? a? ? ? ? 12
11? ? ? ? ? 11? ? ? f? ? ? a? ? ? ? 22
12? ? ? ? ? 12? ? ? f? ? ? a? ? ? ? 14
13? ? ? ? ? 13? ? ? f? ? ? b? ? ? ? 15
14? ? ? ? ? 14? ? ? f? ? ? b? ? ? ? 12
15? ? ? ? ? 15? ? ? f? ? ? b? ? ? ? 18
16? ? ? ? ? 16? ? ? f? ? ? b? ? ? ? 22



--
View this message in context:
http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

arun

2012-Oct-22 14:48 UTC

head link

[R] Counting duplicates in a dataframe

HI,
Another way:
dat1<-read.table(text="
Observation Gender Dosage Alertness
1??????????? 1????? m????? a??????? 8
2??????????? 2????? m????? a??????? 12
3??????????? 3????? m????? a??????? 13
4??????????? 4????? m????? a??????? 12
5??????????? 5????? m????? b??????? 6
6??????????? 6????? m????? b??????? 7
7??????????? 7????? m????? b??????? 23
8??????????? 8????? m????? b??????? 14
9??????????? 9????? f????? a??????? 15
10????????? 10????? f????? a??????? 12
11????????? 11????? f????? a??????? 22
12????????? 12????? f????? a??????? 14
13????????? 13????? f????? b??????? 15
14????????? 14????? f????? b??????? 12
15????????? 15????? f????? b??????? 18
16????????? 16????? f????? b??????? 22
",sep="",header=TRUE,stringsAsFactors=FALSE)


tapply(dat1$Observation,list(dat1$Gender,dat1$Dosage),length)
#? a b
#f 4 4
#m 4 4
A.K.

----- Original Message -----
From: asafwe <asafw at wharton.upenn.edu>
To: r-help at r-project.org
Cc: 
Sent: Sunday, October 21, 2012 10:02 PM
Subject: [R] Counting duplicates in a dataframe

Hello,

I am looking at a two-way ANOVA dataset, and would like to count the rows in
the dataframe with the same level of the first factor ("Gender") and
the
second factor ("Dosage"). In other words, I am interested in the
number of
observations per each "cell" in a (not necessarily balanced) two-way
layout.

What is the simplest way to do this?

Thanks a lot,

Asaf

?  Observation Gender Dosage Alertness
1? ? ? ? ? ? 1? ? ? m? ? ? a? ? ? ?  8
2? ? ? ? ? ? 2? ? ? m? ? ? a? ? ? ? 12
3? ? ? ? ? ? 3? ? ? m? ? ? a? ? ? ? 13
4? ? ? ? ? ? 4? ? ? m? ? ? a? ? ? ? 12
5? ? ? ? ? ? 5? ? ? m? ? ? b? ? ? ?  6
6? ? ? ? ? ? 6? ? ? m? ? ? b? ? ? ?  7
7? ? ? ? ? ? 7? ? ? m? ? ? b? ? ? ? 23
8? ? ? ? ? ? 8? ? ? m? ? ? b? ? ? ? 14
9? ? ? ? ? ? 9? ? ? f? ? ? a? ? ? ? 15
10? ? ? ? ? 10? ? ? f? ? ? a? ? ? ? 12
11? ? ? ? ? 11? ? ? f? ? ? a? ? ? ? 22
12? ? ? ? ? 12? ? ? f? ? ? a? ? ? ? 14
13? ? ? ? ? 13? ? ? f? ? ? b? ? ? ? 15
14? ? ? ? ? 14? ? ? f? ? ? b? ? ? ? 12
15? ? ? ? ? 15? ? ? f? ? ? b? ? ? ? 18
16? ? ? ? ? 16? ? ? f? ? ? b? ? ? ? 22



--
View this message in context:
http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

David Winsemius

2012-Oct-22 21:26 UTC

head link

[R] Counting duplicates in a dataframe

On Oct 22, 2012, at 7:48 AM, arun wrote:
> HI,
> Another way:
> dat1<-read.table(text="
> Observation Gender Dosage Alertness
> 1            1      m      a        8
> 2            2      m      a        12
> 3            3      m      a        13
> 4            4      m      a        12
> 5            5      m      b        6
> 6            6      m      b        7
> 7            7      m      b        23
> 8            8      m      b        14
> 9            9      f      a        15
> 10          10      f      a        12
> 11          11      f      a        22
> 12          12      f      a        14
> 13          13      f      b        15
> 14          14      f      b        12
> 15          15      f      b        18
> 16          16      f      b        22
> ",sep="",header=TRUE,stringsAsFactors=FALSE)
> 
> 
> tapply(dat1$Observation,list(dat1$Gender,dat1$Dosage),length)
> #  a b
> #f 4 4
> #m 4 4
How is that different than:

table(dat1$Gender, dat1$Dosage)

-- 
David.
> A.K.
> 
> ----- Original Message -----
> From: asafwe <asafw at wharton.upenn.edu>
> To: r-help at r-project.org
> Cc: 
> Sent: Sunday, October 21, 2012 10:02 PM
> Subject: [R] Counting duplicates in a dataframe
> 
> Hello,
> 
> I am looking at a two-way ANOVA dataset, and would like to count the rows
in
> the dataframe with the same level of the first factor ("Gender")
and the
> second factor ("Dosage"). In other words, I am interested in the
number of
> observations per each "cell" in a (not necessarily balanced)
two-way layout.
> 
> What is the simplest way to do this?
> 
> Thanks a lot,
> 
> Asaf
> 
>    Observation Gender Dosage Alertness
> 1            1      m      a         8
> 2            2      m      a        12
> 3            3      m      a        13
> 4            4      m      a        12
> 5            5      m      b         6
> 6            6      m      b         7
> 7            7      m      b        23
> 8            8      m      b        14
> 9            9      f      a        15
> 10          10      f      a        12
> 11          11      f      a        22
> 12          12      f      a        14
> 13          13      f      b        15
> 14          14      f      b        12
> 15          15      f      b        18
> 16          16      f      b        22
> 
> 
> 
> --
> View this message in context:
http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html
> Sent from the R help mailing list archive at Nabble.com.
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
> 
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
David Winsemius, MD
Alameda, CA, USA

asafwe

2012-Oct-22 22:19 UTC

head link

[R] Counting duplicates in a dataframe

Thank you all; David -- this is, in fact, exactly what I need!

Asaf



--
View this message in context:
http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954p4647075.html
Sent from the R help mailing list archive at Nabble.com.

arun

2012-Oct-22 23:53 UTC

head link

[R] Counting duplicates in a dataframe

Hi,
?ctab() also gets the same result.
library(catspec)
dat2<-within(dat1,{Gender<-factor(Gender);Dosage<-factor(Dosage)})
?ctab(dat2$Gender,dat2$Dosage)
#?? a b
#????? 
#f? 4 4
#m? 4 4
A.K.



----- Original Message -----
From: David Winsemius <dwinsemius at comcast.net>
To: arun <smartpink111 at yahoo.com>
Cc: asafwe <asafw at wharton.upenn.edu>; R help <r-help at
r-project.org>
Sent: Monday, October 22, 2012 5:26 PM
Subject: Re: [R] Counting duplicates in a dataframe


On Oct 22, 2012, at 7:48 AM, arun wrote:
> HI,
> Another way:
> dat1<-read.table(text="
> Observation Gender Dosage Alertness
> 1? ? ? ? ? ? 1? ? ? m? ? ? a? ? ? ? 8
> 2? ? ? ? ? ? 2? ? ? m? ? ? a? ? ? ? 12
> 3? ? ? ? ? ? 3? ? ? m? ? ? a? ? ? ? 13
> 4? ? ? ? ? ? 4? ? ? m? ? ? a? ? ? ? 12
> 5? ? ? ? ? ? 5? ? ? m? ? ? b? ? ? ? 6
> 6? ? ? ? ? ? 6? ? ? m? ? ? b? ? ? ? 7
> 7? ? ? ? ? ? 7? ? ? m? ? ? b? ? ? ? 23
> 8? ? ? ? ? ? 8? ? ? m? ? ? b? ? ? ? 14
> 9? ? ? ? ? ? 9? ? ? f? ? ? a? ? ? ? 15
> 10? ? ? ? ? 10? ? ? f? ? ? a? ? ? ? 12
> 11? ? ? ? ? 11? ? ? f? ? ? a? ? ? ? 22
> 12? ? ? ? ? 12? ? ? f? ? ? a? ? ? ? 14
> 13? ? ? ? ? 13? ? ? f? ? ? b? ? ? ? 15
> 14? ? ? ? ? 14? ? ? f? ? ? b? ? ? ? 12
> 15? ? ? ? ? 15? ? ? f? ? ? b? ? ? ? 18
> 16? ? ? ? ? 16? ? ? f? ? ? b? ? ? ? 22
> ",sep="",header=TRUE,stringsAsFactors=FALSE)
> 
> 
> tapply(dat1$Observation,list(dat1$Gender,dat1$Dosage),length)
> #? a b
> #f 4 4
> #m 4 4
How is that different than:

table(dat1$Gender, dat1$Dosage)

-- 
David.
> A.K.
> 
> ----- Original Message -----
> From: asafwe <asafw at wharton.upenn.edu>
> To: r-help at r-project.org
> Cc: 
> Sent: Sunday, October 21, 2012 10:02 PM
> Subject: [R] Counting duplicates in a dataframe
> 
> Hello,
> 
> I am looking at a two-way ANOVA dataset, and would like to count the rows
in
> the dataframe with the same level of the first factor ("Gender")
and the
> second factor ("Dosage"). In other words, I am interested in the
number of
> observations per each "cell" in a (not necessarily balanced)
two-way layout.
> 
> What is the simplest way to do this?
> 
> Thanks a lot,
> 
> Asaf
> 
>? ? Observation Gender Dosage Alertness
> 1? ? ? ? ? ? 1? ? ? m? ? ? a? ? ? ?  8
> 2? ? ? ? ? ? 2? ? ? m? ? ? a? ? ? ? 12
> 3? ? ? ? ? ? 3? ? ? m? ? ? a? ? ? ? 13
> 4? ? ? ? ? ? 4? ? ? m? ? ? a? ? ? ? 12
> 5? ? ? ? ? ? 5? ? ? m? ? ? b? ? ? ?  6
> 6? ? ? ? ? ? 6? ? ? m? ? ? b? ? ? ?  7
> 7? ? ? ? ? ? 7? ? ? m? ? ? b? ? ? ? 23
> 8? ? ? ? ? ? 8? ? ? m? ? ? b? ? ? ? 14
> 9? ? ? ? ? ? 9? ? ? f? ? ? a? ? ? ? 15
> 10? ? ? ? ? 10? ? ? f? ? ? a? ? ? ? 12
> 11? ? ? ? ? 11? ? ? f? ? ? a? ? ? ? 22
> 12? ? ? ? ? 12? ? ? f? ? ? a? ? ? ? 14
> 13? ? ? ? ? 13? ? ? f? ? ? b? ? ? ? 15
> 14? ? ? ? ? 14? ? ? f? ? ? b? ? ? ? 12
> 15? ? ? ? ? 15? ? ? f? ? ? b? ? ? ? 18
> 16? ? ? ? ? 16? ? ? f? ? ? b? ? ? ? 22
> 
> 
> 
> --
> View this message in context:
http://r.789695.n4.nabble.com/Counting-duplicates-in-a-dataframe-tp4646954.html
> Sent from the R help mailing list archive at Nabble.com.
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
> 
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
David Winsemius, MD
Alameda, CA, USA

R help - Oct 2012 - Counting duplicates in a dataframe

[R] Counting duplicates in a dataframe

[R] Counting duplicates in a dataframe

[R] Counting duplicates in a dataframe

[R] Counting duplicates in a dataframe

[R] Counting duplicates in a dataframe

[R] Counting duplicates in a dataframe

[R] Counting duplicates in a dataframe