Hi Weijia,
Try this:
dat1<-read.table(text="
ID? AGE GENER LDL
1??? 25??? M????????? 137
1??? 25??? M????????? 125
2??? 34??? F??????????? 108
3??? 30??? F????????? 150
3??? 30??? F??????????? 147
3??? 30??? F??????????? 165
",sep="",header=TRUE,stringsAsFactors=FALSE)
subset(dat1,!is.na(match(AGE,AGE[duplicated(AGE)])))
? #ID AGE GENER LDL
#1? 1? 25???? M 137
#2? 1? 25???? M 125
#4? 3? 30???? F 150
#5? 3? 30???? F 147
#6? 3? 30???? F 165
#I guess this helps you.
A.K.
----- Original Message -----
From: wwang.nyu <wwang.nyu at gmail.com>
To: arun <smartpink111 at yahoo.com>
Cc:
Sent: Friday, August 3, 2012 11:59 AM
Subject: Re: [R] all duplicated wanted
Hi, A.K.
Thanks for your example, that is enlighten.
What I am concerned is, to show all duplicates.
'duplicated' only shows me the second duplicate of each ID.
For example, I have these
ID? AGE GENER LDL
1? ? 25? ? M? ? ? ? ? 137
1? ? 25? ? M? ? ? ? ? 125
2? ? 34? ? F? ? ? ? ? ? 108
3? ? 30? ? F? ? ? ? ? 150
3? ? 30? ? F? ? ? ? ? ? 147
3? ? 30? ? F? ? ? ? ? ? 165
I want this:
ID? AGE GENER LDL
1? ? 25? ? M? ? ? ? ? 137
1? ? 25? ? M? ? ? ? ? 125
3? ? 30? ? F? ? ? ? ? 150
3? ? 30? ? F? ? ? ? ? ? 147
3? ? 30? ? F? ? ? ? ? ? 165
Instead of having this, which 'duplicated' will give me:
ID? AGE GENER LDL
1? ? 25? ? M? ? ? ? ? 125
3? ? 30? ? F? ? ? ? ? ? 165
Can you help me on this?
Thanks
Weijia Wang
On Aug 3, 2012, at 9:20 AM, arun <smartpink111 at yahoo.com> wrote:
>
>
> HI,
>
> If you want both the duplicated IDs and nonduplicated IDs to be printed,
>? ID <- sample(1:10, 10, replace=TRUE)
>? ID
> # [1] 3 7 5 8 1 5 4 6 7 2
> ID[!duplicated(ID)]
> #[1] 3 7 5 8 1 4 6 2
>
>?
ID2<-c(4,4,4,3,4,1,2,5,7,4,3,2,5,9,8,12,"A1","A2","A1","B1")
>? ID2[!duplicated(ID2)]
> # [1] "4"? "3"? "1"? "2"?
"5"? "7"? "9"? "8"? "12"
"A1" "A2" "B1"
> ID2<-c(4,4,4,3,4,1,2,5,7,4,3,2,5,9,8,12,12,1,3,5,8)
> # ID2[!duplicated(ID2)]
> [1]? 4? 3? 1? 2? 5? 7? 9? 8 12
>
>
> I hope this is what you wanted.
>
> A.K.
>
>
>
>
> ----- Original Message -----
> From: Weijia Wang <wwang.nyu at gmail.com>
> To: r-help at r-project.org
> Cc:
> Sent: Friday, August 3, 2012 3:06 AM
> Subject: [R] all duplicated wanted
>
> Hi,
>
> Has anyone been able to figure out how to print all duplicated
observations?
>
> I have a dataset, with patients ID, and other lab records.
>
> Some patients have multiple lab records, but 'duplicated' ID will
only show me the duplicates, not the original observation.
>
> How can I print both the original one and the duplicates?
>
> Thanks
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>