thr3ads.net - R help - [R] Maintaining repeated ID numbers when transposing with reshape [Aug 2008]

If this information is useful, please help other people find it:
Share via:

jcarmichael

2008-Aug-25 21:15 UTC

[R] Maintaining repeated ID numbers when transposing with reshape

I have a dataset in "long" format that looks something like this:

ID   TEST    RESULT
1       A          17
1       A          12
1       B          15
1       C          12
2       B           8
2       B           9

Now what I would like to do is transpose it like so:

ID    TEST A    TEST B    TEST C
1         17           15          12
1         12            .            .
2          .             8            .
2          .             9            .

When I try:

reshape(mydata, v.names="result",
idvar="id",timevar="test",
direction="wide")

It gives me only the first occurrence of each test for each subject.  How
can I transpose my dataset in this way without losing information about
repeated tests?

Any help or guidance would be appreciated!  Thanks!
-- 
View this message in context:
http://www.nabble.com/Maintaining-repeated-ID-numbers-when-transposing-with-reshape-tp19151853p19151853.html
Sent from the R help mailing list archive at Nabble.com.

Adaikalavan Ramasamy

2008-Aug-25 22:05 UTC

head link

[R] Maintaining repeated ID numbers when transposing with reshape

There might be a more elegant way of doing this but here is a way of 
doing it without reshape().

    df <- data.frame( ID=c(1,1,1,1,2,2),
                     
TEST=c("A","A","B","C","B","B"),
                      RESULT=c(17,12,15,12,8,9) )

    df.s <- split( df, df$ID )

    out  <- sapply( df.s, function(m)
                    tapply( m$RESULT, m$TEST, paste, collapse="," ) )

    t(out)

      A       B     C
    1 "17,12" "15"  "12"
    2 NA      "8,9" NA

Not the same output as you wanted. This makes more sense unless you have 
a reason to priotize 17 instead of 12 in the first row.

Regards, Adai


jcarmichael wrote:> I have a dataset in "long" format that looks something like this:
> 
> ID   TEST    RESULT
> 1       A          17
> 1       A          12
> 1       B          15
> 1       C          12
> 2       B           8
> 2       B           9
> 
> Now what I would like to do is transpose it like so:
> 
> ID    TEST A    TEST B    TEST C
> 1         17           15          12
> 1         12            .            .
> 2          .             8            .
> 2          .             9            .
> 
> When I try:
> 
> reshape(mydata, v.names="result",
idvar="id",timevar="test",
> direction="wide")
> 
> It gives me only the first occurrence of each test for each subject.  How
> can I transpose my dataset in this way without losing information about
> repeated tests?
> 
> Any help or guidance would be appreciated!  Thanks!

jcarmichael

2008-Aug-26 17:47 UTC

head link

[R] Maintaining repeated ID numbers when transposing with reshape

Thank you for your suggestion, I will play around with it. I guess my concern
is that I need each test result to occupy its own "cell" rather than
have
one or more in the same row.


Adaikalavan Ramasamy-2 wrote:> 
> There might be a more elegant way of doing this but here is a way of 
> doing it without reshape().
> 
>     df <- data.frame( ID=c(1,1,1,1,2,2),
>                      
TEST=c("A","A","B","C","B","B"),
>                       RESULT=c(17,12,15,12,8,9) )
> 
>     df.s <- split( df, df$ID )
> 
>     out  <- sapply( df.s, function(m)
>                     tapply( m$RESULT, m$TEST, paste, collapse=","
) )
> 
>     t(out)
> 
>       A       B     C
>     1 "17,12" "15"  "12"
>     2 NA      "8,9" NA
> 
> Not the same output as you wanted. This makes more sense unless you have 
> a reason to priotize 17 instead of 12 in the first row.
> 
> Regards, Adai
> 
> 
> jcarmichael wrote:
>> I have a dataset in "long" format that looks something like
this:
>> 
>> ID   TEST    RESULT
>> 1       A          17
>> 1       A          12
>> 1       B          15
>> 1       C          12
>> 2       B           8
>> 2       B           9
>> 
>> Now what I would like to do is transpose it like so:
>> 
>> ID    TEST A    TEST B    TEST C
>> 1         17           15          12
>> 1         12            .            .
>> 2          .             8            .
>> 2          .             9            .
>> 
>> When I try:
>> 
>> reshape(mydata, v.names="result",
idvar="id",timevar="test",
>> direction="wide")
>> 
>> It gives me only the first occurrence of each test for each subject. 
How
>> can I transpose my dataset in this way without losing information about
>> repeated tests?
>> 
>> Any help or guidance would be appreciated!  Thanks!
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
> 
> 
-- 
View this message in context:
http://www.nabble.com/Maintaining-repeated-ID-numbers-when-transposing-with-reshape-tp19151853p19166910.html
Sent from the R help mailing list archive at Nabble.com.

Adaikalavan Ramasamy

2008-Aug-28 17:13 UTC

head link

[R] Maintaining repeated ID numbers when transposing with reshape

Not the prettiest code but it returns what you want. Might be slow for 
large dataframes.

df <- data.frame( ID=c(1,1,1,1,2,2),
                  
TEST=c("A","A","B","C","B","B"),
                   RESULT=c(17,12,15,12,8,9) )


big.out <- list(NULL)

for( uID in unique(df$ID) ){
  m <- df[ df$ID == uID, , drop=FALSE ]
  run.order <- unlist(sapply( table(m$TEST), function(x) if(x > 0) 1:x) )
  m <- cbind( m, run.order=run.order )

  nr <- max(run.order)
  out <- matrix( nr=nr, nc=nlevels(m$TEST),
                 dimnames=list( rep(uID, nr), levels(m$TEST) ))

  for(i in 1:nrow(m)) out[ m$run.order[i], m$TEST[i] ] <- m$RESULT[i]
  big.out[[uID]] <- out
}

do.call( "rbind", big.out )

    A  B  C
1 17 15 12
1 12 NA NA
2 NA  8 NA
2 NA  9 NA


Regards, Adai


jcarmichael wrote:> Thank you for your suggestion, I will play around with it. I guess my
concern
> is that I need each test result to occupy its own "cell" rather
than have
> one or more in the same row.
> 
> 
> Adaikalavan Ramasamy-2 wrote:
>> There might be a more elegant way of doing this but here is a way of 
>> doing it without reshape().
>>
>>     df <- data.frame( ID=c(1,1,1,1,2,2),
>>                      
TEST=c("A","A","B","C","B","B"),
>>                       RESULT=c(17,12,15,12,8,9) )
>>
>>     df.s <- split( df, df$ID )
>>
>>     out  <- sapply( df.s, function(m)
>>                     tapply( m$RESULT, m$TEST, paste,
collapse="," ) )
>>
>>     t(out)
>>
>>       A       B     C
>>     1 "17,12" "15"  "12"
>>     2 NA      "8,9" NA
>>
>> Not the same output as you wanted. This makes more sense unless you
have
>> a reason to priotize 17 instead of 12 in the first row.
>>
>> Regards, Adai
>>
>>
>> jcarmichael wrote:
>>> I have a dataset in "long" format that looks something
like this:
>>>
>>> ID   TEST    RESULT
>>> 1       A          17
>>> 1       A          12
>>> 1       B          15
>>> 1       C          12
>>> 2       B           8
>>> 2       B           9
>>>
>>> Now what I would like to do is transpose it like so:
>>>
>>> ID    TEST A    TEST B    TEST C
>>> 1         17           15          12
>>> 1         12            .            .
>>> 2          .             8            .
>>> 2          .             9            .
>>>
>>> When I try:
>>>
>>> reshape(mydata, v.names="result",
idvar="id",timevar="test",
>>> direction="wide")
>>>
>>> It gives me only the first occurrence of each test for each
subject.  How
>>> can I transpose my dataset in this way without losing information
about
>>> repeated tests?
>>>
>>> Any help or guidance would be appreciated!  Thanks!
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>>
>

Apparently Analagous Threads

Search for more possibly parallel threads

R help - Aug 2008 - Maintaining repeated ID numbers when transposing with reshape

[R] Maintaining repeated ID numbers when transposing with reshape

[R] Maintaining repeated ID numbers when transposing with reshape

[R] Maintaining repeated ID numbers when transposing with reshape

[R] Maintaining repeated ID numbers when transposing with reshape

Apparently Analagous Threads