thr3ads.net - R help - [R] Read data in sequences [Apr 2010]

If this information is useful, please help other people find it:
Share via:

RockO

2010-Apr-09 15:25 UTC

[R] Read data in sequences

Dear R users,

I tried to find a solution in the search list, but I cannot find it. I would
like to read a .txt file with, let say, three variables, with two of which
have repeated values in a number a columns. 

An example:

The variables: Treat, x1, x2.
The values:
A 2.5 3.4 2.7 5.6 5.7 5.4 10.1 9.4 ...
B 5.3 5.4 6.5 7.5 1.3 4.5 10.5 4.1 ...
...

In the first column, the letters represent the variable "Treat", and
the
sequence of numbers on a same line represent pairs of values for "x1"
and
"x2".

In SAS, this type of dataset is easy to read using "@@" as in:
data a;
input Treat @ x1 x2 @@;

But I would like to know how to read it with R, to get rid of my addiction
to SAS.

Thank You,

Rock Ouimet
DRF-MRNF, Quebec
-- 
View this message in context:
http://n4.nabble.com/Read-data-in-sequences-tp1819487p1819487.html
Sent from the R help mailing list archive at Nabble.com.

Dieter Menne

2010-Apr-09 16:04 UTC

head link

[R] Read data in sequences

RockO wrote:> 
> I tried to find a solution in the search list, but I cannot find it. I
> would like to read a .txt file with, let say, three variables, with two of
> which have repeated values in a number a columns. 
> 
> The variables: Treat, x1, x2.
> The values:
> A 2.5 3.4 2.7 5.6 5.7 5.4 10.1 9.4 ...
> B 5.3 5.4 6.5 7.5 1.3 4.5 10.5 4.1 ...
> ...
> 
> In the first column, the letters represent the variable "Treat",
and the
> sequence of numbers on a same line represent pairs of values for
"x1" and
> "x2".
> 
> 
Looks like SAS is quite elegant here (don't kill me, I could not afford
using SAS, R has save my retirement fund).

I would first read it in "as usual", and do the reformatting later. 

library(reshape)
wide = read.table("wideseq.txt",sep=" ")
# renames columns
names(wide) = 
c("varname",rep(c("x1","x2"),ncol(wide)%/%2))
str(wide)
melt(wide) 

Now you have the long format, which is not exactly what you want, but
typically much more useful in R than the format you require. You might use
one of the function in package reshape to get your format.

Dieter











-- 
View this message in context:
http://n4.nabble.com/Read-data-in-sequences-tp1819487p1819559.html
Sent from the R help mailing list archive at Nabble.com.

Phil Spector

2010-Apr-09 17:16 UTC

head link

[R] Read data in sequences

Rock -
    Here's one way:
> x = textConnection('A 2.5 3.4 2.7 5.6 5.7 5.4 10.1 9.4
+ B 5.3 5.4 6.5 7.5 1.3 4.5 10.5 4.1')> dat = read.table(x)
> names(dat) =
c('grp','x1','x2','x3','x4','x5','x6','x7','x8')
>
reshape(dat,idvar='grp',varying=list(c('x1','x3','x5','x7'),+                                     
c('x2','x4','x6','x8')),
+                                 direction='long',timevar=NULL)
     grp   x1  x2
A.1   A  2.5 3.4
B.1   B  5.3 5.4
A.2   A  2.7 5.6
B.2   B  6.5 7.5
A.3   A  5.7 5.4
B.3   B  1.3 4.5
A.4   A 10.1 9.4
B.4   B 10.5 4.1

You could generalize the varying argument like this:

mkvarying = function(n)list(paste('x',seq(1,n,by=2),sep=''),
                             paste('x',seq(2,n,by=2),sep=''))

and use

reshape(dat,idvar='grp',varying=mkvarying(8),direction='long',timevar=NULL)


 					- Phil Spector
 					 Statistical Computing Facility
 					 Department of Statistics
 					 UC Berkeley
 					 spector at stat.berkeley.edu
On Fri, 9 Apr 2010, RockO wrote:
>
> Dear R users,
>
> I tried to find a solution in the search list, but I cannot find it. I
would
> like to read a .txt file with, let say, three variables, with two of which
> have repeated values in a number a columns.
>
> An example:
>
> The variables: Treat, x1, x2.
> The values:
> A 2.5 3.4 2.7 5.6 5.7 5.4 10.1 9.4 ...
> B 5.3 5.4 6.5 7.5 1.3 4.5 10.5 4.1 ...
> ...
>
> In the first column, the letters represent the variable "Treat",
and the
> sequence of numbers on a same line represent pairs of values for
"x1" and
> "x2".
>
> In SAS, this type of dataset is easy to read using "@@" as in:
> data a;
> input Treat @ x1 x2 @@;
>
> But I would like to know how to read it with R, to get rid of my addiction
> to SAS.
>
> Thank You,
>
> Rock Ouimet
> DRF-MRNF, Quebec
> -- 
> View this message in context:
http://n4.nabble.com/Read-data-in-sequences-tp1819487p1819487.html
> Sent from the R help mailing list archive at Nabble.com.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>

Gabor Grothendieck

2010-Apr-10 17:54 UTC

head link

[R] Read data in sequences

Try this.  The data portion DF[-1] is transposed forming length 2
columns composed of successive elements of DF rows.  This is then
transposed back and the id's added in:

# read in
Lines <- "A 2.5 3.4 2.7 5.6 5.7 5.4 10.1 9.4
 B 5.3 5.4 6.5 7.5 1.3 4.5 10.5 4.1"
DF <- read.table(textConnection(Lines))

# form required data frame
data.frame(id = DF[gl(2,4),1], t(matrix(t(DF[-1]), 2)))



On Fri, Apr 9, 2010 at 11:25 AM, RockO <rock.ouimet at gmail.com>
wrote:>
> Dear R users,
>
> I tried to find a solution in the search list, but I cannot find it. I
would
> like to read a .txt file with, let say, three variables, with two of which
> have repeated values in a number a columns.
>
> An example:
>
> The variables: Treat, x1, x2.
> The values:
> A 2.5 3.4 2.7 5.6 5.7 5.4 10.1 9.4 ...
> B 5.3 5.4 6.5 7.5 1.3 4.5 10.5 4.1 ...
> ...
>
> In the first column, the letters represent the variable "Treat",
and the
> sequence of numbers on a same line represent pairs of values for
"x1" and
> "x2".
>
> In SAS, this type of dataset is easy to read using "@@" as in:
> data a;
> input Treat @ x1 x2 @@;
>
> But I would like to know how to read it with R, to get rid of my addiction
> to SAS.
>
> Thank You,
>
> Rock Ouimet
> DRF-MRNF, Quebec
> --
> View this message in context:
http://n4.nabble.com/Read-data-in-sequences-tp1819487p1819487.html
> Sent from the R help mailing list archive at Nabble.com.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>

RockO

2010-Apr-11 18:03 UTC

head link

[R] Read data in sequences

Hi,

Thank you very much for your help! Here is the coding I used to make it
work:

    x <- textConnection("A 2.5 3.4 2.7 5.6 5.7 5.4 10.1 9.4 
    B 5.3 5.4 6.5 7.5 1.3 4.5 10.5 4.1 
    C 2.1 2.3 2.1 5.4 9.0 4.5 20.1 3.7")
    dat <- read.table(x)
    nv <- 2 # Number of variables that are repeated in sequence
    mkvarying <-
function(n)list(paste("V",seq(nv,n,by=nv),sep=""),paste("V",seq(nv+1,n,by=nv),sep=""))
   
reshape(dat,idvar="V1",varying=mkvarying(I(length(dat))),direction="long",timevar=NULL)

It is a good idea to first read the data then rearrange them in a second
step.

Rock
DRF-MRNF, Quebec
-- 
View this message in context:
http://n4.nabble.com/Read-data-in-sequences-tp1819487p1836275.html
Sent from the R help mailing list archive at Nabble.com.

RockO

2010-Apr-11 18:17 UTC

head link

[R] Read data in sequences

Gabor's elegant method works as well indeed where gl(number of factor
level,number of replications):
> data.frame(id = dat[gl(3,4),1], t(matrix(t(dat[-1]), 2)))
Rock
DRF-MRNF, Quebec
-- 
View this message in context:
http://n4.nabble.com/Read-data-in-sequences-tp1819487p1836285.html
Sent from the R help mailing list archive at Nabble.com.

Possibly Parallel Threads

Search for more maybe matching threads

R help - Apr 2010 - Read data in sequences

[R] Read data in sequences

[R] Read data in sequences

[R] Read data in sequences

[R] Read data in sequences

[R] Read data in sequences

[R] Read data in sequences

Possibly Parallel Threads