thr3ads.net - R help - [R] reshaping data? [Sep 2005]

If this information is useful, please help other people find it:
Share via:

Karin Lagesen

2005-Sep-29 13:16 UTC

[R] reshaping data?

I have a file like this:


a       0.1
a       0.2
a       0.9
b       0.5
b       0.9
b       0.7
c       0.6
c       0.99
c       0.88

Which I would like to get to be the following matrix:

      0.1     0.2    0.3    0.4  ...  
a     1        2     0       0 
b     0        0     0       0 
..

I.e: each place in the matrix denotes how many entries in each
category that are betwee 0.0 and 0.1, 0.1 and 0.2 and so on.

The way I was thinking of doing it was by constructing an empty matrix
and then doing a for loop testing each element and incrementing in the
matrix as appropriate. However, it struck me that this has to be
easier to do than that. Am I right?

Karin
-- 
Karin Lagesen, PhD student
karin.lagesen at medisin.uio.no
http://www.cmbn.no/rognes/

Dimitris Rizopoulos

2005-Sep-29 14:00 UTC

head link

[R] reshaping data?

you could use something like this (but maybe there are better 
proposals):

dat <- data.frame(g = rep(letters[1:3], each = 5), val = runif(15))

out <- do.call(rbind, lapply(split(dat$val, dat$g), function(x){
    f <- factor(findInterval(x, vec = seq(0, 1, 0.1)), levels = 1:10)
    table(f)
    }))
colnames(out) <- seq(0.1, 1, 0.1)
out


I hope it helps.

Best,
Dimitris

----
Dimitris Rizopoulos
Ph.D. Student
Biostatistical Centre
School of Public Health
Catholic University of Leuven

Address: Kapucijnenvoer 35, Leuven, Belgium
Tel: +32/(0)16/336899
Fax: +32/(0)16/337015
Web: http://www.med.kuleuven.be/biostat/
     http://www.student.kuleuven.be/~m0390867/dimitris.htm


----- Original Message ----- 
From: "Karin Lagesen" <karin.lagesen at medisin.uio.no>
To: <r-help at r-project.org>
Sent: Thursday, September 29, 2005 3:16 PM
Subject: [R] reshaping data?

>
> I have a file like this:
>
>
> a       0.1
> a       0.2
> a       0.9
> b       0.5
> b       0.9
> b       0.7
> c       0.6
> c       0.99
> c       0.88
>
> Which I would like to get to be the following matrix:
>
>      0.1     0.2    0.3    0.4  ...
> a     1        2     0       0
> b     0        0     0       0
> ..
>
> I.e: each place in the matrix denotes how many entries in each
> category that are betwee 0.0 and 0.1, 0.1 and 0.2 and so on.
>
> The way I was thinking of doing it was by constructing an empty 
> matrix
> and then doing a for loop testing each element and incrementing in 
> the
> matrix as appropriate. However, it struck me that this has to be
> easier to do than that. Am I right?
>
> Karin
> -- 
> Karin Lagesen, PhD student
> karin.lagesen at medisin.uio.no
> http://www.cmbn.no/rognes/
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! 
> http://www.R-project.org/posting-guide.html
> 

Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm

Thomas Lumley

2005-Sep-29 14:05 UTC

head link

[R] reshaping data?

On Thu, 29 Sep 2005, Karin Lagesen wrote:
>
> I have a file like this:
>
>
> a       0.1
> a       0.2
> a       0.9
> b       0.5
> b       0.9
> b       0.7
> c       0.6
> c       0.99
> c       0.88
>
> Which I would like to get to be the following matrix:
>
>      0.1     0.2    0.3    0.4  ...
> a     1        2     0       0
> b     0        0     0       0
> ..
>
> I.e: each place in the matrix denotes how many entries in each
> category that are betwee 0.0 and 0.1, 0.1 and 0.2 and so on.
>
If your variables are called, say, id and value, and value is between 0 
and 1 then

table(id,cut(value,breaks=(0:10)/10))

works.

 	-thomas

Gabor Grothendieck

2005-Sep-29 14:09 UTC

head link

[R] reshaping data?

This can be shortened slightly using cut:

table(data.frame(g = dat$g, val = cut(dat$val, 0:10/10, lab = 1:10/10)))

On 9/29/05, Dimitris Rizopoulos <dimitris.rizopoulos at med.kuleuven.be>
wrote:> you could use something like this (but maybe there are better
> proposals):
>
> dat <- data.frame(g = rep(letters[1:3], each = 5), val = runif(15))
>
> out <- do.call(rbind, lapply(split(dat$val, dat$g), function(x){
>    f <- factor(findInterval(x, vec = seq(0, 1, 0.1)), levels = 1:10)
>    table(f)
>    }))
> colnames(out) <- seq(0.1, 1, 0.1)
> out
>
>
> I hope it helps.
>
> Best,
> Dimitris
>
> ----
> Dimitris Rizopoulos
> Ph.D. Student
> Biostatistical Centre
> School of Public Health
> Catholic University of Leuven
>
> Address: Kapucijnenvoer 35, Leuven, Belgium
> Tel: +32/(0)16/336899
> Fax: +32/(0)16/337015
> Web: http://www.med.kuleuven.be/biostat/
>     http://www.student.kuleuven.be/~m0390867/dimitris.htm
>
>
> ----- Original Message -----
> From: "Karin Lagesen" <karin.lagesen at medisin.uio.no>
> To: <r-help at r-project.org>
> Sent: Thursday, September 29, 2005 3:16 PM
> Subject: [R] reshaping data?
>
>
> >
> > I have a file like this:
> >
> >
> > a       0.1
> > a       0.2
> > a       0.9
> > b       0.5
> > b       0.9
> > b       0.7
> > c       0.6
> > c       0.99
> > c       0.88
> >
> > Which I would like to get to be the following matrix:
> >
> >      0.1     0.2    0.3    0.4  ...
> > a     1        2     0       0
> > b     0        0     0       0
> > ..
> >
> > I.e: each place in the matrix denotes how many entries in each
> > category that are betwee 0.0 and 0.1, 0.1 and 0.2 and so on.
> >
> > The way I was thinking of doing it was by constructing an empty
> > matrix
> > and then doing a for loop testing each element and incrementing in
> > the
> > matrix as appropriate. However, it struck me that this has to be
> > easier to do than that. Am I right?
> >
> > Karin
> > --
> > Karin Lagesen, PhD student
> > karin.lagesen at medisin.uio.no
> > http://www.cmbn.no/rognes/
> >
> > ______________________________________________
> > R-help at stat.math.ethz.ch mailing list
> > https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide!
> > http://www.R-project.org/posting-guide.html
> >
>
>
> Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide!
http://www.R-project.org/posting-guide.html
>

Gabor Grothendieck

2005-Sep-29 14:22 UTC

head link

[R] reshaping data?

And slightly more to:

table(g = dat$g, val = cut(dat$val, 0:10/10, lab = 1:10/10))

On 9/29/05, Gabor Grothendieck <ggrothendieck at gmail.com>
wrote:> This can be shortened slightly using cut:
>
> table(data.frame(g = dat$g, val = cut(dat$val, 0:10/10, lab = 1:10/10)))
>
> On 9/29/05, Dimitris Rizopoulos <dimitris.rizopoulos at
med.kuleuven.be> wrote:
> > you could use something like this (but maybe there are better
> > proposals):
> >
> > dat <- data.frame(g = rep(letters[1:3], each = 5), val = runif(15))
> >
> > out <- do.call(rbind, lapply(split(dat$val, dat$g), function(x){
> >    f <- factor(findInterval(x, vec = seq(0, 1, 0.1)), levels =
1:10)
> >    table(f)
> >    }))
> > colnames(out) <- seq(0.1, 1, 0.1)
> > out
> >
> >
> > I hope it helps.
> >
> > Best,
> > Dimitris
> >
> > ----
> > Dimitris Rizopoulos
> > Ph.D. Student
> > Biostatistical Centre
> > School of Public Health
> > Catholic University of Leuven
> >
> > Address: Kapucijnenvoer 35, Leuven, Belgium
> > Tel: +32/(0)16/336899
> > Fax: +32/(0)16/337015
> > Web: http://www.med.kuleuven.be/biostat/
> >     http://www.student.kuleuven.be/~m0390867/dimitris.htm
> >
> >
> > ----- Original Message -----
> > From: "Karin Lagesen" <karin.lagesen at
medisin.uio.no>
> > To: <r-help at r-project.org>
> > Sent: Thursday, September 29, 2005 3:16 PM
> > Subject: [R] reshaping data?
> >
> >
> > >
> > > I have a file like this:
> > >
> > >
> > > a       0.1
> > > a       0.2
> > > a       0.9
> > > b       0.5
> > > b       0.9
> > > b       0.7
> > > c       0.6
> > > c       0.99
> > > c       0.88
> > >
> > > Which I would like to get to be the following matrix:
> > >
> > >      0.1     0.2    0.3    0.4  ...
> > > a     1        2     0       0
> > > b     0        0     0       0
> > > ..
> > >
> > > I.e: each place in the matrix denotes how many entries in each
> > > category that are betwee 0.0 and 0.1, 0.1 and 0.2 and so on.
> > >
> > > The way I was thinking of doing it was by constructing an empty
> > > matrix
> > > and then doing a for loop testing each element and incrementing
in
> > > the
> > > matrix as appropriate. However, it struck me that this has to be
> > > easier to do than that. Am I right?
> > >
> > > Karin
> > > --
> > > Karin Lagesen, PhD student
> > > karin.lagesen at medisin.uio.no
> > > http://www.cmbn.no/rognes/
> > >
> > > ______________________________________________
> > > R-help at stat.math.ethz.ch mailing list
> > > https://stat.ethz.ch/mailman/listinfo/r-help
> > > PLEASE do read the posting guide!
> > > http://www.R-project.org/posting-guide.html
> > >
> >
> >
> > Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm
> >
> > ______________________________________________
> > R-help at stat.math.ethz.ch mailing list
> > https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide!
http://www.R-project.org/posting-guide.html
> >
>

Jim Lemon

2005-Sep-30 08:58 UTC

head link

[R] reshaping data?

Karin Lagesen wrote:> I have a file like this:
> 
> 
> a       0.1
> a       0.2
> a       0.9
> b       0.5
> b       0.9
> b       0.7
> c       0.6
> c       0.99
> c       0.88
> 
> Which I would like to get to be the following matrix:
> 
>       0.1     0.2    0.3    0.4  ...  
> a     1        2     0       0 
> b     0        0     0       0 
> ..
> If you have a file named KL.dat with the above data in it:

KL.df<-read.table("KL.dat",header=FALSE)
KL.mat<-as.matrix(table(KL.df))

Jim

R help - Sep 2005 - reshaping data?

[R] reshaping data?

[R] reshaping data?

[R] reshaping data?

[R] reshaping data?

[R] reshaping data?

[R] reshaping data?