Dear all,

Could you please advise on the R code I could use to do the following operation?

1 -- I have 2 lists of "genome coordinates"; each list is composed of numbers that represent genome coordinates. Let's say list N:

n1
n2
n3
n4

and list M:

m1
m2
m3
m4
m5

2 -- and a data frame C where, for some pairs of coordinates (n, m) from the lists above, we have a numerical intensity; for example:

n1; m1; 100
n1; m2; 300

The question is: what is the most efficient R code I could use to integrate list N, list M, and data frame C in order to obtain a DATA FRAME with:

-- list N as the column names
-- list M as the row names
-- the values in the cells of N * M corresponding to the numerical values in data frame C.

A little example would be:

     n1   n2   n3   n4
m1   100  -    -    -
m2   300  -    -    -
m3   -    -    -    -
m4   -    -    -    -
m5   -    -    -    -

I wrote a script in Perl, although I would like to do this in R.

Many thanks ;)

-- bogdan
Reproducible example, please. In particular, what exactly does C look like?

(You should know this by now.)

-- Bert

Bert Gunter

"The trouble with having an open mind is that people keep coming along and sticking things into it."
-- Opus (aka Berkeley Breathed in his "Bloom County" comic strip)

On Mon, Jun 5, 2017 at 6:45 PM, Bogdan Tanasa <tanasa at gmail.com> wrote:
> Dear all,
>
> please could you advise on the R code I could use in order to do the
> following operation :
> [...]
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
Dear Bert,

thank you for your response. Here is the piece of R code, given the 3 data frames below:

N <- data.frame(N=c("n1","n2","n3","n4"))

M <- data.frame(M=c("m1","m2","m3","m4","m5"))

C <- data.frame(n=c("n1","n2","n3"), m=c("m1","m1","m3"), I=c(100,300,400))

How shall I integrate N, M, and C in such a way that at the end we have a data frame with:

- list N as the column names
- list M as the row names
- the values in the cells of N * M corresponding to the numerical values in data frame C.

More precisely, the result shall be:

     n1   n2   n3   n4
m1   100  300  -    -
m2   -    -    -    -
m3   -    -    400  -
m4   -    -    -    -
m5   -    -    -    -

Thank you!

On Mon, Jun 5, 2017 at 6:57 PM, Bert Gunter <bgunter.4567 at gmail.com> wrote:
> Reproducible example, please. -- In particular, what exactly does C look
> like?
>
> (You should know this by now).
>
> -- Bert
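
A minimal sketch of one way to build the requested table in base R, using the N, M and C data frames from the reproducible example. It is not the only approach, and it rests on some assumptions: each (n, m) pair occurs at most once in C, every name in C also appears in N or M, and empty cells are represented as NA rather than "-". The names res and res_df are made up for this illustration.

```r
# Reproducible inputs copied from the example in this thread.
N <- data.frame(N = c("n1", "n2", "n3", "n4"))
M <- data.frame(M = c("m1", "m2", "m3", "m4", "m5"))
C <- data.frame(n = c("n1", "n2", "n3"),
                m = c("m1", "m1", "m3"),
                I = c(100, 300, 400))

# Start from an M x N matrix full of NA, with M as the row names and
# N as the column names. as.character() guards against the columns
# being factors (the data.frame() default in older R versions).
res <- matrix(NA_real_,
              nrow = nrow(M), ncol = nrow(N),
              dimnames = list(as.character(M$M), as.character(N$N)))

# Fill the cells named by each (m, n) pair in C. A two-column character
# matrix used as an index is matched against the row and column names.
# Assumption: every name in C exists in M / N, otherwise this errors.
res[cbind(as.character(C$m), as.character(C$n))] <- C$I

# Convert to a data frame, since a data frame was requested.
res_df <- as.data.frame(res)
print(res_df)
```

With the C given above, the filled cells are (m1, n1) = 100, (m1, n2) = 300 and (m3, n3) = 400; every other cell stays NA. Filling a preallocated matrix in one vectorised assignment avoids looping over the pairs, which should matter if N and M are long lists of coordinates.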