Hi expeRts,
I would like to calculate weighted mean by two factors.
My code is as follows:
R> tmp <- by(re$meta.sales.lkm[, c("pc", "sales")],
                      re$meta.sales.lkm[, c("size", "yr")],
function(x)
                      weighted.mean(x[,1], x[,2]))
The result is as follows:
R> tmp
size: micro
yr: 1994
[1] 1.090
------------------------------------------------------------
size: small
yr: 1994
[1] 1.135
------------------------------------------------------------
size: medium
yr: 1994
[1] 1.113
------------------------------------------------------------
size: large
yr: 1994
[1] 1.105
------------------------------------------------------------
size: micro
yr: 1995
[1] 1.167
------------------------------------------------------------
size: small
yr: 1995
[1] 1.096
------------------------------------------------------------
size: medium
yr: 1995
[1] 1.056
....
....
But the form I want to get is as follows:
            1994       1995         1996      .....
micro    1.090      1.167         .............
small     1.135      1.096
medium 1.113      1.056        .... ........
large      1.105      ....... ...........
That is, the result should be tabularized.
How can I get the above form directly? (I don't want to modify tmp with
as.vector() and matrix() to get the result)
Thank  you in advance.
--------------------------------------------------------------------------
Donghyun Oh
CESIS, KTH
--------------------------------------------------------------------------
	[[alternative HTML version deleted]]
Sounds like a job for plyr: http://had.co.nz/plyr On Mon, Apr 13, 2009 at 7:56 PM, Dong H. Oh <r.arecibo at gmail.com> wrote:> Hi expeRts, > > I would like to calculate weighted mean by two factors. > > My code is as follows: > > R> tmp <- by(re$meta.sales.lkm[, c("pc", "sales")], > ? ? ? ? ? ? ? ? ? ? ?re$meta.sales.lkm[, c("size", "yr")], function(x) > ? ? ? ? ? ? ? ? ? ? ?weighted.mean(x[,1], x[,2])) > > The result is as follows: > R> tmp > size: micro > yr: 1994 > [1] 1.090 > ------------------------------------------------------------ > size: small > yr: 1994 > [1] 1.135 > ------------------------------------------------------------ > size: medium > yr: 1994 > [1] 1.113 > ------------------------------------------------------------ > size: large > yr: 1994 > [1] 1.105 > ------------------------------------------------------------ > size: micro > yr: 1995 > [1] 1.167 > ------------------------------------------------------------ > size: small > yr: 1995 > [1] 1.096 > ------------------------------------------------------------ > size: medium > yr: 1995 > [1] 1.056 > .... > .... > > But the form I want to get is as follows: > ? ? ? ? ? ?1994 ? ? ? 1995 ? ? ? ? 1996 ? ? ?..... > micro ? ?1.090 ? ? ?1.167 ? ? ? ? ............. > small ? ? 1.135 ? ? ?1.096 > medium 1.113 ? ? ?1.056 ? ? ? ?.... ........ > large ? ? ?1.105 ? ? ?....... ........... > > That is, the result should be tabularized. > How can I get the above form directly? (I don't want to modify tmp with > as.vector() and matrix() to get the result) > > Thank ?you in advance. > > -------------------------------------------------------------------------- > Donghyun Oh > CESIS, KTH > -------------------------------------------------------------------------- > > ? ? ? ?[[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >-- Mike Lawrence Graduate Student Department of Psychology Dalhousie University Looking to arrange a meeting? Check my public calendar: http://tr.im/mikes_public_calendar ~ Certainty is folly... I think. ~
Note that that output of by() is a matrix, but with some extra
attributes added to it.
Since you didn't supply any data I made up some that might
resemble yours.
  > set.seed(1)
  > re<-list(meta.sales.lkm=data.frame(pc=runif(40), sales=rpois(40,3),
size=sample(c("small","medium","large"),size=40,replace=TRUE),
yr=sample(1994:1998,size=40,replace=TRUE)))
I ran your by() call
   > tmp<-by(re$meta.sales.lkm[, c("pc", "sales")],
               re$meta.sales.lkm[, c("size", "yr")],
               function(x) weighted.mean(x[,1], x[,2]))
and looked at it with dput and saw that all the usual
components of a matrix are in it
   > dput(tmp)
   structure(c(0.86969084572047, 0.687022846657783, 0.40032217082464,
   0.125555095961317, 0.529131081343318, 0.64538708513137,
0.613078526553831,
   0.663822646145351, 0.48206098045921, 0.333916208640273,
0.513083046752339,
   NA, 0.457996427547187, 0.30292882991489, NA), .Dim = c(3L, 5L
   ), .Dimnames = structure(list(size = c("large", "medium",
"small"
   ), yr = c("1994", "1995", "1996",
"1997", "1998")), .Names c("size",
   "yr")), call = by.data.frame(data = re$meta.sales.lkm[,
c("pc",
       "sales")], INDICES = re$meta.sales.lkm[, c("size",
"yr")],
       FUN = function(x) weighted.mean(x[, 1], x[, 2])), class = "by")
It is just the print method for 'by' objects that makes it look
different.
Since there is no special 'by' method for '[' you can use tmp[,]
to view
the matrix part of it
   > tmp[,]
           yr
   size          1994      1995      1996      1997      1998
     large  0.8696908 0.1255551 0.6130785 0.3339162 0.4579964
     medium 0.6870228 0.5291311 0.6638226 0.5130830 0.3029288
     small  0.4003222 0.6453871 0.4820610        NA        NA
If there were a [.by then you might have to manually remove the
"call" attribute and change the class to "matrix".
Bill Dunlap
TIBCO Software Inc - Spotfire Division
wdunlap tibco.com 
---------------------------------------
R] weighted mean and by() with two index
Dong H. Oh r.arecibo at gmail.com 
Tue Apr 14 00:56:28 CEST 2009
Hi expeRts,
I would like to calculate weighted mean by two factors.
My code is as follows:
R> tmp <- by(re$meta.sales.lkm[, c("pc", "sales")],
                      re$meta.sales.lkm[, c("size", "yr")],
function(x)
                      weighted.mean(x[,1], x[,2]))
The result is as follows:
R> tmp
size: micro
yr: 1994
[1] 1.090
------------------------------------------------------------
size: small
yr: 1994
[1] 1.135
------------------------------------------------------------
size: medium
yr: 1994
[1] 1.113
------------------------------------------------------------
size: large
yr: 1994
[1] 1.105
------------------------------------------------------------
size: micro
yr: 1995
[1] 1.167
------------------------------------------------------------
size: small
yr: 1995
[1] 1.096
------------------------------------------------------------
size: medium
yr: 1995
[1] 1.056
....
....
But the form I want to get is as follows:
            1994       1995         1996      .....
micro    1.090      1.167         .............
small     1.135      1.096
medium 1.113      1.056        .... ........
large      1.105      ....... ...........
That is, the result should be tabularized.
How can I get the above form directly? (I don't want to modify tmp with
as.vector() and matrix() to get the result)
Thank  you in advance.
------------------------------------------------------------------------
--
Donghyun Oh
CESIS, KTH
------------------------------------------------------------------------
--
	[[alternative HTML version deleted]]