Dear all, I'd like to get a percentage variable based on a group, but without creating a new data frame. For example: data(iris) iris$percent <-unlist(tapply(iris$Sepal.Length,iris$Species,function(x) x/sum(x, na.rm=TRUE))) This does not work, I should have only three standard values, respectively for setosa, versicolor, and virginica. How can I do this? MANY THANKS, Karine [[alternative HTML version deleted]]
HI, Not sure if this is what you meant. tapply(iris$Sepal.Length,iris$Species,FUN=function(x) sum(x)/sum(iris$Sepal.Length)*100) ?#? setosa versicolor? virginica ?# 28.55676?? 33.86195?? 37.58129 A.K. ----- Original Message ----- From: Karine Charlebois <karine.charlebois at outlook.com> To: "r-help at r-project.org" <r-help at r-project.org> Cc: Sent: Tuesday, January 15, 2013 9:30 PM Subject: [R] Get a percent variable based on group Dear all, I'd like to get a percentage variable based on a group, but without creating a new data frame. For example: data(iris) iris$percent <-unlist(tapply(iris$Sepal.Length,iris$Species,function(x) x/sum(x, na.rm=TRUE))) This does not work, I should have only three standard values, respectively for setosa, versicolor, and virginica. How can I do this? MANY THANKS, Karine ??? ??? ??? ? ??? ??? ? ??? [[alternative HTML version deleted]] ______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
On Jan 15, 2013, at 6:30 PM, Karine Charlebois wrote:> Dear all, I'd like to get a percentage variable based on a group, but without creating a new data frame. > For example: > data(iris) > > iris$percent <-unlist(tapply(iris$Sepal.Length,iris$Species,function(x) x/sum(x, na.rm=TRUE)))A percentage is 100 times a fraction whose nominal value is unity. My guess is that you want a percentage of the group mean? So this would just be: iris$percent <-ave(iris$Sepal.Length, iris$Species, FUN=function(x) 100*x/mean(x, na.rm=TRUE)) head(iris)> > This does not work, I should have only three standard values, respectively for setosa, versicolor, and virginica. How can I do this?If you just want three values, then I do not see how these are percentages.> tapply(iris$Sepal.Length,iris$Species,function(x) mean(x, na.rm=TRUE))setosa versicolor virginica 5.006 5.936 6.588 -- David Winsemius Alameda, CA, USA
Hi, Is it this? aggregate(iris$Sepal.Length,by=list(iris$Species),FUN=function(x) sum(x)/sum(iris$Sepal.Length)*100) ???? Group.1??????? x 1???? setosa 28.55676 2 versicolor 33.86195 3? virginica 37.58129 A.K. ________________________________ From: Karine Charlebois <karine.charlebois at outlook.com> To: arun <smartpink111 at yahoo.com> Sent: Tuesday, January 15, 2013 10:22 PM Subject: RE: [R] Get a percent variable based on group For example,? iris$percent <- unlist(tapply(iris$Sepal.Length,iris$Species,function(x) x/sum(iris$Sepal.Length, na.rm=TRUE))) aggregate(iris$percent, by=list(iris$Species), FUN=sum, na.rm=TRUE) this last command should return 100% for each specie, not the following values: Group.1???????? x 1???? setosa 0.2855676 2 versicolor 0.3386195 3? virginica 0.3758129 ________________________________ From: karine.charlebois at outlook.com To: smartpink111 at yahoo.com Subject: RE: [R] Get a percent variable based on group Date: Tue, 15 Jan 2013 22:13:27 -0500 No, it is not. I need a new column with these values. Karine> Date: Tue, 15 Jan 2013 19:11:22 -0800 > From: smartpink111 at yahoo.com > Subject: Re: [R] Get a percent variable based on group > To: karine.charlebois at outlook.com > CC: r-help at r-project.org > > > > HI, > > Not sure if this is what you meant. > tapply(iris$Sepal.Length,iris$Species,FUN=function(x) sum(x)/sum(iris$Sepal.Length)*100) > ?#? setosa versicolor? virginica > ?# 28.55676?? 33.86195?? 37.58129 > A.K. > > > ----- Original Message ----- > From: Karine Charlebois <karine.charlebois at outlook.com> > To: "r-help at r-project.org" <r-help at r-project.org> > Cc: > Sent: Tuesday, January 15, 2013 9:30 PM > Subject: [R] Get a percent variable based on group > > Dear all, I'd like to get a percentage variable based on a group, but without creating a new data frame. > For example: > data(iris) > > iris$percent <-unlist(tapply(iris$Sepal.Length,iris$Species,function(x) x/sum(x, na.rm=TRUE))) > > This does not work, I should have only three standard values, respectively for setosa, versicolor, and virginica. How can I do this? > > MANY THANKS, > > Karine > ??? ???????? ?????? ??? ? > ??? [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >
As others have said, your goal is unclear to us. However, one guess I have not
seen others make is if you are looking for a way to normalize within each group,
perhaps you should look at
?ave
which typically creates a vector just as long as your data vector and grouping
vector.
---------------------------------------------------------------------------
Jeff Newmiller                        The     .....       .....  Go Live...
DCN:<jdnewmil at dcn.davis.ca.us>        Basics: ##.#.       ##.#.  Live
Go...
                                      Live:   OO#.. Dead: OO#..  Playing
Research Engineer (Solar/Batteries            O.O#.       #.O#.  with
/Software/Embedded Controllers)               .OO#.       .OO#.  rocks...1k
--------------------------------------------------------------------------- 
Sent from my phone. Please excuse my brevity.
Karine Charlebois <karine.charlebois at outlook.com> wrote:
>Dear all, I'd like to get a percentage variable based on a group, but
>without creating a new data frame. 
>For example:
>data(iris)
>
>iris$percent <-unlist(tapply(iris$Sepal.Length,iris$Species,function(x)
>x/sum(x, na.rm=TRUE)))
>
>This does not work, I should have only three standard values,
>respectively for setosa, versicolor, and virginica. How can I do this?
>
>MANY THANKS,
>
>Karine 
> 		 	   		  
>	[[alternative HTML version deleted]]
>
>______________________________________________
>R-help at r-project.org mailing list
>https://stat.ethz.ch/mailman/listinfo/r-help
>PLEASE do read the posting guide
>http://www.R-project.org/posting-guide.html
>and provide commented, minimal, self-contained, reproducible code.