thr3ads.net - R help - [R] summaryBy(): Is it the best option? [Dec 2006]

If this information is useful, please help other people find it:
Share via:

Werner Wernersen

2006-Dec-05 00:30 UTC

[R] summaryBy(): Is it the best option?

Hi,

since I have quite large tables and the processing
takes quite a while I am
curious if I can improve the performance of this
aggregation somehow: At the 
moment I am using summaryBy from the doBy package
under R 2.4.0, Win2K.

summaryBy(soc_s6aq5 + soc_s6aq7 + soc_s6aq9 +
soc_s6aq11 ~ hh + 
comgroup,soc6a,postfix=c("","","",""),FUN=sum,
na.rm=T)

The data.frame has 124100 rows and 13 cols.

Thanks for any hints!

Werner

Petr Pikal

2006-Dec-06 09:27 UTC

head link

[R] summaryBy(): Is it the best option?

Hi

I did not see any answer yet so I try. You can use aggregate, by or 
sapply.

Something like
aggregate(soc6a[, your columns], list(hh=hh), sum, na.rm=T)
by(soc6a[, your columns], hh, sum, na.rm=T)
sapply(split(soc6a[, your columns], hh), sum, na.rm=T)

But you have to check speed gain by yourself.

HTH
Petr



On 5 Dec 2006 at 1:30, Werner Wernersen wrote:

Date sent:      	Tue, 5 Dec 2006 01:30:50 +0100 (CET)
From:           	Werner Wernersen <pensterfuzzer at yahoo.de>
To:             	r-help at stat.math.ethz.ch
Subject:        	[R] summaryBy(): Is it the best option?
> Hi,
> 
> since I have quite large tables and the processing
> takes quite a while I am
> curious if I can improve the performance of this
> aggregation somehow: At the 
> moment I am using summaryBy from the doBy package
> under R 2.4.0, Win2K.
> 
> summaryBy(soc_s6aq5 + soc_s6aq7 + soc_s6aq9 +
> soc_s6aq11 ~ hh + 
>
comgroup,soc6a,postfix=c("","","",""),FUN=sum,
> na.rm=T)
> 
> The data.frame has 124100 rows and 13 cols.
> 
> Thanks for any hints!
> 
> Werner
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html and provide commented,
> minimal, self-contained, reproducible code.
Petr Pikal
petr.pikal at precheza.cz

Seemingly Similar Threads

Search for more reasonably related threads

R help - Dec 2006 - summaryBy(): Is it the best option?

[R] summaryBy(): Is it the best option?

[R] summaryBy(): Is it the best option?

Seemingly Similar Threads