thr3ads.net - R help - [R] Improving effeciency

If this information is useful, please help other people find it:
Share via:

Simon Cullen

2004-Jul-06 12:56 UTC

[R] Improving effeciency - better table()?

Hi,

I've been running some simulations for a while and the performance of R  
has been great. However, I've recently changed the code to perform a sort  
of chi-square goodness-of-fit test. To get the observed values for each  
cell I've been using table() - specifically I've been using cut2 from  
Hmisc to divide up the range into a specified number of cells and then  
using table to count how many observations appear in each cell.
> obs <- table(cut2(z.trun, cuts=breaks))
Having done this I've found that the code takes much longer to run - up to  
10x as long. Is there a more effecient way of doing this? Anyone have any  
thoughts?

-- 
SC

Simon Cullen
Room 3030
Dept. Of Economics
Trinity College Dublin

Ph. (608)3477
Email cullens at tcd.ie

Roger D. Peng

2004-Jul-06 13:00 UTC

head link

[R] Improving effeciency - better table()?

Have you tried using hist() with specifying `br' and `plot = FALSE'?
See the note in ?cut.

-roger

Simon Cullen wrote:> Hi,
> 
> I've been running some simulations for a while and the performance of R
> has been great. However, I've recently changed the code to perform a 
> sort  of chi-square goodness-of-fit test. To get the observed values for 
> each  cell I've been using table() - specifically I've been using
cut2
> from  Hmisc to divide up the range into a specified number of cells and 
> then  using table to count how many observations appear in each cell.
> 
>> obs <- table(cut2(z.trun, cuts=breaks))
> 
> 
> Having done this I've found that the code takes much longer to run - up
> to  10x as long. Is there a more effecient way of doing this? Anyone 
> have any  thoughts?
> 
-- 
Roger D. Peng
biostat.jhsph.edu/~rpeng

Liaw, Andy

2004-Jul-06 13:02 UTC

head link

[R] Improving effeciency - better table()?

Since you didn't provide an example of what z.trun and breaks may look like,
most people can only guess.  Before asking how code can be made more
efficient, it might be more helpful to find out where in the code is taking
time.  Try:

Rprof()
obs <- table(cut2(z.trun, cuts=breaks))
Rprof(NULL)
summaryRprof()

Andy
> From: Simon Cullen
> 
> Hi,
> 
> I've been running some simulations for a while and the 
> performance of R  
> has been great. However, I've recently changed the code to 
> perform a sort  
> of chi-square goodness-of-fit test. To get the observed 
> values for each  
> cell I've been using table() - specifically I've been using 
> cut2 from  
> Hmisc to divide up the range into a specified number of cells 
> and then  
> using table to count how many observations appear in each cell.
> 
> > obs <- table(cut2(z.trun, cuts=breaks))
> 
> Having done this I've found that the code takes much longer 
> to run - up to  
> 10x as long. Is there a more effecient way of doing this? 
> Anyone have any  
> thoughts?
> 
> -- 
> SC
> 
> Simon Cullen
> Room 3030
> Dept. Of Economics
> Trinity College Dublin
> 
> Ph. (608)3477
> Email cullens at tcd.ie

Marc Schwartz

2004-Jul-06 13:11 UTC

head link

[R] Improving effeciency - better table()?

On Tue, 2004-07-06 at 07:56, Simon Cullen wrote:> Hi,
> 
> I've been running some simulations for a while and the performance of R
> has been great. However, I've recently changed the code to perform a
sort
> of chi-square goodness-of-fit test. To get the observed values for each  
> cell I've been using table() - specifically I've been using cut2
from
> Hmisc to divide up the range into a specified number of cells and then  
> using table to count how many observations appear in each cell.
> 
> > obs <- table(cut2(z.trun, cuts=breaks))
> 
> Having done this I've found that the code takes much longer to run - up
to
> 10x as long. Is there a more effecient way of doing this? Anyone have any  
> thoughts?

It would appear that you might be attempting to do a Hosmer-Lemeshow
type of GOF test.

If indeed that is the case, before making the above more efficient, you
should spend some time reviewing the following posts by Frank Harrell on
this subject:

maths.newcastle.edu.au/~rking/R/help/02b/4210.html

maths.newcastle.edu.au/~rking/R/help/02b/3111.html

HTH,

Marc Schwartz

Possibly Parallel Threads

Search for more seemingly similar threads

R help - Jul 2004 - Improving effeciency - better table()?

[R] Improving effeciency - better table()?

[R] Improving effeciency - better table()?

[R] Improving effeciency - better table()?

[R] Improving effeciency - better table()?

Possibly Parallel Threads