thr3ads.net - R help - [R] Counter in a For Loop

If this information is useful, please help other people find it:
Share via:

Leendert Punt

2011-Feb-02 07:35 UTC

[R] Counter in a For Loop - Efficiency Issue

I have a 32000 x 14 matrix (M) where entry (x, y) corresponds to
person x at a discrete time y. I then have a matrix (M2) of 60000 x 2
where each entry is a an event by a person that is in the above 32000
and at a time that is in the range of the discrete time points above.
I want to populate the another matrix (M3) such that (x, y) is the
number of events of person x between times y-1 and y. This is simple
to code, (I just put the discrete time points and the 32000 people in
their own vectors, V1 and V2):


for (i in 1:32000){
for (j in 1:60000){

for (k in 2:14){

if (M2[1, j] == V2[i]   &   M2[2, j]  > V1[k-1]   &  M2[2, j] <=
V1[k])
M3[i, k] = M3[i, k]+1

}}}

This would work im sure, the problem is that it takes a very long
time. Any advice on helping it run faster would be greatly
appreciated.

Bart Joosen

2011-Feb-02 09:57 UTC

head link

[R] Counter in a For Loop - Efficiency Issue

without a reproducible sample, it is hard to tell, but I will give it a shot.
Maybe it's possible to merge your M with M2: merge(M, M2)
If you only want to count times, you can use seq_along(x) in a by function
eg:
dat <- data.frame(person=rep(c(1,2,3), each=5), time=rnorm(15))
by(dat$person, dat$person, function(x) seq_along(x))

Ofcourse you can use any of the other vectorisation functions (lapply,
apply, aggregate, ....)

HTH


Bart
-- 
View this message in context:
http://r.789695.n4.nabble.com/Counter-in-a-For-Loop-Efficiency-Issue-tp3253674p3253698.html
Sent from the R help mailing list archive at Nabble.com.

Petr Savicky

2011-Feb-02 10:29 UTC

head link

[R] Counter in a For Loop - Efficiency Issue

On Wed, Feb 02, 2011 at 09:35:28AM +0200, Leendert Punt
wrote:> I have a 32000 x 14 matrix (M) where entry (x, y) corresponds to
> person x at a discrete time y. I then have a matrix (M2) of 60000 x 2
> where each entry is a an event by a person that is in the above 32000
> and at a time that is in the range of the discrete time points above.
> I want to populate the another matrix (M3) such that (x, y) is the
> number of events of person x between times y-1 and y. This is simple
> to code, (I just put the discrete time points and the 32000 people in
> their own vectors, V1 and V2):
> 
> 
> for (i in 1:32000){
> for (j in 1:60000){
> 
> for (k in 2:14){
> 
> if (M2[1, j] == V2[i]   &   M2[2, j]  > V1[k-1]   &  M2[2, j]
<= V1[k])
> M3[i, k] = M3[i, k]+1
> 
> }}}
Consider the following approach, which uses M2, V1 and V2 as
used in the code above.

  V1 <- c(0, 10, 20, 30, 40)
  V2 <- 1:3 # list of persons in the required order
  M2 <- rbind(
  c(1, 2, 1, 1, 3, 1, 3, 1),
  c(11, 21, 21, 21, 11, 21, 11, 31))

  person <- factor(M2[1, ], levels=V2)
  interval <- cut(M2[2, ], breaks=V1)
  M3 <- table(person, interval)

  M2

     [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8]
[1,]    1    2    1    1    3    1    3    1
[2,]   11   21   21   21   11   21   11   31

  M3

        interval
  person (0,10] (10,20] (20,30] (30,40]
       1      0       1       3       1
       2      0       0       1       0
       3      0       2       0       0

Does this approach work for your data?

Petr Savicky.

Possibly Parallel Threads

Search for more possibly parallel threads

R help - Feb 2011 - Counter in a For Loop - Efficiency Issue

[R] Counter in a For Loop - Efficiency Issue

[R] Counter in a For Loop - Efficiency Issue

[R] Counter in a For Loop - Efficiency Issue

Possibly Parallel Threads