thr3ads.net - R devel - [Rd] HOW TO AVOID LOOPS [Apr 2008]


carlos martinez

2008-Apr-12 16:47 UTC

[Rd] HOW TO AVOID LOOPS

> Looking for a simple, effective solution with minimal execution time.
>
> For a vector such as:
>
> c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
> To transform it to the following vector without using any loops:
> (0,0,1,0,1,2,3,0,0,1,2,0,1,0,1,2,3,4,5,6)
> Any suggestions are appreciated.


Vincent Goulet

2008-Apr-12 17:30 UTC


On Sat., 12 Apr. at 12:47, carlos martinez wrote:
> Looking for a simple, effective solution with minimal execution time.
>
> For a vector such as:
>
> c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
>
> To transform it to the following vector without using any loops:
>
> (0,0,1,0,1,2,3,0,0,1,2,0,1,0,1,2,3,4,5,6)
>
> Any suggestions are appreciated.

This does it -- but it is admittedly ugly:

 > x <- c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
 > ind <- which(x == 0)
 > unlist(lapply(mapply(seq, ind, c(tail(ind, -1) - 1, length(x))), function(y) cumsum(x[y])))
  [1] 0 0 1 0 1 2 3 0 0 1 2 0 1 0 1 2 3 4 5 6

(The mapply() part is used to create the indexes of each sequence in x  
starting with a 0. The rest is then straightforward.)

HTH

---
   Vincent Goulet, Associate Professor
   École d'actuariat
   Université Laval, Québec
   Vincent.Goulet at act.ulaval.ca   http://vgoulet.act.ulaval.ca
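
Editorial sketch (not part of the original post): the mapply() step described above can be unpacked into named intermediate values to show how the index blocks are built. The names `ends`, `blocks` and `result` are made up for illustration, and `SIMPLIFY = FALSE` is an addition that keeps mapply() from returning a matrix when all blocks happen to have the same length. Like the original, this assumes x starts with a 0; any leading 1s would be dropped.

```r
x <- c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)

ind  <- which(x == 0)                     # start index of each block
ends <- c(tail(ind, -1) - 1, length(x))   # end index of each block

# One index vector per block, e.g. 4:7 for the block 0,1,1,1.
# SIMPLIFY = FALSE is an addition to the original code (see note above).
blocks <- mapply(seq, ind, ends, SIMPLIFY = FALSE)

# Running count of 1s inside each block, then flatten.
result <- unlist(lapply(blocks, function(i) cumsum(x[i])))
print(result)
```

Each block starts at a 0, so cumsum() restarts from zero at every block boundary.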

hadley wickham

2008-Apr-12 21:23 UTC


On Sat, Apr 12, 2008 at 11:47 AM, carlos martinez
<martinezbula at earthlink.net> wrote:
> Looking for a simple, effective solution with minimal execution time.
>
> For a vector such as:
>
> c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
>
> To transform it to the following vector without using any loops:
>
> (0,0,1,0,1,2,3,0,0,1,2,0,1,0,1,2,3,4,5,6)
>
> Any suggestions are appreciated.

How about:

unlist(lapply(split(x, cumsum(x == 0)), seq_along)) - 1

Hadley


-- 
http://had.co.nz/
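
Editorial sketch (not part of the original post): the one-liner above can be read step by step as follows. The intermediate names `grp`, `pieces`, `counts` and `res` are made up for illustration. `use.names = FALSE` is an addition: without it, unlist() returns a named vector, which may be what a later message in this thread calls an "extra vector". The approach assumes x starts with a 0.

```r
x <- c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)

grp    <- cumsum(x == 0)             # block id: goes up by one at every 0
pieces <- split(x, grp)              # one vector per block, each starting with a 0
counts <- lapply(pieces, seq_along)  # 1, 2, 3, ... within each block

# Shift by one so that each block starts at 0; drop the names that
# unlist() would otherwise build from the block ids.
res <- unlist(counts, use.names = FALSE) - 1
print(res)
```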

carlos martinez

2008-Apr-13 01:33 UTC


Appreciate the ingenious and effective suggestions and feedback from:

Dan Davison
Vincent Goulet
Martin Morgan
Hadley Wickham

The variety of technical approaches proposed so far is clear proof of the
strong and flexible capabilities of the R system, and especially of the
dynamism and technical understanding of the R user base.

We tested all four recommendations with an input vector of more than 850000
components, and got response times ranging from about 40 seconds down to
about 20 seconds.

All four approaches produced the desired vector. Wickham's approach
produced an extra vector, but the second vector had the correct
format.

Just one additional follow-up: to obtain from the same input vector
c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)

a vector of the following format:
(0,0,1,0,0,0,3,0,0,0,2,0,1,0,0,0,0,0,0,6)

would it be easier and more efficient to start from the original input vector,
or from the second vector above?
(0,0,1,0,1,2,3,0,0,1,2,0,1,0,1,2,3,4,5,6)

Thanks for your responses.
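
Editorial sketch (not an answer given in the thread): one possible way to approach the follow-up question is to start from the original 0/1 vector and use rle(), placing each run length at the last position of its run of 1s. The function name `run_end_counts` is hypothetical, and the input is assumed to contain only 0s and 1s with no NAs.

```r
# Hypothetical helper: length of each run of 1s, written at the run's last index.
run_end_counts <- function(x) {
  runs   <- rle(x)
  ends   <- cumsum(runs$lengths)   # index of the last element of every run
  is_one <- runs$values == 1       # keep only the runs of 1s
  out <- numeric(length(x))
  out[ends[is_one]] <- runs$lengths[is_one]
  out
}

x <- c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
print(run_end_counts(x))
```

Starting from the original vector avoids computing the intermediate running counts at all, though whether it is faster in practice would need to be measured.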

-------------------------------------------------------------------------
Hadley Wickham Approach

How about:

unlist(lapply(split(x, cumsum(x == 0)), seq_along)) - 1

Hadley
--------------------------------------------------------------------------
-----Original Message-----
From: Martin Morgan [mailto:mtmorgan at fhcrc.org] 
Sent: Saturday, April 12, 2008 5:00 PM
To: Dan Davison
Cc: martinezbula at earthlink.net
Subject: Re: [Rd] HOW TO AVOID LOOPS

(anonymous 'off-list' response; some extra calcs but tidy)
> x=c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
> x * unlist(lapply(rle(x)$lengths, seq))
 [1] 0 0 1 0 1 2 3 0 0 1 2 0 1 0 1 2 3 4 5 6


Dan Davison <davison at stats.ox.ac.uk> writes:
> On Sat, Apr 12, 2008 at 06:45:00PM +0100, Dan Davison wrote:
>> On Sat, Apr 12, 2008 at 01:30:13PM -0400, Vincent Goulet wrote:
>> > On Sat., 12 Apr. at 12:47, carlos martinez wrote:
>> > > Looking for a simple, effective solution with minimal execution time.
>> > >
>> > > For a vector such as:
>> > >
>> > > c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
>> > >
>> > > To transform it to the following vector without using any loops:
>> > >
>> > > (0,0,1,0,1,2,3,0,0,1,2,0,1,0,1,2,3,4,5,6)
>> > >
>> > > Any suggestions are appreciated.
>> > 
>> > This does it -- but it is admittedly ugly:
>> > 
>> >  > x <- c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
>> >  > ind <- which(x == 0)
>> >  > unlist(lapply(mapply(seq, ind, c(tail(ind, -1) - 1, length(x))), function(y) cumsum(x[y])))
>> >   [1] 0 0 1 0 1 2 3 0 0 1 2 0 1 0 1 2 3 4 5 6
>> > 
>> > (The mapply() part is used to create the indexes of each sequence 
>> > in x starting with a 0. The rest is then straightforward.)
>> 
>> 
>> Here's my effort. Maybe a bit easier to digest? Only one *apply, so
>> probably more efficient.
>>
>> function(x=c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)) {
>>     d <- diff(c(0,x,0))
>>     starts <- which(d == 1)
>>     ends <- which(d == -1)
>>     x[x == 1] <- unlist(lapply(ends - starts, function(n) 1:n))
>>     x
>> }
>> 
>
> Come to think of it, I suggest using the existing R function rle(), rather
> than my dodgy substitute.
>
> e.g.
>
> g <- function(x=c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)) {
>
>     runs <- rle(x)
>     runlengths <- runs$lengths[runs$values == 1]
>     x[x == 1] <- unlist(lapply(runlengths, function(n) 1:n))
>     x
> }
>
> Dan
>
> p.s. R-help would perhaps have been more appropriate than R-devel
>
>
>> Dan
>> 
>> 
>> > 
>> > HTH
>> > 
>> > ---
>> >    Vincent Goulet, Associate Professor
>> >    École d'actuariat
>> >    Université Laval, Québec
>> >    Vincent.Goulet at act.ulaval.ca   http://vgoulet.act.ulaval.ca
>> > 
>> > ______________________________________________
>> > R-devel at r-project.org mailing list
>> > https://stat.ethz.ch/mailman/listinfo/r-devel
>
> ______________________________________________
> R-devel at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel
--
Martin Morgan
Computational Biology / Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N.
PO Box 19024 Seattle, WA 98109

Location: Arnold Building M2 B169
Phone: (206) 667-2793
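
Editorial sketch (not proposed by any poster in this thread): the approaches quoted above all rely on an *apply call. A variant that uses only vectorized primitives is possible with cumsum() and cummax(). The function name `count_runs` is hypothetical, and the input is assumed to contain only 0s and 1s with no NAs.

```r
# Hypothetical helper: running count of consecutive 1s, reset at every 0.
count_runs <- function(x) {
  cs <- cumsum(x)                   # total number of 1s seen so far
  # At each 0, record the total so far; cummax() carries that value forward,
  # so subtracting it leaves the count since the most recent 0.
  cs - cummax(cs * (x == 0))
}

x <- c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
print(count_runs(x))
```

Unlike the split()-based and mapply()-based versions, this one does not require x to start with a 0.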

Stephen Milborrow

2008-Apr-14 20:22 UTC


On Sat., 12 Apr. at 12:47, carlos martinez wrote:
> Looking for a simple, effective solution with minimal execution time.
>
> For a vector such as:
>
> c(0,0,1,0,1,1,1,0,0,1,1,0,1,0,1,1,1,1,1,1)
>
> To transform it to the following vector without using any loops:
>
> (0,0,1,0,1,2,3,0,0,1,2,0,1,0,1,2,3,4,5,6)

Here is a fast solution using the Ra just-in-time compiler
(www.milbo.users.sonic.net/ra):

jit(1)
if (length(x) > 1)
    for (i in 2:length(x))
        if (x[i])
            x[i] <- x[i-1] + 1

The times in seconds for various solutions mailed to r-devel are listed
below. There is some variation between runs and with the contents of x. The
times shown are for

set.seed(1066);  x <- as.double(runif(1e6) > .5)

This was tested on a WinXP 3 GHz Pentium D with Ra 1.0.7 (based on R 2.6.2).
The code to generate these results is attached.

Solution   Time (s)
vin        24
greg       11
had        3.9
dan        1.4
dan2       1.4
jit        0.25    # code is shown above; 7 secs with standard R 2.6.2
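
Editorial sketch (this is not the attached cm-post.R.txt script): a comparison of this kind could be set up in standard R with system.time(). The function names `had`, `dan2` and `loop` mirror the labels in the table but are reimplemented here from the code quoted in this thread. The vector is shorter than the 1e6 used above to keep the run brief, and a leading 0 is prepended (an assumption made here) because the split()-based version expects x to start with a 0. No timings are claimed; they depend on the machine and the R version.

```r
set.seed(1066)
x <- c(0, as.double(runif(1e5) > .5))   # leading 0 added for the split() version

had <- function(x)
  unlist(lapply(split(x, cumsum(x == 0)), seq_along), use.names = FALSE) - 1

dan2 <- function(x) {
  runs <- rle(x)
  runlengths <- runs$lengths[runs$values == 1]
  x[x == 1] <- unlist(lapply(runlengths, seq_len))
  x
}

loop <- function(x) {                   # the plain loop, without Ra's jit(1)
  if (length(x) > 1)
    for (i in 2:length(x))
      if (x[i]) x[i] <- x[i - 1] + 1
  x
}

# Elapsed seconds for one call of each function on the same input.
timings <- sapply(list(had = had, dan2 = dan2, loop = loop),
                  function(f) system.time(f(x))[["elapsed"]])
print(timings)
```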

Stephen Milborrow
www.milbo.users.sonic.net
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: cm-post.R.txt
Url: https://stat.ethz.ch/pipermail/r-devel/attachments/20080414/d4e41782/attachment.txt
