thr3ads.net - R help - [R] Generating groupings of ordered observations [Jun 2008]

If this information is useful, please help other people find it:
Share via:

Gavin Simpson

2008-Jun-21 16:40 UTC

[R] Generating groupings of ordered observations

Dear List,

I have a problem I'm finding it difficult to make headway with.

Say I have 6 ordered observations, and I want to find all combinations
of splitting these 6 ordered observations in g groups, where g = 1, ...,
6. Groups can only be formed by adjacent observations, so observations 1
and 4 can't be in a group on their own, only if 1,2,3&4 are all in the
group.

For example, with 6 observations, the columns of the matrices below
represent the groups that can be formed by placing the 6 ordered
observations into 2-5 groups. Think of the columns of these matrices as
being an indicator of group membership. We then cbind these matrices
with the trivial partitions into 1 and 6 groups:

mat2g <- matrix(c(1,1,1,1,1,
                  2,1,1,1,1,
                  2,2,1,1,1,
                  2,2,2,1,1,
                  2,2,2,2,1,
                  2,2,2,2,2),
                nrow = 6, ncol = 5, byrow = TRUE)

mat3g <- matrix(c(1,1,1,1,1,1,1,1,1,1,
                  2,2,2,2,1,1,1,1,1,1,
                  3,2,2,2,2,2,2,1,1,1,
                  3,3,2,2,3,2,2,2,2,1,
                  3,3,3,2,3,3,2,3,2,2,
                  3,3,3,3,3,3,3,3,3,3),
                nrow = 6, ncol = 10, byrow = TRUE)

mat4g <- matrix(c(1,1,1,1,1,1,1,1,1,1,
                  2,2,2,2,2,2,1,1,1,1,
                  3,3,3,2,2,2,2,2,2,1,
                  4,3,3,3,3,2,3,3,2,2,
                  4,4,3,4,3,3,4,3,3,3,
                  4,4,4,4,4,4,4,4,4,4),
                nrow = 6, ncol = 10, byrow = TRUE)

mat5g <- matrix(c(1,1,1,1,1,
                  2,2,2,2,1,
                  3,3,3,2,2,
                  4,4,3,3,3,
                  5,4,4,4,4,
                  5,5,5,5,5),
                nrow = 6, ncol = 5, byrow = TRUE)

cbind(rep(1,6), mat2g, mat3g, mat4g, mat5g, 1:6)

I'd like to be able to do this automagically, for any (reasonable,
small, say n = 10-20) number of observations, n, and for g = 1, ..., n
groups.

I can't see the pattern here or a way forward. Can anyone suggest an
approach?

Thanks in advance,

Gavin

-- 
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%
 Dr. Gavin Simpson             [t] +44 (0)20 7679 0522
 ECRC, UCL Geography,          [f] +44 (0)20 7679 0565
 Pearson Building,             [e] gavin.simpsonATNOSPAMucl.ac.uk
 Gower Street, London          [w] http://www.ucl.ac.uk/~ucfagls/
 UK. WC1E 6BT.                 [w] http://www.freshwaters.org.uk
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%

Charles C. Berry

2008-Jun-21 18:30 UTC

head link

[R] Generating groupings of ordered observations

On Sat, 21 Jun 2008, Gavin Simpson wrote:
> Dear List,
>
> I have a problem I'm finding it difficult to make headway with.
>
> Say I have 6 ordered observations, and I want to find all combinations
> of splitting these 6 ordered observations in g groups, where g = 1, ...,
> 6. Groups can only be formed by adjacent observations, so observations 1
> and 4 can't be in a group on their own, only if 1,2,3&4 are all in
the
> group.
>
Right. And in the example below there are 32 distinct patterns.

Which arises from sum( choose( 5, 0:5 ) ) different placements of 0:5 
split positions.

You can represent the splits as a binary number with n-1 bits: 00000 
implies no splits, 10000 implies a split between 1 and 2, 10100 implies 
splits between 1 and 2 and between 3 and 4, et cetera.

So, 32 arises as 2^5, too.

Something like this:
> base10 <- seq(0, length=2^(n-1) )
> base2.bits <- outer(0:(n-2), base10, function(y,x) ( x %/% (2^y)) %%2 )
> sapply(apply( base2.bits==1, 2, which ), function(x) rep(1:(1+length(x)),
diff(c(0,x,n))))
Getting this in the same column order as your example is left as an 
exercise for the reader.

HTH,

Chuck
> For example, with 6 observations, the columns of the matrices below
> represent the groups that can be formed by placing the 6 ordered
> observations into 2-5 groups. Think of the columns of these matrices as
> being an indicator of group membership. We then cbind these matrices
> with the trivial partitions into 1 and 6 groups:
>
> mat2g <- matrix(c(1,1,1,1,1,
>                  2,1,1,1,1,
>                  2,2,1,1,1,
>                  2,2,2,1,1,
>                  2,2,2,2,1,
>                  2,2,2,2,2),
>                nrow = 6, ncol = 5, byrow = TRUE)
>
> mat3g <- matrix(c(1,1,1,1,1,1,1,1,1,1,
>                  2,2,2,2,1,1,1,1,1,1,
>                  3,2,2,2,2,2,2,1,1,1,
>                  3,3,2,2,3,2,2,2,2,1,
>                  3,3,3,2,3,3,2,3,2,2,
>                  3,3,3,3,3,3,3,3,3,3),
>                nrow = 6, ncol = 10, byrow = TRUE)
>
> mat4g <- matrix(c(1,1,1,1,1,1,1,1,1,1,
>                  2,2,2,2,2,2,1,1,1,1,
>                  3,3,3,2,2,2,2,2,2,1,
>                  4,3,3,3,3,2,3,3,2,2,
>                  4,4,3,4,3,3,4,3,3,3,
>                  4,4,4,4,4,4,4,4,4,4),
>                nrow = 6, ncol = 10, byrow = TRUE)
>
> mat5g <- matrix(c(1,1,1,1,1,
>                  2,2,2,2,1,
>                  3,3,3,2,2,
>                  4,4,3,3,3,
>                  5,4,4,4,4,
>                  5,5,5,5,5),
>                nrow = 6, ncol = 5, byrow = TRUE)
>
> cbind(rep(1,6), mat2g, mat3g, mat4g, mat5g, 1:6)
>
> I'd like to be able to do this automagically, for any (reasonable,
> small, say n = 10-20) number of observations, n, and for g = 1, ..., n
> groups.
>
> I can't see the pattern here or a way forward. Can anyone suggest an
> approach?
>
> Thanks in advance,
>
> Gavin
>
> -- 
> %~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%
> Dr. Gavin Simpson             [t] +44 (0)20 7679 0522
> ECRC, UCL Geography,          [f] +44 (0)20 7679 0565
> Pearson Building,             [e] gavin.simpsonATNOSPAMucl.ac.uk
> Gower Street, London          [w] http://www.ucl.ac.uk/~ucfagls/
> UK. WC1E 6BT.                 [w] http://www.freshwaters.org.uk
> %~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
Charles C. Berry                            (858) 534-2098
                                             Dept of Family/Preventive Medicine
E mailto:cberry at tajo.ucsd.edu	            UC San Diego
http://famprevmed.ucsd.edu/faculty/cberry/  La Jolla, San Diego 92093-0901

Gabor Grothendieck

2008-Jun-21 21:56 UTC

head link

[R] Generating groupings of ordered observations

On Sat, Jun 21, 2008 at 12:40 PM, Gavin Simpson <gavin.simpson at
ucl.ac.uk> wrote:> Dear List,
>
> I have a problem I'm finding it difficult to make headway with.
>
> Say I have 6 ordered observations, and I want to find all combinations
> of splitting these 6 ordered observations in g groups, where g = 1, ...,
> 6. Groups can only be formed by adjacent observations, so observations 1
> and 4 can't be in a group on their own, only if 1,2,3&4 are all in
the
> group.
>
> For example, with 6 observations, the columns of the matrices below
> represent the groups that can be formed by placing the 6 ordered
> observations into 2-5 groups. Think of the columns of these matrices as
> being an indicator of group membership. We then cbind these matrices
> with the trivial partitions into 1 and 6 groups:
>
> mat2g <- matrix(c(1,1,1,1,1,
>                  2,1,1,1,1,
>                  2,2,1,1,1,
>                  2,2,2,1,1,
>                  2,2,2,2,1,
>                  2,2,2,2,2),
>                nrow = 6, ncol = 5, byrow = TRUE)
>
> mat3g <- matrix(c(1,1,1,1,1,1,1,1,1,1,
>                  2,2,2,2,1,1,1,1,1,1,
>                  3,2,2,2,2,2,2,1,1,1,
>                  3,3,2,2,3,2,2,2,2,1,
>                  3,3,3,2,3,3,2,3,2,2,
>                  3,3,3,3,3,3,3,3,3,3),
>                nrow = 6, ncol = 10, byrow = TRUE)
>
> mat4g <- matrix(c(1,1,1,1,1,1,1,1,1,1,
>                  2,2,2,2,2,2,1,1,1,1,
>                  3,3,3,2,2,2,2,2,2,1,
>                  4,3,3,3,3,2,3,3,2,2,
>                  4,4,3,4,3,3,4,3,3,3,
>                  4,4,4,4,4,4,4,4,4,4),
>                nrow = 6, ncol = 10, byrow = TRUE)
>
> mat5g <- matrix(c(1,1,1,1,1,
>                  2,2,2,2,1,
>                  3,3,3,2,2,
>                  4,4,3,3,3,
>                  5,4,4,4,4,
>                  5,5,5,5,5),
>                nrow = 6, ncol = 5, byrow = TRUE)
>
> cbind(rep(1,6), mat2g, mat3g, mat4g, mat5g, 1:6)
>
> I'd like to be able to do this automagically, for any (reasonable,
> small, say n = 10-20) number of observations, n, and for g = 1, ..., n
> groups.
>
> I can't see the pattern here or a way forward. Can anyone suggest an
> approach?
>
Peter Wolf has APL-style encode/decode functions on his web site that
can readily be used for this.  The output of the encode below are the binary
digits expansions of the numbers 0:15, one per column, and the remainder
transforms that matrix to the required one (but columns are in a different
order than yours):
>
source("http://www.wiwi.uni-bielefeld.de/~wolf/software/R-wtools/decodeencode/decodeencode.R")
> n <- 6
> n1 <- n-1
> apply(rbind(1, encode(0:(2^n1-1), rep(2, n1))), 2, cumsum)     [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] [,11] [,12]
[,13] [,14] [,15] [,16] [,17] [,18] [,19] [,20] [,21] [,22] [,23]
[,24] [,25] [,26]
[1,]    1    1    1    1    1    1    1    1    1     1     1     1
 1     1     1     1     1     1     1     1     1     1     1     1
  1     1
[2,]    1    1    1    1    1    1    1    1    1     1     1     1
 1     1     1     1     2     2     2     2     2     2     2     2
  2     2
[3,]    1    1    1    1    1    1    1    1    2     2     2     2
 2     2     2     2     2     2     2     2     2     2     2     2
  3     3
[4,]    1    1    1    1    2    2    2    2    2     2     2     2
 3     3     3     3     2     2     2     2     3     3     3     3
  3     3
[5,]    1    1    2    2    2    2    3    3    2     2     3     3
 3     3     4     4     2     2     3     3     3     3     4     4
  3     3
[6,]    1    2    2    3    2    3    3    4    2     3     3     4
 3     4     4     5     2     3     3     4     3     4     4     5
  3     4
     [,27] [,28] [,29] [,30] [,31] [,32]
[1,]     1     1     1     1     1     1
[2,]     2     2     2     2     2     2
[3,]     3     3     3     3     3     3
[4,]     3     3     4     4     4     4
[5,]     4     4     4     4     5     5
[6,]     4     5     4     5     5     6

Apparently Analagous Threads

Search for more seemingly similar threads

R help - Jun 2008 - Generating groupings of ordered observations

[R] Generating groupings of ordered observations

[R] Generating groupings of ordered observations

[R] Generating groupings of ordered observations

Apparently Analagous Threads