Parodi, Pietro
2011-Nov-04 12:51 UTC
[R] Counting number of common elements between the rows of two different matrices
Hello
I'm trying to solve this problem without using a for loop but I have so
far failed to find a solution.
I have two matrices of K columns each, e.g. (K=5), and with numbers of
row N_A and N_B respectively
A = (1 5 3 8 15;
2 7 20 11 13;
12 19 20 21 43)
B = (2 6 30 8 16;
3 8 19 11 13)
(the actual matrices have hundreds of thousands of entry, that's why I'm
keen to avoid "for" loops)
And what I need to do is to apply a function which counts the number of
common elements between ANY row of A and ANY row of B, giving a result
like this:
A1 vs B1: 1 # (8 is a common element)
A1 vs B2: 1 # (8 is a common element)
A2 vs B1: 1 # (2 is a common element)
A2 vs B2: 1 # 11, 13 are common elements
Etc.
I've built a function that counts the number of common elements between
two vectors, based on the intersect function in the R manual
common_elements <- function(x,y) length(y[match(x,y,nomatch=0)])
And a double loop who solves my problem would be something like
(pseudo-code)
For(i in 1:N_A){
for(j in 1:N_B){
ce(i,j)=common_elements(a(i),b(j))
}
}
Is there an efficient, clean way to do the same job and give as an
output a matrix N_A x N_B such as that above?
Thanks a lot for your help
Regards
Pietro
______________________________________________________________________
For information pertaining to Willis' email confidentiality and monitoring
policy, usage restrictions, or for specific company registration and regulatory
status information, please visit http://www.willis.com/email_trailer.aspx
We are now able to offer our clients an encrypted email capability for secure
communication purposes. If you wish to take advantage of this service or learn
more about it, please let me know or contact your Client Advocate for full
details. ~W67897
jim holtman
2011-Nov-04 13:37 UTC
[R] Counting number of common elements between the rows of two different matrices
Try this:
# create dummy data
a <- matrix(sample(20, 50, TRUE), ncol = 5)
b <- matrix(sample(20, 50, TRUE), ncol = 5)
# create combinations to test
x <- expand.grid(seq(nrow(a)), seq(nrow(b)))
# test
result <- mapply(function(m1, m2) any(a[m1, ] %in% b[m2, ])
, x[, 1]
, x[, 2]
)
# create the output matrix
result.m <- matrix(result, nrow = nrow(a), ncol = nrow(b))
On Fri, Nov 4, 2011 at 8:51 AM, Parodi, Pietro <Pietro.Parodi at
willis.com> wrote:>
> Hello
>
> I'm trying to solve this problem without using a for loop but I have so
> far failed to find a solution.
>
> I have two matrices of K columns each, e.g. (K=5), and with numbers of
> row N_A and N_B respectively
>
> A = ? ? (1 5 3 8 15;
> ? ? ? ? 2 7 20 11 13;
> ? ? ? ? 12 19 20 21 43)
>
> B = ? ? (2 6 30 8 16;
> ? ? ? ? 3 8 19 11 13)
>
> (the actual matrices have hundreds of thousands of entry, that's why
I'm
> keen to avoid "for" loops)
>
> And what I need to do is to apply a function which counts the number of
> common elements between ANY row of A and ANY row of B, giving a result
> like this:
>
>
> A1 vs B1: ?1 ?# (8 is a common element)
> A1 vs B2: ?1 ?# (8 is a common element)
> A2 vs B1: ?1 ?# (2 is a common element)
> A2 vs B2: ?1 ?# 11, 13 are common elements
> Etc.
>
> I've built a function that counts the number of common elements between
> two vectors, based on the intersect function in the R manual
>
> common_elements <- function(x,y) length(y[match(x,y,nomatch=0)])
>
> And a double loop who solves my problem would be something like
> (pseudo-code)
>
> For(i in 1:N_A){
> ? ? ? ?for(j in 1:N_B){
> ? ? ? ? ? ? ? ?ce(i,j)=common_elements(a(i),b(j))
> ? ? ? ? ? ? ? ?}
> ? ? ? ?}
>
> Is there an efficient, clean way to do the same job and give as an
> output a matrix N_A x N_B such as that above?
>
> Thanks a lot for your help
>
> Regards
>
> Pietro
>
> ______________________________________________________________________
>
> For information pertaining to Willis' email confidentiality and
monitoring policy, usage restrictions, or for specific company registration and
regulatory status information, please visit
http://www.willis.com/email_trailer.aspx
>
> We are now able to offer our clients an encrypted email capability for
secure communication purposes. If you wish to take advantage of this service or
learn more about it, please let me know or contact your Client Advocate for full
details. ~W67897
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
--
Jim Holtman
Data Munger Guru
What is the problem that you are trying to solve?
Tell me what you want to do, not how you want to do it.