Y.Xia - Final Year Mathematician
2007-Nov-25 19:35 UTC
[R] Assessing accuracy of clustering output
Dear All,Say I've got a 2 by 3 data matrix with numerical entries A, B, C, D. The 3rd column labels the real cluster of these values (A and B belong to cluster 1 and C, D belong to cluster 2.) Now if I perform a clustering algorithm such as kmeans on those numerical part of the matrix with pre-specified k=2), which gives, for example, A,C belongs to cluster 3 and B, D belongs to cluster 4, what R codes do I use to check the accuracy of such output based on knowledge of the real clusters in the data matrix? The whole point of this is to compare the performance of various clustering algorithms with the "Real" information. Yours sincerely, Yang [[alternative HTML version deleted]]