TEMPL Matthias
2004-May-24 14:39 UTC
AW: [R] non-hierarchical non-exclusive clustering of large data sets
I think the "cmeans" method in library(e1071) works better for large data sets as "fanny". (note,not for this data: fanny has also problems with standardized large data sets - here produce fanny the same memberships for all observations; cmeans works "correctly") Matthias> -----Urspr??ngliche Nachricht----- > Von: Bhaskar S. Manda [mailto:bhaskar at cacmnet.com] > Gesendet: Montag, 24. Mai 2004 17:12 > An: r-help at stat.math.ethz.ch > Betreff: Re: [R] non-hierarchical non-exclusive clustering of > large data sets > > > On Mon, 24 May 2004 15:58:57 +0200, Murk Wuite wrote: > > I'm trying to use R to cluster words with related meanings. Does > > anyone know of a non-hierarchical clustering method in R > that produces > > non-exclusive clusters? With non-exclusive, I mean that words should > > The "fanny" method in library(cluster) outputs probabilities > of membership in each cluster. > > > the one above, its dimensions would be in the order of (100000, > > 100000). Does anyone know if this would cause practical problems, > > perhaps very slow clustering? > > I had a much smaller matrix, 4000x3, fanny took about 4 > minutes wall clock time on a lightly loaded (there were many > other processes, but none > computational) 1.4 GHz Athlon, It was completely CPU-bound. > > -- > bhaskar > > ______________________________________________ > R-help at stat.math.ethz.ch mailing list > https://www.stat.math.ethz.ch/mailman/listinfo> /r-help > PLEASE > do read the posting guide! > http://www.R-project.org/posting-guide.html >