hi, is there some good summary on clustering methods in R? It seems there are many packages involving it. And I have two questions on clustering here: 1. Is there a way of evaluate the effecitives (or seperation) of clustering (rather than by visualization)? 2. Is there a search method (like genetic search) which can help find the best subset of attributes which gives best seperation? Thanks, -- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..." ---Matrix III
Go the R home page (google for R), click on CRAN in left pane, choose a mirror, click on Task Views in left pane and choose Cluster. On 10/17/06, Weiwei Shi <helprhelp at gmail.com> wrote:> hi, > > is there some good summary on clustering methods in R? It seems there > are many packages involving it. > > And I have two questions on clustering here: > > 1. Is there a way of evaluate the effecitives (or seperation) of > clustering (rather than by visualization)? > > 2. Is there a search method (like genetic search) which can help find > the best subset of attributes which gives best seperation? > > Thanks, > > -- > Weiwei Shi, Ph.D > Research Scientist > GeneGO, Inc. > > "Did you always know?" > "No, I did not. But I believed..." > ---Matrix III > > ______________________________________________ > R-help at stat.math.ethz.ch mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >
On 10/17/06, Weiwei Shi <helprhelp at gmail.com> wrote:> is there some good summary on clustering methods in R? It seems there > are many packages involving it.Gabor provided this very useful link a couple of days back. http://cran.r-project.org/src/contrib/Views/Cluster.html jab -- John Bollinger, CFA, CMT www.BollingerBands.com If you advance far enough, you arrive at the beginning.
hi, I just happened to find that page. But it seems too brief to me. For example, my project involves non-determined cluster number and non-determined attributes for the would-be-clustered samples. What kind of methods should I start with? Thanks a lot for the prompty reply. W. On 10/17/06, Gabor Grothendieck <ggrothendieck at gmail.com> wrote:> Go the R home page (google for R), click on CRAN in left pane, choose > a mirror, click on Task Views in left pane and choose > Cluster. > > On 10/17/06, Weiwei Shi <helprhelp at gmail.com> wrote: > > hi, > > > > is there some good summary on clustering methods in R? It seems there > > are many packages involving it. > > > > And I have two questions on clustering here: > > > > 1. Is there a way of evaluate the effecitives (or seperation) of > > clustering (rather than by visualization)? > > > > 2. Is there a search method (like genetic search) which can help find > > the best subset of attributes which gives best seperation? > > > > Thanks, > > > > -- > > Weiwei Shi, Ph.D > > Research Scientist > > GeneGO, Inc. > > > > "Did you always know?" > > "No, I did not. But I believed..." > > ---Matrix III > > > > ______________________________________________ > > R-help at stat.math.ethz.ch mailing list > > https://stat.ethz.ch/mailman/listinfo/r-help > > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > > and provide commented, minimal, self-contained, reproducible code. > > >-- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..." ---Matrix III
Dear Weiwei,> 1. Is there a way of evaluate the effecitives (or seperation) of > clustering (rather than by visualization)?The function cluster.stats in package fpc computes several cluster validation statistics (among them the average silhouette width). Function clusterboot in the same package (recent version) assesses cluster stability. There are several interfaces to clustering methods implemented in R which are documented on the help page of kmeansCBI (which gives you kind of an overview of available "general purpose" clustering methods in R though I may have missed some). There are also several methods for the visualization of separation (I know that you didn't ask for that) for which the function plotcluster is an interface. Best, Christian *** --- *** Christian Hennig University College London, Department of Statistical Science Gower St., London WC1E 6BT, phone +44 207 679 1698 chrish at stats.ucl.ac.uk, www.homepages.ucl.ac.uk/~ucakche