thr3ads.net - similar to: "plot and validation in clustering"

Displaying 20 results from an estimated 5000 matches similar to: "plot and validation in clustering"

How to access to sum of dissimilarities in CLARA

2005 May 30

How to access to sum of dissimilarities in CLARA

Dear All , Since dissimilarity is one of quality measures in clustering , I'm trying to access to the sum of dissimilarity as a whole measure. But after running my data using CLARA I obtain : 1128 dissimilarities, summarized : Min. 1st Qu. Median Mean 3rd Qu. Max. 0.033155 0.934630 2.257000 2.941600 4.876600 8.943700 But I can not find the sum of dissimilarity.How can i

Cluster analysis, defining center seeds or number of clusters

2009 Jun 11

Cluster analysis, defining center seeds or number of clusters

I use kmeans to classify spectral events in high and low 1/3 octave bands: #Do cluster analysis CyclA<-data.frame(LlowA,LhghA) CntrA<-matrix(c(0.9,0.8,0.8,0.75,0.65,0.65), nrow = 3, ncol=2, byrow=TRUE) ClstA<-kmeans(CyclA,centers=CntrA,nstart=50,algorithm="MacQueen") This works well when the actual data shows 1,2 or 3 groups that are not "too close" in a cross plot.

About clustering techniques

2008 Jul 29

About clustering techniques

Hello R users It's some time I am playing with a dataset to do some cluster analysis. The data set consists of 14 columns being geographical coordinates and monthly temperatures in annual files latitutde - longitude - temperature 1 -..... - temperature 12 I have some missing values in some cases, maybe there are 8 monthly valid values at some points with four non valid. I don't want to

Clustering Large Applications..sort of

2011 Aug 10

Clustering Large Applications..sort of

Hello all, I am using the clustering functions in R in order to work with large masses of binary time series data, however the clustering functions do not seem able to fit this size of practical problem. Library 'hclust' is good (though it may be sub par for this size of problem, thus doubly poor for this application) in that I do not want to make assumptions about the number of

silhouette plot for kmeans result

2008 Mar 03

silhouette plot for kmeans result

Dear All, Is there any existing code for plotting silhouette for kmeans clustering results? Many thanks! Linda [[alternative HTML version deleted]]

"partitioning cluster function"

2006 Apr 05

"partitioning cluster function"

Hi All, For the function "bclust"(e1071), the argument "base.method" is explained as "must be the name of a partitioning cluster function returning a list with the same components as the return value of 'kmeans'. In my understanding, there are three partitioning cluster functions in R, which are "clara, pam, fanny". Then I check each of them to

Cluster Analysis - Number of Clusters

2006 Feb 05

Cluster Analysis - Number of Clusters

Hello, I'm playing around with cluster analysis, and am looking for methods to select the number of clusters. I am aware of methods based on a 'pseudo F' or a 'pseudo T^2'. Are there packages in R that will generate these statistics, and/or other statistics to aid in cluster number selection? Thanks, John. --

bug (?!) in "pam()" clustering from fpc package ?

2008 Dec 17

bug (?!) in "pam()" clustering from fpc package ?

Hello all. I wish to run k-means with "manhattan" distance. Since this is not supported by the function "kmeans", I turned to the "pam" function in the "fpc" package. Yet, when I tried to have the algorithm run with different starting points, I found that pam ignores and keep on starting the algorithm from the same starting-points (medoids). For my

weithed clustering (was: Re: problems with a large data set)

2001 Apr 27

weithed clustering (was: Re: problems with a large data set)

kmeans and clara work great. Thank you for the tip. I have another question: Is it possible to weight the observations in a cluster analysis ? I haven't found any mention of this in the kmeans of clara help texts. Moritz Lennert Charg? de recherche IGEAT - ULB t?l: 32-2-650.65.16 fax: 32-2-650.50.92 email: mlennert at ulb.ac.be > On Wed, 25 Apr 2001, Moritz Lennert wrote: >

Clustering large data matrix

2008 Mar 06

Clustering large data matrix

Hello, I have a large data matrix (68x13112), each row corresponding to one observation (patients) and each column corresponding to the variables (points within an NMR spectrum). I would like to carry out some kind of clustering on these data to see how many clusters are there. I have tried the function clara() from the package cluster. If I use the matrix as is, I can perform the clara

Plotting K-means clustering results on an MDS

2010 Aug 18

Plotting K-means clustering results on an MDS

Hello All, I'm having some trouble figuring out what the clearest way to plot my k-means clustering result on an my existing MDS. First I performed MDS on my distance matrix (note: I performed k-means on the MDS coordinates because applying a euclidean distance measure to my raw data would have been inappropriate) canto.MDS<-cmdscale(canto) I then figured out what would be my optimum

cluster size

2009 Dec 11

cluster size

hi r-help, i am doing kmeans clustering in stats. i tried for five clusters clustering using: kcl1 <- kmeans(as1[,c("contlife","somlife","agglife","sexlife", "rellife","hordlife","doutlife","symtlife","washlife",

Clustering and Rand Index

2006 Jan 07

Clustering and Rand Index

Dear WizaRds, I am trying to compute the (adjusted) Rand Index in order to comprehend the variable selection heuristic (VS-KM) according to Brusco/ Cradit 2001 (Psychometrika 66 No.2 p.249-270, 2001). Unfortunately, I am unable to correctly use cl_ensemble and cl_agreement (package: clue). Here is what I am trying to do: library(clue) ## Let p1..p4 be four partitions of the kind

K-Means clustering Algorithm

2012 Aug 28

K-Means clustering Algorithm

I was wondering if there was an R equivalent to the two phased approach that MATLAB uses in performing the Kmeans algorithm. If not is there away that I can determine if the kmeans in R and the kmeans in MATLAB are essentially giving me the same clustering information within a small amount of error? -- View this message in context:

kmeans clustering

2003 Apr 14

kmeans clustering

Hi, I am using kmeans to cluster a dataset. I test this example: > data<-matrix(scan("data100.txt"),100,37,byrow=T) (my dataset is 100 rows and 37 columns--clustering rows) > c1<-kmeans(data,3,20) > c1 $cluster [1] 1 1 1 1 1 1 1 3 3 3 1 3 1 3 3 1 1 1 1 3 1 3 3 1 1 1 3 3 1 1 3 1 1 1 1 3 3 [38] 3 1 1 1 3 1 1 1 1 3 3 3 1 1 1 1 1 1 3 1 3 1 1 3 1 1 1 1 3 1 1 1 1 1 1 3

K-Means Clustering

2008 May 09

K-Means Clustering

Hello, I am hoping you can help me with a question concerning kmeans clustering in R. I am working with the following data-set (abbreviated): BMW Ford Infiniti Jeep Lexus Chrysler Mercedes Saab Porsche Volvo [1,] 6 8 2 8 4 5 4 4 7 7 [2,] 8 7 4 6 4 1 6 7 8 5 [3,] 8 2 4

Cluster analysis, factor variables, large data set

2011 Mar 31

Cluster analysis, factor variables, large data set

Dear R helpers, I have a large data set with 36 variables and about 50.000 cases. The variabels represent labour market status during 36 months, there are 8 different variable values (e.g. Full-time Employment, Student,...) Only cases with at least one change in labour market status is included in the data set. To analyse sub sets of the data, I have used daisy in the cluster-package to create

Can any one help me on format file data.

2007 Jul 18

Can any one help me on format file data.

Hi all. I'd like know what is the format file saved by Leica Microsystems TCS SP2-AOBS equipped with a SP2-FCS2 Leica Microsystems workstation its datas. Cause it save in *.fcs extention file but ins't flow cytometry standart format file... Tahnks Horacio.

EM unsupervised clustering

2007 Jul 18

EM unsupervised clustering

Hi All, I have a n x m matrix. The n rows are individuals, the m columns are variables. The matrix is in itself a collection of 1s (if a variable is observed for an individual), and 0s (is there is no observation). Something like: [,1] [,2] [,3] [,4] [,5] [,6] [1,] 1 0 1 1 0 0 [2,] 1 0 1 1 0 0 [3,] 1 0 1 1 0 0 [4,] 0 1 0

Validation of clustering

2003 Jan 30

Validation of clustering

Hi, I'm using the library cluster to cluster a set of figures (method CLARA). Somebody that it work with clustering would know informs what I make to evaluate the clustering? Tks VM, Francisco. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Francisco Júnior, Computer Science - UFPE-Brazil "One life has more value that the world whole" ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

similar to: plot and validation in clustering