similar to: Clustering quality measure

2003 Jun 12
3
Multiple imputation
Hi all, I'm currently working with a dataset that has quite a few missing values and after some investigation I figured that multiple imputation is probably the best solution to handle the missing data in my case. I found several references to functions in S-Plus that perform multiple imputation (NORM, CAT, MIX, PAN). Does R have corresponding functions? I searched the archives but was not
2003 Jul 24
3
Plotting math functions
Hi all, I was wondering whether it is possible to plot math functions, for example sin, cos or a Gaussian type function, in R, and if so, how to do it. I have been searching through the archives and the R manual but had no luck in finding any hints on how to go about this. Any help is much appreciated! Thanks, Jonck
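One way to approach the plotting question above, as a minimal sketch in base R: `curve()` accepts either a function name or an expression in `x`. The ranges and axis labels below are illustrative choices, not taken from the thread.

```r
# Plot a built-in function by name over one period.
sine <- curve(sin, from = 0, to = 2 * pi, ylab = "sin(x)")

# Plot an expression in x: an unnormalised Gaussian bump.
gauss <- curve(exp(-x^2 / 2), from = -4, to = 4, ylab = "exp(-x^2 / 2)")
```

`curve()` also returns the evaluated points invisibly, so the `x` and `y` values can be reused after plotting.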
2003 Jun 11
3
Multiple match function?
Hi all, I have (yet another) question about a function in R. What I would like to do is test for the presence of a certain value in a vector, and have the positions that this value is at returned to me. For example, let's say I have a vector: x <- c(1,1,2,2,3,3,4,4) Now I would like a function that would return positions 3 and 4 should I test for the value "2". Or 5 and 6
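For the vector in the question above, base R's `which()` applied to a logical comparison returns every matching position. A minimal sketch (the variable name `positions` is illustrative):

```r
x <- c(1, 1, 2, 2, 3, 3, 4, 4)

# x == 2 yields a logical vector; which() returns the indices that are TRUE.
positions <- which(x == 2)
print(positions)
```

By contrast, `match(2, x)` reports only the first matching position.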
2003 Apr 16
1
Question on SOM and clustering
Hello everyone, I'm new to this list, so let me introduce myself: my name is Jonck van der Kogel and I am a graduate student at the Erasmus University of Rotterdam. I am currently working on my thesis which is on the use of artificial intelligence for large data-sets. To do an analysis of a certain data-set I want to use Kohonen's SOM algorithm. However, as I understand it, the SOM
2006 Feb 05
3
Cluster Analysis - Number of Clusters
Hello, I'm playing around with cluster analysis, and am looking for methods to select the number of clusters. I am aware of methods based on a 'pseudo F' or a 'pseudo T^2'. Are there packages in R that will generate these statistics, and/or other statistics to aid in cluster number selection? Thanks, John. --
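As one possible aid for choosing the number of clusters, the sketch below compares the average silhouette width across candidate values of k using `pam()` from the cluster package. This is a different criterion from the pseudo F and pseudo T^2 statistics mentioned in the question, and the toy data are an assumption made purely for illustration.

```r
library(cluster)  # recommended package, normally installed alongside R

set.seed(1)
# Toy data (illustrative only): two Gaussian blobs in two dimensions.
x <- rbind(matrix(rnorm(100, mean = 0), ncol = 2),
           matrix(rnorm(100, mean = 5), ncol = 2))
d <- dist(x)

# Average silhouette width for each candidate number of clusters.
ks <- 2:6
avg_width <- sapply(ks, function(k) pam(d, k)$silinfo$avg.width)

# The k with the largest average width is one reasonable candidate.
best_k <- ks[which.max(avg_width)]
```

A larger average silhouette width suggests tighter, better separated clusters, but it is only one heuristic among several.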
2003 Jan 30
2
Validation of clustering
Hi, I'm using the library cluster to cluster a set of figures (method CLARA). Could somebody who works with clustering tell me what I should do to evaluate the clustering? Thanks very much, Francisco. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Francisco Júnior, Computer Science - UFPE-Brazil "One life has more value than the whole world" ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2003 Jun 09
2
Appending elements to an array
Hi all, I am having a bit of trouble with the array structure of R. What I want to do is dynamically add/remove elements to an array. For example: Let's say I have created an array: > myArray <- array(c(3,8), dim=c(1,2)) > myArray [,1] [,2] [1,] 3 8 And I now want to, for example, push an element (5,6) on to this array so it will read: [,1] [,2] [1,] 3 8
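A minimal sketch of one way to grow and shrink the structure in the question above, assuming the goal is to add (5,6) as a new row: a two-dimensional array is a matrix in R, so `rbind()` appends a row and negative indexing drops one.

```r
myArray <- array(c(3, 8), dim = c(1, 2))

# Append (5, 6) as a second row.
myArray <- rbind(myArray, c(5, 6))

# Remove the first row again; drop = FALSE keeps the result a matrix.
trimmed <- myArray[-1, , drop = FALSE]
```

Repeated `rbind()` calls copy the whole object each time, so for many additions it is usually cheaper to collect rows in a list and bind them once at the end.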
2003 Dec 11
1
cutree with agnes
Hi, this is rather a (presumed) bug report than a question because I can solve my personal statistical problem by working with hclust instead of agnes. I have done a complete linkage clustering on a dist object dm with 30 objects with agnes (R 1.8.0 on RedHat) and I want to obtain the partition that results from a cut at height=0.4. I run > cl1a <- agnes(dm, method="complete")
2011 Aug 10
4
Clustering Large Applications..sort of
Hello all, I am using the clustering functions in R in order to work with large masses of binary time series data, however the clustering functions do not seem able to fit this size of practical problem. Library 'hclust' is good (though it may be sub par for this size of problem, thus doubly poor for this application) in that I do not want to make assumptions about the number of
2008 Jun 13
3
cluster.stats
Dear list, I just tried to use the function cluster.stats in the package fpc. I just have a couple of questions about the syntax: cluster.stats(d,clustering,alt.clustering=NULL, silhouette=TRUE,G2=FALSE,G3=FALSE) 1) Is the distance object (d) an object obtained by applying the function dist() to my own original matrix? 2) Is clustering the vector of cluster assignments produced by one of the many clustering methods?
2011 Aug 25
1
question on silhouette colours
I'm fairly new to the silhouette functionality in the cluster package, so apologize if I'm asking something naive. If I run the 'agnes(ruspini)' example from the silhouette section of the cluster package vignette, and assign colours to clusters, two clusters have what appear to be incorrect colours in the silhouette plot. library(cluster) data(ruspini) ar<- agnes(ruspini)
2011 Jun 09
1
k-nn hierarchical clustering
Hi there, is there any R-function for k-nearest neighbour agglomerative hierarchical clustering? By this I mean standard agglomerative hierarchical clustering as in hclust or agnes, but with the k-nearest neighbour distance between clusters used on the higher levels where there are at least k>1 distances between two clusters (single linkage is 1-nearest neighbour clustering)? Best regards,
2003 Apr 24
1
estimating number of clusters ("Null or more")
Hi all, once more about the old subject :-) My data comes from too many different distribution families, and for every particular experiment I just need to decide whether the data is "quite homogeneous" or has two or more clusters. I've revisited the following libraries: amap, clust, cclust, mclust, multiv, normix, survey. And I didn't find any ready-to-use general
2006 Oct 17
4
cluster in R
Hi, is there a good summary of clustering methods in R? It seems there are many packages involving it. And I have two questions on clustering here: 1. Is there a way to evaluate the effectiveness (or separation) of a clustering (rather than by visualization)? 2. Is there a search method (like genetic search) which can help find the subset of attributes that gives the best separation? Thanks,
2002 Aug 07
2
cluster-analysis and NA's
Hi, are there special cluster-analysis algorithms which can work with NAs? A further "problem" is that I want to cluster variables, not cases, to identify special variable sets. Is it common to transpose the data.frame and use kmeans, because this works with NAs, or does anybody have another method for finding "variable sets", with the exception of factor analysis? thanks for
2005 Mar 04
2
Clustering of Binary data in R
Good afternoon! I would like to ask you about similarity measures and clustering in R for binary data. Would you please kindly help me and let me know about the relevant commands in R? Thanks in advance for your kind attention. I look forward to hearing from you as soon as possible. Best regards, Sima
2003 May 07
1
-means, hybrid clustering or similar implementations on R
Hi, I would like to know if someone knows an extended implementation of k-means in R to find appropriate number of clusters for a given k-dimensional data. Also, I am working on clustering for forecasting, if someone is interested or has knowledge on implementational details please mail me, I would appreciate it. Regards Skanda Kallur "Cogito, ergo sum" (I think, therefore I
2004 May 11
1
stability measures for hierarchical clustering
Dear R users, I'm interested in measuring the stability of a hierarchical clustering, both of the overall clustering and of sub-clusters (from cutting the hierarchical clustering at different levels) which demonstrate stability. I saw some postings on the R help from a while back about bootstrapping for clustering (using sample and generating a consensus tree with a web based tool CONSENSE)
2003 Jun 10
1
SOM random seed
Hi all, I have a question about the SOM routine. You can either supply the initial representatives for the lattice yourself or else they are chosen randomly from the dataset. Is it possible to pass the random-seed as an argument somehow, when choosing the random initialisation of the lattice? As it is now, each time I run a SOM on a dataset with the same settings the resulting SOM will still