thr3ads.net - similar to: "Clustering quality measure"

Displaying 20 results from an estimated 4000 matches similar to: "Clustering quality measure"

2003 Jun 12

Multiple imputation

Hi all, I'm currently working with a dataset that has quite a few missing values and after some investigation I figured that multiple imputation is probably the best solution to handle the missing data in my case. I found several references to functions in S-Plus that perform multiple imputation (NORM, CAT, MIX, PAN). Does R have corresponding functions? I searched the archives but was not

2003 Jul 24

Plotting math functions

Hi all, I was wondering whether it is possible to plot math functions, for example sin, cos or a Gaussian type function, in R, and if so, how to do it. I have been searching through the archives and the R manual but had no luck in finding any hints on how to go about this. Any help is much appreciated! Thanks, Jonck
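The kind of plot asked about here can be sketched with base R's `curve()`. This is only an illustration, not the reply given in the thread; the helper `gauss` and its parameters `mu` and `sigma` are hypothetical names introduced for the example.

```r
# Hypothetical helper: Gaussian density with mean mu and standard deviation sigma
gauss <- function(x, mu = 0, sigma = 1) {
  exp(-(x - mu)^2 / (2 * sigma^2)) / (sigma * sqrt(2 * pi))
}

# Plot sin over one period, then overlay cos on the same axes
curve(sin, from = -pi, to = pi, ylab = "f(x)")
curve(cos, from = -pi, to = pi, add = TRUE, lty = 2)

# Plot the Gaussian-type function in a new figure
curve(gauss(x), from = -4, to = 4, ylab = "density")
```

`curve()` accepts either a function name or an expression in `x`, which is why both call styles appear above.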

2003 Jun 11

Multiple match function?

Hi all, I have (yet another) question about a function in R. What I would like to do is test for the presence of a certain value in a vector, and have the positions that this value is at returned to me. For example, let's say I have a vector: x <- c(1,1,2,2,3,3,4,4) Now I would like a function that would return positions 3 and 4 should I test for the value "2". Or 5 and 6
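The lookup described in this snippet can be sketched in base R with `which()`. This is a minimal illustration rather than the answer posted in the thread, and the helper name `positions_of` is hypothetical.

```r
# Vector from the post
x <- c(1, 1, 2, 2, 3, 3, 4, 4)

# Hypothetical helper: every position at which `value` occurs in `vec`
positions_of <- function(vec, value) which(vec == value)

positions_of(x, 2)  # 3 4
positions_of(x, 3)  # 5 6
```

Unlike `match()`, which reports only the first hit, `which()` applied to a logical comparison returns all matching positions.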

2003 Apr 16

Question on SOM and clustering

Hello everyone, I'm new to this list, so let me introduce myself: my name is Jonck van der Kogel and I am a graduate student at the Erasmus University of Rotterdam. I am currently working on my thesis which is on the use of artificial intelligence for large data-sets. To do an analysis of a certain data-set I want to use Kohonen's SOM algorithm. However, as I understand it, the SOM

2006 Feb 05

Cluster Analysis - Number of Clusters

Hello, I'm playing around with cluster analysis, and am looking for methods to select the number of clusters. I am aware of methods based on a 'pseudo F' or a 'pseudo T^2'. Are there packages in R that will generate these statistics, and/or other statistics to aid in cluster number selection? Thanks, John.

2003 Jan 30

Validation of clustering

Hi, I'm using the cluster library to cluster a set of figures (method CLARA). Could somebody who works with clustering tell me how to evaluate the clustering? Thanks very much, Francisco. Francisco Júnior, Computer Science - UFPE-Brazil "One life has more value than the whole world"

2003 Jun 09

Appending elements to an array

Hi all, I am having a bit of trouble with the array structure of R. What I want to do is dynamically add/remove elements to an array. For example: Let's say I have created an array:
> myArray <- array(c(3,8), dim=c(1,2))
> myArray
     [,1] [,2]
[1,]    3    8
And I now want to, for example, push an element (5,6) on to this array so it will read:
     [,1] [,2]
[1,]    3    8
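One way to get this push/remove behaviour is `rbind()` plus negative indexing. The sketch below assumes the intended result is a second row (5, 6), since the snippet is cut off before showing it; the variable names `pushed` and `popped` are hypothetical.

```r
# Array from the post: one row, two columns
myArray <- array(c(3, 8), dim = c(1, 2))

# "Push": append (5, 6) as a new last row
pushed <- rbind(myArray, c(5, 6))

# "Remove": drop the first row; drop = FALSE keeps the result two-dimensional
popped <- pushed[-1, , drop = FALSE]

pushed
popped
```

Each `rbind()` call copies the whole array, so when many elements are added in a loop it is usually cheaper to collect them in a list and bind once at the end.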

2003 Dec 11

cutree with agnes

Hi, this is rather a (presumed) bug report than a question because I can solve my personal statistical problem by working with hclust instead of agnes. I have done a complete linkage clustering on a dist object dm with 30 objects with agnes (R 1.8.0 on RedHat) and I want to obtain the partition that results from a cut at height=0.4. I run > cl1a <- agnes(dm, method="complete")

2011 Aug 10

Clustering Large Applications..sort of

Hello all, I am using the clustering functions in R in order to work with large masses of binary time series data, however the clustering functions do not seem able to fit this size of practical problem. Library 'hclust' is good (though it may be sub par for this size of problem, thus doubly poor for this application) in that I do not want to make assumptions about the number of

2008 Jun 13

cluster.stats

Dear list, I just tried to use the function cluster.stats in the package fpc. I have a couple of questions about the syntax: cluster.stats(d,clustering,alt.clustering=NULL, silhouette=TRUE,G2=FALSE,G3=FALSE) 1) Is the distance object (d) an object obtained by applying the function dist() to my own original matrix? 2) Is clustering the vector of cluster assignments produced by one of the many clustering methods?

2011 Aug 25

question on silhouette colours

I'm fairly new to the silhouette functionality in the cluster package, so I apologize if I'm asking something naive. If I run the 'agnes(ruspini)' example from the silhouette section of the cluster package vignette, and assign colours to clusters, two clusters have what appear to be incorrect colours in the silhouette plot. library(cluster) data(ruspini) ar<- agnes(ruspini)

2011 Jun 09

k-nn hierarchical clustering

Hi there, is there any R-function for k-nearest neighbour agglomerative hierarchical clustering? By this I mean standard agglomerative hierarchical clustering as in hclust or agnes, but with the k-nearest neighbour distance between clusters used on the higher levels where there are at least k>1 distances between two clusters (single linkage is 1-nearest neighbour clustering)? Best regards,

2003 Apr 24

estimating number of clusters ("Null or more")

Hi all, once more about the old subject :-) My data come from too many different distribution families, and for every particular experiment I just need to decide whether the data are "quite homogeneous" or have two or more clusters. I've revisited the following libraries: amap, clust, cclust, mclust, multiv, normix, survey. And I didn't find any ready-to-use general

2006 Oct 17

cluster in R

Hi, is there a good summary of clustering methods in R? There seem to be many packages involving it. I also have two questions on clustering: 1. Is there a way of evaluating the effectiveness (or separation) of a clustering (other than by visualization)? 2. Is there a search method (like genetic search) which can help find the subset of attributes that gives the best separation? Thanks,

2002 Aug 07

cluster-analysis and NA's

Hi, do special cluster-analysis algorithms exist that can work with NAs? A further "problem" is that I want to cluster variables, not cases, to identify special variable sets. Is it common to transpose the data.frame and use kmeans, because this works with NAs, or does anybody have another method for finding "variable sets", with the exception of factor analysis? Thanks for

2005 Mar 04

Clustering of Binary data in R

Good afternoon! I would like to ask you about similarity measures and clustering in R for binary data. Would you please kindly help me and let me know the relevant commands in R? Thanks in advance for your kind attention. I look forward to hearing from you as soon as possible. Best regards, Sima

2003 May 07

-means, hybrid clustering or similar implementations on R

Hi, I would like to know if someone knows an extended implementation of k-means in R to find appropriate number of clusters for a given k-dimensional data. Also, I am working on clustering for forecasting, if someone is interested or has knowledge on implementational details please mail me, I would appreciate it. Regards Skanda Kallur "Cogito, ergo sum" (I think, therefore I

2004 May 11

stability measures for hierarchical clustering

Dear R users, I'm interested in measuring the stability of a hierarchical clustering, both of the overall clustering and of sub-clusters (from cutting the hierarchical clustering at different levels) which demonstrate stability. I saw some postings on the R help from a while back about bootstrapping for clustering (using sample and generating a consensus tree with a web based tool CONSENSE)

2003 Jun 10

SOM random seed

Hi all, I have a question about the SOM routine. You can either supply the initial representatives for the lattice yourself or else they are chosen randomly from the dataset. Is it possible to pass the random-seed as an argument somehow, when choosing the random initialisation of the lattice? As it is now, each time I run a SOM on a dataset with the same settings the resulting SOM will still
