thr3ads.net - similar to: "How to access to sum of dissimilarities in CLARA"

Displaying 20 results from an estimated 600 matches similar to: "How to access to sum of dissimilarities in CLARA"

2008 Mar 03

silhouette plot for kmeans result

Dear All, Is there any existing code for plotting silhouette for kmeans clustering results? Many thanks! Linda [[alternative HTML version deleted]]

PAM clustering: using my own dissimilarity matrix

2004 Jun 29

PAM clustering: using my own dissimilarity matrix

Hello, I would like to use my own dissimilarity matrix in a PAM clustering with method "pam" (cluster package) instead of a dissimilarity matrix created by daisy. I read data from a file containing the dissimilarity values using "read.csv". This creates a matrix (alternatively: an array or vector) which is not accepted by "pam": A call

clustering dissimilarities

2002 Dec 13

clustering dissimilarities

Hello. I know my dissimilarity matrix but not my original data. Is there any way i could use the clustering function Mclust or EMclust with this dissimilarity matrix? or at least some equivalent of these functions? As this is model based clustering i dont know if it is actually possible to do it without the original data thanks in advance for your help [[alternate HTML version deleted]]

plot and validation in clustering

2006 Mar 20

plot and validation in clustering

Hi there, I use function "kmeans" and "clara" to cluster one flow cytometry dataset. By using function "plot", the clusters got from "clara" can be graphed, while "kmeans" not. How can I get the plot of the clusters of "kmeans"? And, I hope to compare the two methods "kmeans" and "clara", or in other word, I

CLARA

2003 Nov 17

CLARA

I need informations about the clara routine. The on-line doc say that the argument stand is a logical, indicating if the measurements in x are standardized before calculating the dissimilarities. Measurements are standardized for each variable (column), by subtracting the variable's mean value and dividing by the variable's mean absolute deviation. If we note STAND = TRUE, I suppose that

[cluster package question] What is the "sum of the dissimilarities" in the pam command ?

2009 Mar 29

[cluster package question] What is the "sum of the dissimilarities" in the pam command ?

Hello Martin Maechler and All, A simple question (I hope): How can I compute the "sum of the dissimilarities" that appears in the pam command (from the cluster package) ? Is it the "manhattan" distance (such as the one implemented by "dist") ? I am asking since I am running clustering on a dataset. I found 7 medoids with the pam command, and from it I have the

R function calling. Do I understand this right?

2000 Aug 28

R function calling. Do I understand this right?

I need to write a nonhierarchical clustering routine and I'm studying the way hclust (in the mva library) is built in R to see how things are done and what I can modify. I ran f2c on the hclust.f file (so I could read it in a language I know!) and there is one thing I don't quite understand about the way it gets called and the way it returns values. That Fortran function gets called in

results from sammon()

2005 Apr 20

results from sammon()

Dear all, I'm trying to get a two dimensional embedding of some data using different meythods, among which princomp(), cmds(), sammon() and isoMDS(). I have a problem with sammon() because the coordinates I get are all equal to NA. What does it mean? Why the method fails in finding the coordinates? Can I do anything to get some meaningful results? Thank you very much Domenico

standardization of values before call to pam() or clara()

2006 May 23

standardization of values before call to pam() or clara()

Greetings, Experimenting with the cluster package, and am starting to scratch my head in regards to the *best* way to standardize my data. Both functions can pre-standardize columns in a dataframe. according to the manual: Measurements are standardized for each variable (column), by subtracting the variable's mean value and dividing by the variable's mean absolute deviation. This

agnes clustering and NAs

2011 Jan 27

agnes clustering and NAs

Hello, In the documentation for agnes in the package 'cluster', it says that NAs are allowed, and sure enough it works for a small example like : > m <- matrix(c( 1, 1, 1, 2, 1, NA, 1, 1, 1, 2, 2, 2), nrow = 3, byrow = TRUE) > agnes(m) Call: agnes(x = m) Agglomerative coefficient: 0.1614168 Order of objects: [1] 1 2 3 Height (summary): Min. 1st Qu. Median Mean 3rd

PAM Clustering

2017 Aug 17

PAM Clustering

Dear Germano, Thank you for your fast reply, In the above code, *MYData *is the actual data set. Do not we need to convert *MYData to *the dissimilarity matrix using *pam(as.dist(**MYData**), k = 10, diss = TRUE*)* code line?* *Regards.* On Thu, Aug 17, 2017 at 2:58 PM, Germano Rossi <germano.rossi at gmail.com> wrote: > try this > > MYdata <-

New to R, trying to use agnes, but can't load my ditance matrix

2011 Jun 27

New to R, trying to use agnes, but can't load my ditance matrix

Hi, I'm mighty new to R. I'm using it on Windows. I'm trying to cluster using a distance matrix I created from the data on my own and called it D10.dist. I loaded the cluster package. Then tried the following command... > agnes("E:D10.dist", diss = TRUE, metric = "euclidean", stand = FALSE, > method = "average", par.method, keep.diss = n < 1000,

Cluster Analysis - Number of Clusters

2006 Feb 05

Cluster Analysis - Number of Clusters

Hello, I'm playing around with cluster analysis, and am looking for methods to select the number of clusters. I am aware of methods based on a 'pseudo F' or a 'pseudo T^2'. Are there packages in R that will generate these statistics, and/or other statistics to aid in cluster number selection? Thanks, John. --

cluster a distance(analogue)-object using agnes(cluster)

2008 Sep 02

cluster a distance(analogue)-object using agnes(cluster)

I try to perform a clustering using an existing dissimilarity matrix that I calculated using distance (analogue) I tried two different things. One of them worked and one not and I don`t understand why. Here the code: not working example library(cluster) library(analogue) iris2<-as.data.frame(iris) str(iris2) 'data.frame': 150 obs. of 5 variables: $ Sepal.Length: num 5.1 4.9 4.7

Clustering

2007 Nov 28

Clustering

Hello all! I am performingsome clustering analysis on microarray data using agnes{cluster} and I have created my own dissimilarity matrix according to a distance measure different from "euclidean" or "manhattan" etc. My question is, if I choose for example method="complete", how are the distances between the elements calculated? Are they taken form the dissimilarity

cauculating dissimilarities in R

2006 Sep 26

cauculating dissimilarities in R

Dear All, I?ve got a statistical question on calculating dissimilarities in R. I want to calculate the different types of dissimilarities on the ?flower? dataset found in the package ?cluster?. Flower is a data frame with 18 observations on 8 variables. Variable 1 and 2 are binary, variable 3 is asymmetric binary, variable 4 is nominal, variable 5 and 6 are ordered and variable 7 and 8 are

Cluster package broken in 1.4.0?

2002 Jan 28

Cluster package broken in 1.4.0?

Greetings, I am reasonably experienced with R but I recently tried to do some clustering using the "cluster" package, in order to see if it would help. I only tried this once with the 1.3.1 version and it worked (I don't quite remember which method I used). Now, I tried with the 1.4.0 version and no clustering function seems to work with matrices that contain NAs, even though

How to build a "Amalgamation Schedule"? help!

2012 Jan 23

How to build a "Amalgamation Schedule"? help!

Dear all, I need to process large amounts of data (two or three variables for 6,000 cases) cluster analysis. In the end I need to fill the source data to the obtained clusters. I need to trace the sequence of data fusion. In this case, I can fill in a cluster (with any level of linkage distance) by data. This procedure is implemented in the package Statistica, but this package can not work with

PAM Clustering

2017 Aug 17

PAM Clustering

Sorry, I never use pam. In the help, you can see that pam require a dataframe OR a dissimilarity matrix. If diss=FALSE then "euclidean" was use.So, I interpret that a matrix of dissimilarity is generated automatically. Problems may be in your data. Indeed pam(ruspini, 4)$diss write a dissimilaty matrix while pam(MYdata,10)$diss wite NULL 2017-08-17 16:03 GMT+02:00 Sema Atasever

Why daisy() in cluster library failed to exclude NA when computing dissimilarity

2013 Dec 08

Why daisy() in cluster library failed to exclude NA when computing dissimilarity

Hi, According to daisy function from cluster documentation, it can compute dissimilarity when NA (missing) value(s) is present. http://stat.ethz.ch/R-manual/R-devel/library/cluster/html/daisy.html But why when I tried this code library(cluster) x <- c(1.115,NA,NA,0.971,NA) y <- c(NA,1.006,NA,NA,0.645) df <- as.data.frame(rbind(x,y)) daisy(df,metric="gower") It gave this

similar to: How to access to sum of dissimilarities in CLARA