Displaying 20 results from an estimated 600 matches similar to: "How to access to sum of dissimilarities in CLARA"
2008 Mar 03
1
silhouette plot for kmeans result
Dear All,
Is there any existing code for plotting silhouette for kmeans clustering
results?
Many thanks!
Linda
[[alternative HTML version deleted]]
2004 Jun 29
1
PAM clustering: using my own dissimilarity matrix
Hello,
I would like to use my own dissimilarity matrix in a PAM clustering with
method "pam" (cluster package) instead of a dissimilarity matrix created
by daisy.
I read data from a file containing the dissimilarity values using
"read.csv". This creates a matrix (alternatively: an array or vector)
which is not accepted by "pam": A call
2002 Dec 13
1
clustering dissimilarities
Hello. I know my dissimilarity matrix but not my original data. Is there any way i could use the clustering function Mclust or EMclust with this dissimilarity matrix? or at least some equivalent of these functions? As this is model based clustering i dont know if it is actually possible to do it without the original data
thanks in advance for your help
[[alternate HTML version deleted]]
2006 Mar 20
1
plot and validation in clustering
Hi there,
I use function "kmeans" and "clara" to cluster one flow cytometry
dataset. By using function "plot", the clusters got from "clara" can be
graphed, while "kmeans" not. How can I get the plot of the clusters of
"kmeans"?
And, I hope to compare the two methods "kmeans" and "clara", or in other
word, I
2003 Nov 17
1
CLARA
I need informations about the clara routine. The on-line doc say that the
argument stand is a logical, indicating if the measurements in x are
standardized before calculating the dissimilarities. Measurements are
standardized for each variable (column), by subtracting the variable's mean
value and dividing by the variable's mean absolute deviation. If we note
STAND = TRUE, I suppose that
2009 Mar 29
1
[cluster package question] What is the "sum of the dissimilarities" in the pam command ?
Hello Martin Maechler and All,
A simple question (I hope):
How can I compute the "sum of the dissimilarities" that appears in the pam
command (from the cluster package) ?
Is it the "manhattan" distance (such as the one implemented by "dist") ?
I am asking since I am running clustering on a dataset. I found 7 medoids
with the pam command, and from it I have the
2000 Aug 28
2
R function calling. Do I understand this right?
I need to write a nonhierarchical clustering routine and I'm studying
the way hclust (in the mva library) is built in R to see how things are
done and what I can modify. I ran f2c on the hclust.f file (so I could
read it in a language I know!) and there is one thing I don't quite
understand about the way it gets called and the way it returns values.
That Fortran function gets called in
2005 Apr 20
4
results from sammon()
Dear all,
I'm trying to get a two dimensional embedding of some data using different
meythods, among which princomp(), cmds(), sammon() and isoMDS(). I have a
problem with sammon() because the coordinates I get are all equal to NA.
What does it mean? Why the method fails in finding the coordinates? Can I do
anything to get some meaningful results?
Thank you very much
Domenico
2006 May 23
1
standardization of values before call to pam() or clara()
Greetings,
Experimenting with the cluster package, and am starting to scratch my head in
regards to the *best* way to standardize my data. Both functions can
pre-standardize columns in a dataframe. according to the manual:
Measurements are standardized for each variable (column), by subtracting the
variable's mean value and dividing by the variable's mean absolute deviation.
This
2011 Jan 27
3
agnes clustering and NAs
Hello,
In the documentation for agnes in the package 'cluster', it says that NAs are allowed, and sure enough it works for a small example like :
> m <- matrix(c(
1, 1, 1, 2,
1, NA, 1, 1,
1, 2, 2, 2), nrow = 3, byrow = TRUE)
> agnes(m)
Call: agnes(x = m)
Agglomerative coefficient: 0.1614168
Order of objects:
[1] 1 2 3
Height (summary):
Min. 1st Qu. Median Mean 3rd
2017 Aug 17
2
PAM Clustering
Dear Germano,
Thank you for your fast reply,
In the above code, *MYData *is the actual data set.
Do not we need to convert *MYData to *the dissimilarity matrix using
*pam(as.dist(**MYData**), k = 10, diss = TRUE*)* code line?*
*Regards.*
On Thu, Aug 17, 2017 at 2:58 PM, Germano Rossi <germano.rossi at gmail.com>
wrote:
> try this
>
> MYdata <-
2011 Jun 27
3
New to R, trying to use agnes, but can't load my ditance matrix
Hi,
I'm mighty new to R. I'm using it on Windows. I'm trying to cluster using a
distance matrix I created from the data on my own and called it D10.dist. I
loaded the cluster package. Then tried the following command...
> agnes("E:D10.dist", diss = TRUE, metric = "euclidean", stand = FALSE,
> method = "average", par.method, keep.diss = n < 1000,
2006 Feb 05
3
Cluster Analysis - Number of Clusters
Hello,
I'm playing around with cluster analysis, and am looking for methods to
select the number of clusters. I am aware of methods based on a 'pseudo
F' or a 'pseudo T^2'. Are there packages in R that will generate these
statistics, and/or other statistics to aid in cluster number selection?
Thanks,
John.
--
2008 Sep 02
2
cluster a distance(analogue)-object using agnes(cluster)
I try to perform a clustering using an existing dissimilarity matrix that I
calculated using distance (analogue)
I tried two different things. One of them worked and one not and I don`t
understand why.
Here the code:
not working example
library(cluster)
library(analogue)
iris2<-as.data.frame(iris)
str(iris2)
'data.frame': 150 obs. of 5 variables:
$ Sepal.Length: num 5.1 4.9 4.7
2007 Nov 28
2
Clustering
Hello all!
I am performingsome clustering analysis on microarray data using
agnes{cluster} and I have created my own dissimilarity matrix according to a
distance measure different from "euclidean" or "manhattan" etc. My question
is, if I choose for example method="complete", how are the distances
between the elements calculated? Are they taken form the dissimilarity
2006 Sep 26
0
cauculating dissimilarities in R
Dear All,
I?ve got a statistical question on calculating
dissimilarities in R.
I want to calculate the different types of dissimilarities
on the ?flower? dataset found in the package
?cluster?. Flower is a data frame with 18 observations
on 8 variables. Variable 1 and 2 are binary, variable 3 is
asymmetric binary, variable 4 is nominal, variable 5 and 6
are ordered and variable 7 and 8 are
2002 Jan 28
1
Cluster package broken in 1.4.0?
Greetings,
I am reasonably experienced with R but I recently tried to
do some clustering using the "cluster" package, in order
to see if it would help.
I only tried this once with the 1.3.1 version and it worked
(I don't quite remember which method I used).
Now, I tried with the 1.4.0 version and no clustering function
seems to work with matrices that contain NAs, even
though
2012 Jan 23
2
How to build a "Amalgamation Schedule"? help!
Dear all,
I need to process large amounts of data (two or three variables for 6,000
cases) cluster analysis. In the end I need to fill the source data to the
obtained clusters. I need to trace the sequence of data fusion. In this
case, I can fill in a cluster (with any level of linkage distance) by data.
This procedure is implemented in the package Statistica, but this package
can not work with
2017 Aug 17
0
PAM Clustering
Sorry, I never use pam. In the help, you can see that pam require a
dataframe OR a dissimilarity matrix. If diss=FALSE then "euclidean" was use.So,
I interpret that a matrix of dissimilarity is generated automatically.
Problems may be in your data. Indeed
pam(ruspini, 4)$diss
write a dissimilaty matrix
while
pam(MYdata,10)$diss
wite NULL
2017-08-17 16:03 GMT+02:00 Sema Atasever
2013 Dec 08
3
Why daisy() in cluster library failed to exclude NA when computing dissimilarity
Hi,
According to daisy function from cluster documentation, it can compute
dissimilarity when NA (missing) value(s) is present.
http://stat.ethz.ch/R-manual/R-devel/library/cluster/html/daisy.html
But why when I tried this code
library(cluster)
x <- c(1.115,NA,NA,0.971,NA)
y <- c(NA,1.006,NA,NA,0.645)
df <- as.data.frame(rbind(x,y))
daisy(df,metric="gower")
It gave this