Displaying 20 results from an estimated 8000 matches similar to: "Converting a Dissimilarity Matrix"
2002 Apr 29
2
cluster analyses
I'm clustering rather large data sets and would like to cut the dendrograms
to get a better view of specific components. I calculate the dissimilarity
matrix using daisy() because I have a mixture of variable types: factors,
ordered factors and numerical variables. If I want one dendrogram, I use
agnes() for the agglomerative nesting and pltree() to draw the dendrogram.
That way, I get the
2011 Jan 27
3
agnes clustering and NAs
Hello,
In the documentation for agnes in the package 'cluster', it says that NAs are allowed, and sure enough it works for a small example like :
> m <- matrix(c(
1, 1, 1, 2,
1, NA, 1, 1,
1, 2, 2, 2), nrow = 3, byrow = TRUE)
> agnes(m)
Call: agnes(x = m)
Agglomerative coefficient: 0.1614168
Order of objects:
[1] 1 2 3
Height (summary):
Min. 1st Qu. Median Mean 3rd
2016 Apr 12
1
Dissimilarity matrix and number clusters determination
Hi,
I already have a dissimilarity matrix and I am submitting the results to
the elbow.obj method to get an optimal number of clusters. Am I reading
the below output correctly that I should have 17 clusters?
code:
top150 <- sampleset[1:150,]
{cluster1 <- daisy(top150
, metric = c("gower")
, stand = TRUE
, type = list(symm
2008 Sep 02
2
cluster a distance(analogue)-object using agnes(cluster)
I try to perform a clustering using an existing dissimilarity matrix that I
calculated using distance (analogue)
I tried two different things. One of them worked and one not and I don`t
understand why.
Here the code:
not working example
library(cluster)
library(analogue)
iris2<-as.data.frame(iris)
str(iris2)
'data.frame': 150 obs. of 5 variables:
$ Sepal.Length: num 5.1 4.9 4.7
2006 Nov 09
1
dissimilarity matrices
Dear All,
I have a dissimilarity matrix which I happily convert to a distance object
by running:
X <- as.dist(Y)
and I can happily now run either hclust(X) or agnes(X).
So that the various bits of output are labelled correctly I would dearly
like to be able to give names to the columns and rows of X, as would happen
if I ran:
mydata<-read.table("clipboard",header=T)
2004 Jun 29
1
PAM clustering: using my own dissimilarity matrix
Hello,
I would like to use my own dissimilarity matrix in a PAM clustering with
method "pam" (cluster package) instead of a dissimilarity matrix created
by daisy.
I read data from a file containing the dissimilarity values using
"read.csv". This creates a matrix (alternatively: an array or vector)
which is not accepted by "pam": A call
2007 Jul 23
1
Cluster prediction from factor/numeric datasets
Hi all,
I have a dataset with numeric and factor columns of data which I developed a
Gower Dissimilarity Matrix for (Daisy) and used Agglomerative Nesting
(Agnes) to develop 20 clusters.
I would like to use the 20 clusters to determine cluster membership for a
new dataset (using predict) but cannot find a way to do this (no way to
"predict" in the cluster package).
I know I can use
2013 Dec 08
3
Why daisy() in cluster library failed to exclude NA when computing dissimilarity
Hi,
According to daisy function from cluster documentation, it can compute
dissimilarity when NA (missing) value(s) is present.
http://stat.ethz.ch/R-manual/R-devel/library/cluster/html/daisy.html
But why when I tried this code
library(cluster)
x <- c(1.115,NA,NA,0.971,NA)
y <- c(NA,1.006,NA,NA,0.645)
df <- as.data.frame(rbind(x,y))
daisy(df,metric="gower")
It gave this
2002 Aug 27
1
2 questions about cluster analysis
Hello everyone!
I have two questions regarding cluster analysis with the package 'cluster'
:
1. I have a dissimilarity matrix in csv format, and I would like to use
it to generate a dendogram using 'hclust'. How do I make that matrix
compatible with the function? I think it has to be similar to the objects
generated by the 'dist' function. Any ideas?
2. Does
2006 Apr 07
1
fuzzy classification and dissimilarity matrix
Hello,
I want to make a fuzzy classification from a dissimilarity matrix
(calculated with daisy from package 'cluster'). I have tried to use
fanny (package cluster) but I have the same problems than described in a
previous message
(http://tolstoy.newcastle.edu.au/R/help/05/05/4546.html) i.e. it always
gives me two clusters in the results (even if k is different from 2)
with the same
2001 May 30
3
Transformation of dissimilarity or distance matrix
Dear List,
is there an elegant (or even not elegant) way how to transform
dissimilarity or distance matrix A
(or, in general, arbitrary symmetrical matrix) by transposition of rows and
columns into a form
closest to "block diagonal" matrix B?
The matrix A is adjusted the following way
A[A<epsilon] <-0 #(epsilon is given "small" number)
B: (in its ideal form)
2007 Nov 28
2
Clustering
Hello all!
I am performingsome clustering analysis on microarray data using
agnes{cluster} and I have created my own dissimilarity matrix according to a
distance measure different from "euclidean" or "manhattan" etc. My question
is, if I choose for example method="complete", how are the distances
between the elements calculated? Are they taken form the dissimilarity
2013 Jul 22
1
about mix type clust algorithm
Hi:
I have tried to find the appropriate clust algorithm for mixed type of data.
The suggested way I see is:
1. use daisy to get the dissimilarity matrix
2. use PAM/hclust by providing the dissimilarity matrix, to get the clusters
but by following this, when the data set grows bigger say 10,000 rows of data, the dissimilarity matrix will be O(n^2), and out of memory will occur.
I am
2006 Jan 26
1
Clustering Question
Hi group,
My case has N physicians with each seeing M patients.
One physician could have seen a group of patients, or,
a patient could have been seen by multiple number of
physicians. In order words, there are overlaps. Now,
I have the following NxM matrix
Patient#1 Patient#2 Patient#3 .......
Patient#m
Physician#1 1 0 1 .......
0
Physician#2
2010 Aug 09
1
Need help on heatmap, K-means and hhierarchical clustering methods
Hi folks,
I am new to the R software. I have been going through different materials to
know more about R.
I have the R software installed on my windows machine.I would like to know
the R source code for the following problems on iris flower data set.
I need to do the cluster analysis project with the iris data set. The goal
is to cluster the flowers
according to their Sepal.Length, Sepal.Width,
2011 Aug 31
1
agnes not working
Hello!
I created a distances matrix for 13 objects using daisy (see the attached file).
I am trying to clusteranalyse it using agnes but it's not working.
What might be the problem:
mydistances<-read.csv("Results of daisy.csv")
mycluster<-agnes(mydistances, method="ward")
I am getting:
Error in agnes(mydistances, method = "ward") :
NA/NaN/Inf in foreign
2001 Jan 09
2
PAM clustering (using triangular matrix)
Hi,
I'm trying to use a similarity matrix (triangular) as input for pam() or
fanny() clustering algorithms.
The problem is that this algorithms can only accept a dissimilarity
matrix, normally generated by daisy().
However, daisy only accept 'data matrix or dataframe. Dissimilarities
will be computed between the rows of x'.
Is there any way to say to that your data are already a
2011 Jun 16
1
Specify ID variable in daisy{cluster}
Hi All - I am using the daisy function from the cluster library to create a
dissimilarity matrix. I'm going to use that matrix to run a cluster
analysis. My participants are identified with the variable, hhid. However,
when I try to keep hhid in the dataset that I use to create the
dissimilarity matrix, daisy uses it to create the matrix rather than
ignoring it as an ID variable. I need to
2003 Aug 04
1
hclust() and agnes() method="average" divergence (PR#3648)
This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.
------_=_NextPart_000_01C35A53.75780090
Content-Type: text/plain;
charset="iso-8859-1"
Anyone have a clue why hclust() and agnes() produce different results in the
example below when both use method="average"?? I'm not able to reproduce
2006 Jan 17
1
Clustering function
Dear mailing group,
I have loaded an Excel file into R by calling it ".csv" and using the
"read.csv" function in R. However then I want to use the (limma package
specific, I believe) function "hclust", which clusters data in a tree
dendrogram, by similarity. However, I receive the errors msg.s: 1) "missing
observations in cov/cor" 2) "In