Displaying 20 results from an estimated 4000 matches similar to: "clustering dissimilarities"
2004 Sep 08
8
isoMDS
Dear List:
I have a question regarding an MDS procedure that I am accustomed to
using. I have searched around the archives a bit and the help doc and
still need a little assistance. The package isoMDS is what I need to
perform the non-metric scaling, but I am working with similarity
matrices, not dissimilarities. The question may end up being resolved
simply.
Here is a bit of substantive
2005 May 30
2
How to access to sum of dissimilarities in CLARA
Dear All ,
Since dissimilarity is one of the quality measures in clustering, I'm trying to access the sum of dissimilarities as an overall measure. But after running my data through CLARA I obtain:
1128 dissimilarities, summarized :
    Min.  1st Qu.   Median     Mean  3rd Qu.     Max.
0.033155 0.934630 2.257000 2.941600 4.876600 8.943700
But I cannot find the sum of dissimilarities. How can I
2004 Mar 26
2
Fwd: MDS problems [ajtee@ajtee.uklinux.net]
Hi all,
I'm trying to perform an MDS of some data that I have. When I use
cmdscale everything is fine and I get some interesting results however,
the tends to be low.
What I want to do is compare this with the Non-Metric MDS using isoMDS
or sammon. However, when I try using these I get the following message.
Error in isoMDS(x.dist) : zero or negative distance between objects 2
and 4
2004 Jun 29
1
PAM clustering: using my own dissimilarity matrix
Hello,
I would like to use my own dissimilarity matrix in a PAM clustering with
method "pam" (cluster package) instead of a dissimilarity matrix created
by daisy.
I read data from a file containing the dissimilarity values using
"read.csv". This creates a matrix (alternatively: an array or vector)
which is not accepted by "pam": A call
2011 Jul 08
1
Visualizing a dissimilarity matrix in Euclidean space
Hi,
I have a set of nodes and a dissimilarity matrix for them, as well as a csv
file in which the diss matrix has been converted to [node_1, node_2,
dissimilarity] format. I would like to visualize this as a graph in
Euclidean space (that is, similar nodes clumped together in clusters),
rather than the seriation visualization given by dissplot(). I am using
Network WorkBench for my
2004 May 28
6
distance in the function kmeans
Hi,
I want to know which distance is used in the function kmeans
and whether we can change this distance.
Indeed, with the function pam we can pass a distance matrix as
the argument (e.g. "pam<-pam(dist(matrixdata),k=7)"), but
we can't do this with the function kmeans; we have to pass the
data matrix directly ...
Thanks in advance,
Nicolas BOUGET
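
A minimal sketch of the contrast described in this post, assuming the standard interfaces of stats::kmeans and cluster::pam: kmeans takes the data matrix itself and minimises within-cluster sums of squares (that is, squared Euclidean distance), whereas pam accepts a precomputed dist object, so the metric can be chosen freely. The toy data `x` and the choice of Manhattan distance are illustrative assumptions, not part of the original question.

```r
library(cluster)  # provides pam()

set.seed(1)
# Hypothetical toy data: two well-separated groups of 20 points each
x <- rbind(matrix(rnorm(40, mean = 0), ncol = 2),
           matrix(rnorm(40, mean = 5), ncol = 2))

# kmeans works on the raw data matrix; its criterion is based on
# squared Euclidean distance and it does not take a dist object
km <- kmeans(x, centers = 2)

# pam accepts any dissimilarity, here Manhattan distances from dist()
d  <- dist(x, method = "manhattan")
pm <- pam(d, k = 2)

# Cross-tabulate the two partitions to compare them
table(kmeans = km$cluster, pam = pm$clustering)
```

If a non-Euclidean distance is essential, a medoid-based method such as pam is usually the more natural fit, since it only needs the dissimilarities.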
2002 Feb 14
1
Subsets in mclust
Dear group,
I want to use the mclust package on large data, and therefore I want to use
a subset in the initial clustering phase. From help(mclust):
k: If `k' is specified, the hierarchical clustering phase will
use a sample of size `k' of the data in the initial
hierarchical clustering phase. The default is to use the
entire data set.
m2 is a
2005 Nov 04
1
Stress in multidimensional scaling
Hello,
We are trying to find a function to compute "stress" in our
multidimensional scaling analysis of a dissimilarity matrix. We've used
"dist()" to create the matrix and "cmdscale()" for the scaling. In order
to determine the number of dimensions we would like to plot stress vs.
dimensions. However, we cannot find a pre-made command. It seems that
other
2005 Sep 12
4
Document clustering for R
I'm working on a project related to document clustering. I know that R
has clustering algorithms such as clara, but it only supports two distance
metrics: euclidean and manhattan, which are not very useful for
clustering documents. I was wondering how easy it would be to extend the
clustering package in R to support other distance metrics, such as
cosine distance, or if there was an API for
2004 Dec 09
1
more clustering questions
Sorry to bother you kind folks again with my questions. I am trying to
learn as much as I can about all this, and I will admit that I don't
have the proper background, but I hope that someone can at least point
me in the correct direction.
I have created a test matrix for what I want to do:
   s1 s2 s3 s4 s5
s1 10  5  0  8  7
s2  5 10  0  0  5
s3  0  0 10  0  0
s4  8  0  0 10  0
s5  7
2004 Feb 26
2
Multidimensional scaling and distance matrices
Dear All,
I am in the somewhat unfortunate position of having to reproduce the
results previously obtained from (non-metric?) MDS on a "kinship" matrix
using Statistica. A kinship matrix measures affinity between groups, and
has its maximum values on the diagonal.
Apparently, starting with an n x n kinship matrix, all that was needed
was to feed it to Statistica, flagging that the
2001 Jan 09
2
PAM clustering (using triangular matrix)
Hi,
I'm trying to use a similarity matrix (triangular) as input for pam() or
fanny() clustering algorithms.
The problem is that these algorithms can only accept a dissimilarity
matrix, normally generated by daisy().
However, daisy only accepts a 'data matrix or dataframe. Dissimilarities
will be computed between the rows of x'.
Is there any way to say that your data are already a
2013 Dec 08
3
Why daisy() in cluster library failed to exclude NA when computing dissimilarity
Hi,
According to the documentation for the daisy function in the cluster package,
it can compute dissimilarities when NA (missing) values are present.
http://stat.ethz.ch/R-manual/R-devel/library/cluster/html/daisy.html
But why when I tried this code
library(cluster)
x <- c(1.115,NA,NA,0.971,NA)
y <- c(NA,1.006,NA,NA,0.645)
df <- as.data.frame(rbind(x,y))
daisy(df,metric="gower")
It gave this
2005 Oct 21
1
finite mixture model (2-component gaussian): plotting component gaussian components?
Dear Knowledgeable R Community Members,
Please excuse my ignorance, I apologize in advance if this is an easy question, but I am a bit stumped and could use a little guidance.
I have a finite mixture modeling problem -- for example, a 2-component gaussian mixture -- where the components have a large overlap, and
I am trying to use the "mclust" package to solve this problem.
I need
2003 Sep 11
3
Rgui access violation
Dear All;
While using EMclust() in the mclust package, I frequently
encountered a program error. A message window popped up with the message "
Rgui.exe has generated errors and will be closed by Windows. You will need
to restart the program. An error log is being created."
> version
_
platform i386-pc-mingw32
arch i386
os mingw32
2003 Apr 23
1
clustering
Dear R-users,
I have a two - dimensional data set which needs to be clustered into
groups:
I'm searching for groups of points which show a positive
correlation (in a twodimensional plot of the data set), but I do not have
any knowledge about how many groups there might be.
Do you know of a clustering algorithm in R (or
in general) which can use a-priori information about the cluster's
2003 Jan 30
2
Validation of clustering
Hi,
I'm using the library cluster to cluster a set of figures (method CLARA).
Could somebody who works with clustering tell me what I should do to
evaluate the clustering?
Tks VM,
Francisco.
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Francisco Júnior,
Computer Science - UFPE-Brazil
"One life has more value that the
world whole"
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2003 Jan 07
2
Extracting means for given strata from dissimilarity object
Is there a way of extracting mean distance or dissimilarity for a given
strata from a 'dist' or 'dissimilarity' object, e.g. extract mean distances
for each species in Anderson's iris data?
data(iris)
iris.dist<-dist(iris[,1:4])
then what?
Mikkel Grum, PhD
Genetic Diversity Scientist
International Plant Genetic Resources Institute (IPGRI)
Sub-Saharan Africa Group
***
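
One possible way to continue from `iris.dist`, offered as a sketch rather than a definitive answer: convert the 'dist' object to a full matrix, subset it by stratum, and average the lower triangle so that each pair is counted once. The names `dmat`, `sp` and `within_means` are illustrative and not from the original post.

```r
data(iris)
iris.dist <- dist(iris[, 1:4])

# Full symmetric matrix of pairwise distances
dmat <- as.matrix(iris.dist)
sp   <- iris$Species

# Mean within-species distance, one value per species
within_means <- sapply(levels(sp), function(s) {
  sub <- dmat[sp == s, sp == s]   # distances among members of species s
  mean(sub[lower.tri(sub)])       # lower triangle: each pair once, no diagonal
})

within_means
```

The same subsetting idea should carry over to a 'dissimilarity' object from daisy(), since as.matrix() works on it as well.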
2009 Mar 29
1
[cluster package question] What is the "sum of the dissimilarities" in the pam command ?
Hello Martin Maechler and All,
A simple question (I hope):
How can I compute the "sum of the dissimilarities" that appears in the pam
command (from the cluster package) ?
Is it the "manhattan" distance (such as the one implemented by "dist") ?
I am asking since I am running clustering on a dataset. I found 7 medoids
with the pam command, and from it I have the
2016 Apr 25
1
how to create initial configuration for isoMDS
Hi,
I'm trying to use isoMDS to project a directed graph to 2-dim vectors, but I got an error.
#here is the code to create the graph using igraph package and run isoMDS on it.
library(igraph)
library(MASS)
g<-make_graph(c(1,2, 2,3, 2,4, 3,4, 4,5, 5,6, 3,6, 1,6, 2,5),directed=TRUE)
dist<-distances(g, mode="out")
loc<-isoMDS(dist)
# below is content of the dist matrix