thr3ads.net - similar to: "clustering dissimilarities"

Displaying 20 results from an estimated 4000 matches similar to: "clustering dissimilarities"

2004 Sep 08

isoMDS

Dear List: I have a question regarding an MDS procedure that I am accustomed to using. I have searched around the archives a bit and the help doc and still need a little assistance. The package isoMDS is what I need to perform the non-metric scaling, but I am working with similarity matrices, not dissimilarities. The question may end up being resolved simply. Here is a bit of substantive

How to access to sum of dissimilarities in CLARA

2005 May 30

How to access to sum of dissimilarities in CLARA

Dear All , Since dissimilarity is one of quality measures in clustering , I'm trying to access to the sum of dissimilarity as a whole measure. But after running my data using CLARA I obtain : 1128 dissimilarities, summarized : Min. 1st Qu. Median Mean 3rd Qu. Max. 0.033155 0.934630 2.257000 2.941600 4.876600 8.943700 But I can not find the sum of dissimilarity.How can i

Fwd: MDS problems [ajtee@ajtee.uklinux.net]

2004 Mar 26

Fwd: MDS problems [ajtee@ajtee.uklinux.net]

Hi all, I'm trying to perform an MDS of some data that I have. When I use cmdscale everything is fine and I get some interesting results however, the tends to be low. What I wnat to do is compare this with the Non-Metric MDS using isoMDS or sammon. However, when I try using these I get the following message. Error in isoMDS(x.dist) : zero or negative distance between objects 2 and 4

PAM clustering: using my own dissimilarity matrix

2004 Jun 29

PAM clustering: using my own dissimilarity matrix

Hello, I would like to use my own dissimilarity matrix in a PAM clustering with method "pam" (cluster package) instead of a dissimilarity matrix created by daisy. I read data from a file containing the dissimilarity values using "read.csv". This creates a matrix (alternatively: an array or vector) which is not accepted by "pam": A call

Visualizing a dissimilarity matrix in Euclidean space

2011 Jul 08

Visualizing a dissimilarity matrix in Euclidean space

Hi, I have a set of nodes and a dissimilarity matrix for them, as well as a csv file in which the diss matrix has been converted to [node_1, node_2, dissimilarity] format. I would like to visualize this as a graph in Euclidean space (that is, similar nodes clumped together in clusters), rather than the seriation visualization given by dissplot(). I am using Network WorkBench for my

distance in the function kmeans

2004 May 28

distance in the function kmeans

Hi, I want to know which distance is using in the function kmeans and if we can change this distance. Indeed, in the function pam, we can put a distance matrix in parameter (by the line "pam<-pam(dist(matrixdata),k=7)" ) but we can't do it in the function kmeans, we have to put the matrix of data directly ... Thanks in advance, Nicolas BOUGET

Subsets in mclust

2002 Feb 14

Subsets in mclust

Dear group, I want to use the mclust package on large data, and therefore I want to use a subset in the initial clustering phase. From help(mclust): k: If `k' is specified, the hierarchical clustering phase will use a sample of size `k' of the data in the initial hierarchical clustering phase. The default is to use the entire data set. m2 is a

Stress in multidimensional scaling

2005 Nov 04

Stress in multidimensional scaling

Hello, We are trying to find a function to compute "stress" in our multidimensional scaling analysis of a dissimilarity matrix. We've used "dist()" to create the matrix and "cmdscale()" for the scaling. In order to determine the number of dimensions we would like to plot stress vs. dimensions. However, we cannot find a pre-made command. It seems that other

Document clustering for R

2005 Sep 12

Document clustering for R

I'm working on a project related to document clustering. I know that R has clustering algorithms such as clara, but only supports two distance metrics: euclidian and manhattan, which are not very useful for clustering documents. I was wondering how easy it would be to extend the clustering package in R to support other distance metrics, such as cosine distance, or if there was an API for

similar to: clustering dissimilarities