thr3ads.net - similar to: "Converting a Dissimilarity Matrix"

Displaying 20 results from an estimated 8000 matches similar to: "Converting a Dissimilarity Matrix"

2002 Apr 29

cluster analyses

I'm clustering rather large data sets and would like to cut the dendrograms to get a better view of specific components. I calculate the dissimilarity matrix using daisy() because I have a mixture of variable types: factors, ordered factors and numerical variables. If I want one dendrogram, I use agnes() for the agglomerative nesting and pltree() to draw the dendrogram. That way, I get the

agnes clustering and NAs

2011 Jan 27

agnes clustering and NAs

Hello, In the documentation for agnes in the package 'cluster', it says that NAs are allowed, and sure enough it works for a small example like : > m <- matrix(c( 1, 1, 1, 2, 1, NA, 1, 1, 1, 2, 2, 2), nrow = 3, byrow = TRUE) > agnes(m) Call: agnes(x = m) Agglomerative coefficient: 0.1614168 Order of objects: [1] 1 2 3 Height (summary): Min. 1st Qu. Median Mean 3rd

2016 Apr 12

Dissimilarity matrix and number clusters determination

Hi, I already have a dissimilarity matrix and I am submitting the results to the elbow.obj method to get an optimal number of clusters. Am I reading the below output correctly that I should have 17 clusters? code: top150 <- sampleset[1:150,] {cluster1 <- daisy(top150 , metric = c("gower") , stand = TRUE , type = list(symm

cluster a distance(analogue)-object using agnes(cluster)

2008 Sep 02

cluster a distance(analogue)-object using agnes(cluster)

I try to perform a clustering using an existing dissimilarity matrix that I calculated using distance (analogue) I tried two different things. One of them worked and one not and I don`t understand why. Here the code: not working example library(cluster) library(analogue) iris2<-as.data.frame(iris) str(iris2) 'data.frame': 150 obs. of 5 variables: $ Sepal.Length: num 5.1 4.9 4.7

dissimilarity matrices

2006 Nov 09

dissimilarity matrices

Dear All, I have a dissimilarity matrix which I happily convert to a distance object by running: X <- as.dist(Y) and I can happily now run either hclust(X) or agnes(X). So that the various bits of output are labelled correctly I would dearly like to be able to give names to the columns and rows of X, as would happen if I ran: mydata<-read.table("clipboard",header=T)

PAM clustering: using my own dissimilarity matrix

2004 Jun 29

PAM clustering: using my own dissimilarity matrix

Hello, I would like to use my own dissimilarity matrix in a PAM clustering with method "pam" (cluster package) instead of a dissimilarity matrix created by daisy. I read data from a file containing the dissimilarity values using "read.csv". This creates a matrix (alternatively: an array or vector) which is not accepted by "pam": A call

Cluster prediction from factor/numeric datasets

2007 Jul 23

Cluster prediction from factor/numeric datasets

Hi all, I have a dataset with numeric and factor columns of data which I developed a Gower Dissimilarity Matrix for (Daisy) and used Agglomerative Nesting (Agnes) to develop 20 clusters. I would like to use the 20 clusters to determine cluster membership for a new dataset (using predict) but cannot find a way to do this (no way to "predict" in the cluster package). I know I can use

Why daisy() in cluster library failed to exclude NA when computing dissimilarity

2013 Dec 08

Why daisy() in cluster library failed to exclude NA when computing dissimilarity

Hi, According to daisy function from cluster documentation, it can compute dissimilarity when NA (missing) value(s) is present. http://stat.ethz.ch/R-manual/R-devel/library/cluster/html/daisy.html But why when I tried this code library(cluster) x <- c(1.115,NA,NA,0.971,NA) y <- c(NA,1.006,NA,NA,0.645) df <- as.data.frame(rbind(x,y)) daisy(df,metric="gower") It gave this

2 questions about cluster analysis

2002 Aug 27

2 questions about cluster analysis

Hello everyone! I have two questions regarding cluster analysis with the package 'cluster' : 1. I have a dissimilarity matrix in csv format, and I would like to use it to generate a dendogram using 'hclust'. How do I make that matrix compatible with the function? I think it has to be similar to the objects generated by the 'dist' function. Any ideas? 2. Does

fuzzy classification and dissimilarity matrix

2006 Apr 07

fuzzy classification and dissimilarity matrix

Hello, I want to make a fuzzy classification from a dissimilarity matrix (calculated with daisy from package 'cluster'). I have tried to use fanny (package cluster) but I have the same problems than described in a previous message (http://tolstoy.newcastle.edu.au/R/help/05/05/4546.html) i.e. it always gives me two clusters in the results (even if k is different from 2) with the same

Transformation of dissimilarity or distance matrix

2001 May 30

Transformation of dissimilarity or distance matrix

Dear List, is there an elegant (or even not elegant) way how to transform dissimilarity or distance matrix A (or, in general, arbitrary symmetrical matrix) by transposition of rows and columns into a form closest to "block diagonal" matrix B? The matrix A is adjusted the following way A[A<epsilon] <-0 #(epsilon is given "small" number) B: (in its ideal form)

Clustering

2007 Nov 28

Clustering

Hello all! I am performingsome clustering analysis on microarray data using agnes{cluster} and I have created my own dissimilarity matrix according to a distance measure different from "euclidean" or "manhattan" etc. My question is, if I choose for example method="complete", how are the distances between the elements calculated? Are they taken form the dissimilarity

about mix type clust algorithm

2013 Jul 22

about mix type clust algorithm

Hi: I have tried to find the appropriate clust algorithm for mixed type of data. The suggested way I see is: 1. use daisy to get the dissimilarity matrix 2. use PAM/hclust by providing the dissimilarity matrix, to get the clusters but by following this, when the data set grows bigger say 10,000 rows of data, the dissimilarity matrix will be O(n^2), and out of memory will occur. I am

Clustering Question

2006 Jan 26

Clustering Question

Hi group, My case has N physicians with each seeing M patients. One physician could have seen a group of patients, or, a patient could have been seen by multiple number of physicians. In order words, there are overlaps. Now, I have the following NxM matrix Patient#1 Patient#2 Patient#3 ....... Patient#m Physician#1 1 0 1 ....... 0 Physician#2

Need help on heatmap, K-means and hhierarchical clustering methods

2010 Aug 09

Need help on heatmap, K-means and hhierarchical clustering methods

Hi folks, I am new to the R software. I have been going through different materials to know more about R. I have the R software installed on my windows machine.I would like to know the R source code for the following problems on iris flower data set. I need to do the cluster analysis project with the iris data set. The goal is to cluster the flowers according to their Sepal.Length, Sepal.Width,

agnes not working

2011 Aug 31

agnes not working

Hello! I created a distances matrix for 13 objects using daisy (see the attached file). I am trying to clusteranalyse it using agnes but it's not working. What might be the problem: mydistances<-read.csv("Results of daisy.csv") mycluster<-agnes(mydistances, method="ward") I am getting: Error in agnes(mydistances, method = "ward") : NA/NaN/Inf in foreign

PAM clustering (using triangular matrix)

2001 Jan 09

PAM clustering (using triangular matrix)

Hi, I'm trying to use a similarity matrix (triangular) as input for pam() or fanny() clustering algorithms. The problem is that this algorithms can only accept a dissimilarity matrix, normally generated by daisy(). However, daisy only accept 'data matrix or dataframe. Dissimilarities will be computed between the rows of x'. Is there any way to say to that your data are already a

Specify ID variable in daisy{cluster}

2011 Jun 16

Specify ID variable in daisy{cluster}

Hi All - I am using the daisy function from the cluster library to create a dissimilarity matrix. I'm going to use that matrix to run a cluster analysis. My participants are identified with the variable, hhid. However, when I try to keep hhid in the dataset that I use to create the dissimilarity matrix, daisy uses it to create the matrix rather than ignoring it as an ID variable. I need to

hclust() and agnes() method="average" divergence (PR#3648)

2003 Aug 04

hclust() and agnes() method="average" divergence (PR#3648)

This message is in MIME format. Since your mail reader does not understand this format, some or all of this message may not be legible. ------_=_NextPart_000_01C35A53.75780090 Content-Type: text/plain; charset="iso-8859-1" Anyone have a clue why hclust() and agnes() produce different results in the example below when both use method="average"?? I'm not able to reproduce

Clustering function

2006 Jan 17

Clustering function

Dear mailing group, I have loaded an Excel file into R by calling it ".csv" and using the "read.csv" function in R. However then I want to use the (limma package specific, I believe) function "hclust", which clusters data in a tree dendrogram, by similarity. However, I receive the errors msg.s: 1) "missing observations in cov/cor" 2) "In

similar to: Converting a Dissimilarity Matrix