thr3ads.net - similar to: "Hierarchical Clustering Using Mutual Information"

Displaying 20 results from an estimated 5000 matches similar to: "Hierarchical Clustering Using Mutual Information"

2003 Nov 08

help with hierarchical clustering

I have a large excel file with data in it. I converted it to a 'csv' format. I imported this dataset to R using the follownig command mldata <- read.csv("c:\\temp\\mldata.csv", header=T) all the column names and the rows seems to be correct. Now that I have this object, I need to perfrom hclust. I used the following hc <- hclust(dist(mldata), method="single")

Finding overlaps in vector

2007 Dec 21

Finding overlaps in vector

<posted & mailed> Dear all, I'm trying to solve the problem, of how to find clusters of values in a vector that are closer than a given value. Illustrated this might look as follows: vector <- c(0,0.45,1,2,3,3.25,3.33,3.75,4.1,5,6,6.45,7,7.1,8) When using '0.5' as the proximity requirement, the following groups would result: 0,0.45 3,3.25,3.33,3.75,4.1 6,6.45 7,7.1

k-nn hierarchical clustering

2011 Jun 09

k-nn hierarchical clustering

Hi there, is there any R-function for k-nearest neighbour agglomerative hierarchical clustering? By this I mean standard agglomerative hierarchical clustering as in hclust or agnes, but with the k-nearest neighbour distance between clusters used on the higher levels where there are at least k>1 distances between two clusters (single linkage is 1-nearest neighbour clustering)? Best regards,

hierarchical clustering within a size limit

2011 May 11

hierarchical clustering within a size limit

Hello List, I am trying to implement a hierarchical cluster using the hclust method agglomerative single linkage method with a small wrinkle. I would like to cluster a set of numbers on a number line only if they are within a distance of 500. I would then like to print out the members of this list. So far I can put a vector: > x<-c(2,10,200,300,600,700) into a distance matrix: >

Interpreting the result of 'cutree' from hclust/heatmap.2

2013 Aug 22

Interpreting the result of 'cutree' from hclust/heatmap.2

I have the following code that perform hiearchical clustering and plot them in heatmap. __ library(gplots) set.seed(538) # generate data y <- matrix(rnorm(50), 10, 5, dimnames=list(paste("g", 1:10, sep=""), paste("t", 1:5, sep=""))) # the actual data is much larger that the above # perform hiearchical clustering and plot heatmap test <- heatmap.2(y)

Multiv / hierclust / plclust

2000 Aug 31

Multiv / hierclust / plclust

I use hierclust (hierarchical clustering) in multiv package. In the documentation it is said that plclust (plotting a dendrogram) is available in S-plus. Can I find it anywhere (I have searched through http://lib.stat.cmu.edu/S and found only quotations of plclust in multiv) or is it only part of the S package (which I don't have)? Thanks --------------- Charles RAUX, Laboratoire

cophenetic matrix

2001 Jun 12

cophenetic matrix

Hello, I analyse some free-sorting data so I use hierarchical clustering. I want to compare my proximity matrix with the tree representation to evalute the fitting. (stress, cophenetic correlation (pearson's correlation)...) "The cophenetic similarity of two objects a and b is defined as the similarity level at wich objects a and b become members of the same cluster during the course of

Clustering algorithms don't find obvious clusters

2010 Jun 11

Clustering algorithms don't find obvious clusters

I have a directed graph which is represented as a matrix on the form 0 4 0 1 6 0 0 0 0 1 0 5 0 0 4 0 Each row correspond to an author (A, B, C, D) and the values says how many times this author have cited the other authors. Hence the first row says that author A have cited author B four times and author D one time. Thus the matrix represents two groups of authors: (A,B) and (C,D) who cites

Cannot allocate vector of size

2015 Dec 23

Cannot allocate vector of size

Antes de nada, me gustaría daros las gracias por toda vuestra ayuda. He estado probando todo lo que me habéis dicho a la vez, y no hay manera, sigo teniendo el problema con el espacio. En cuanto al tamaño de la base de datos, es más grande de lo que puse, me equivoqué y puse el tamaño de una base anterior con la que estuve trabajando, la actual tiene 36866 filas x 6500 columnas. He seguido todas

Clustering Question

2006 Jan 26

Clustering Question

Hi group, My case has N physicians with each seeing M patients. One physician could have seen a group of patients, or, a patient could have been seen by multiple number of physicians. In order words, there are overlaps. Now, I have the following NxM matrix Patient#1 Patient#2 Patient#3 ....... Patient#m Physician#1 1 0 1 ....... 0 Physician#2

agnes clustering and NAs

2011 Jan 27

agnes clustering and NAs

Hello, In the documentation for agnes in the package 'cluster', it says that NAs are allowed, and sure enough it works for a small example like : > m <- matrix(c( 1, 1, 1, 2, 1, NA, 1, 1, 1, 2, 2, 2), nrow = 3, byrow = TRUE) > agnes(m) Call: agnes(x = m) Agglomerative coefficient: 0.1614168 Order of objects: [1] 1 2 3 Height (summary): Min. 1st Qu. Median Mean 3rd

Re: Clustering in R

2004 Jun 17

Re: Clustering in R

Thanks a lot, Michael! I cc to R-help, where this question really belongs {as the 'Subject' suggests itself...} -- please drop 'bioconductor' from CC'ing further replies. >>>>> "michael" == michael watson (IAH-C) <michael.watson at bbsrc.ac.uk> >>>>> on Thu, 17 Jun 2004 09:16:59 +0100 writes: michael> OK, admittedly it

hc2Newick is different than th hclust dendrogram

2011 Apr 01

hc2Newick is different than th hclust dendrogram

Hi R helpers... I am having troubles because of the discrepancy between the dendrogram plotted from hclust and what is wrote in the hc2Newick file. I've got a matrix C: > hc <- hclust(dist(C)) > plot(hc) with the: > write(hc2Newick(hc),file='test.newick') both things draw completely different "trees"... I have also tried with the raw distance matrix D and

Manually modifying an hclust dendrogram to remove singletons

2012 May 24

Manually modifying an hclust dendrogram to remove singletons

Dear R-Help, I have a clustering problem with hclust that I hope someone can help me with. Consider the classic hclust example: hc <- hclust(dist(USArrests), "ave") plot(hc) I would like to cut the tree up in such a way so as to avoid small clusters, so that we get a minimum number of items in each cluster, and therefore avoid singletons. e.g. in this example, you can see

hierarchical clustering of large dataset

2012 Mar 08

hierarchical clustering of large dataset

Hello All, i've a set of observations that is in the form : a, b, c, d, e, f 67.12, 4.28, 1.7825, 30, 3, 16001 67.12, 4.28, 1.7825, 30, 3, 16001 66.57, 4.28, 1.355, 30, 3, 16001 66.2, 4.28, 1.3459, 13, 3, 16001 66.2, 4.28, 1.3459, 13, 3, 16001 66.2, 4.28, 1.3459, 13, 3, 16001 66.2,

Hierarchical clustering using own distance matrices

2010 May 25

Hierarchical clustering using own distance matrices

Hey Everyone! I wanted to carry out Hierarchical clustering using distance matrices i have calculated ( instead of euclidean distance etc.) I understand as.dist is the function for this, but the distances in the dendrogram i got by using the following script(1) were not the distances defined in my distance matrices. script: var<-read.table("the distance matrix i calculated",

Clustering with 'agnes'

2004 Feb 04

Clustering with 'agnes'

Hello, I had a question regarding clustering using the agnes() function from the 'cluster' package. I was wondering if anyone knew how I can identify cluster points after running the agnes function. For example, I created a dataset with points randomly scattered around (0,0), (0,1) and (1,0). After clustering, the dendrogram shows all the clustered points and I get the ordering and

cutree with agnes

2003 Dec 11

cutree with agnes

Hi, this is rather a (presumed) bug report than a question because I can solve my personal statistical problem by working with hclust instead of agnes. I have done a complete linkage clustering on a dist object dm with 30 objects with agnes (R 1.8.0 on RedHat) and I want to obtain the partition that results from a cut at height=0.4. I run > cl1a <- agnes(dm, method="complete")

cutree with agnes

2003 Dec 11

cutree with agnes

Converting a Dissimilarity Matrix

2004 Feb 06

Converting a Dissimilarity Matrix

Hi all, I'm trying to perform a hierarchical clustering on some dissimilarity data that I have but the data matrix I have already contains the dissimilarity values. These values are calculated using a separate program. The dissimilarity matrix in complete with no missing values but the hclust, and agnes routines require it in the form produced by daisy or dist. Is there any of converting

similar to: Hierarchical Clustering Using Mutual Information