similar to: Hierarchical Clustering Using Mutual Information

Displaying 20 results from an estimated 5000 matches similar to: "Hierarchical Clustering Using Mutual Information"

2003 Nov 08
2
help with hierarchical clustering
I have a large excel file with data in it. I converted it to a 'csv' format. I imported this dataset to R using the follownig command mldata <- read.csv("c:\\temp\\mldata.csv", header=T) all the column names and the rows seems to be correct. Now that I have this object, I need to perfrom hclust. I used the following hc <- hclust(dist(mldata), method="single")
2007 Dec 21
3
Finding overlaps in vector
<posted & mailed> Dear all, I'm trying to solve the problem, of how to find clusters of values in a vector that are closer than a given value. Illustrated this might look as follows: vector <- c(0,0.45,1,2,3,3.25,3.33,3.75,4.1,5,6,6.45,7,7.1,8) When using '0.5' as the proximity requirement, the following groups would result: 0,0.45 3,3.25,3.33,3.75,4.1 6,6.45 7,7.1
2011 Jun 09
1
k-nn hierarchical clustering
Hi there, is there any R-function for k-nearest neighbour agglomerative hierarchical clustering? By this I mean standard agglomerative hierarchical clustering as in hclust or agnes, but with the k-nearest neighbour distance between clusters used on the higher levels where there are at least k>1 distances between two clusters (single linkage is 1-nearest neighbour clustering)? Best regards,
2011 May 11
2
hierarchical clustering within a size limit
Hello List, I am trying to implement a hierarchical cluster using the hclust method agglomerative single linkage method with a small wrinkle. I would like to cluster a set of numbers on a number line only if they are within a distance of 500. I would then like to print out the members of this list. So far I can put a vector: > x<-c(2,10,200,300,600,700) into a distance matrix: >
2013 Aug 22
1
Interpreting the result of 'cutree' from hclust/heatmap.2
I have the following code that perform hiearchical clustering and plot them in heatmap. __ library(gplots) set.seed(538) # generate data y <- matrix(rnorm(50), 10, 5, dimnames=list(paste("g", 1:10, sep=""), paste("t", 1:5, sep=""))) # the actual data is much larger that the above # perform hiearchical clustering and plot heatmap test <- heatmap.2(y)
2000 Aug 31
2
Multiv / hierclust / plclust
I use hierclust (hierarchical clustering) in multiv package. In the documentation it is said that plclust (plotting a dendrogram) is available in S-plus. Can I find it anywhere (I have searched through http://lib.stat.cmu.edu/S and found only quotations of plclust in multiv) or is it only part of the S package (which I don't have)? Thanks --------------- Charles RAUX, Laboratoire
2001 Jun 12
1
cophenetic matrix
Hello, I analyse some free-sorting data so I use hierarchical clustering. I want to compare my proximity matrix with the tree representation to evalute the fitting. (stress, cophenetic correlation (pearson's correlation)...) "The cophenetic similarity of two objects a and b is defined as the similarity level at wich objects a and b become members of the same cluster during the course of
2010 Jun 11
2
Clustering algorithms don't find obvious clusters
I have a directed graph which is represented as a matrix on the form 0 4 0 1 6 0 0 0 0 1 0 5 0 0 4 0 Each row correspond to an author (A, B, C, D) and the values says how many times this author have cited the other authors. Hence the first row says that author A have cited author B four times and author D one time. Thus the matrix represents two groups of authors: (A,B) and (C,D) who cites
2015 Dec 23
2
Cannot allocate vector of size
Antes de nada, me gustaría daros las gracias por toda vuestra ayuda. He estado probando todo lo que me habéis dicho a la vez, y no hay manera, sigo teniendo el problema con el espacio. En cuanto al tamaño de la base de datos, es más grande de lo que puse, me equivoqué y puse el tamaño de una base anterior con la que estuve trabajando, la actual tiene 36866 filas x 6500 columnas. He seguido todas
2006 Jan 26
1
Clustering Question
Hi group, My case has N physicians with each seeing M patients. One physician could have seen a group of patients, or, a patient could have been seen by multiple number of physicians. In order words, there are overlaps. Now, I have the following NxM matrix Patient#1 Patient#2 Patient#3 ....... Patient#m Physician#1 1 0 1 ....... 0 Physician#2
2011 Jan 27
3
agnes clustering and NAs
Hello, In the documentation for agnes in the package 'cluster', it says that NAs are allowed, and sure enough it works for a small example like : > m <- matrix(c( 1, 1, 1, 2, 1, NA, 1, 1, 1, 2, 2, 2), nrow = 3, byrow = TRUE) > agnes(m) Call: agnes(x = m) Agglomerative coefficient: 0.1614168 Order of objects: [1] 1 2 3 Height (summary): Min. 1st Qu. Median Mean 3rd
2004 Jun 17
1
Re: Clustering in R
Thanks a lot, Michael! I cc to R-help, where this question really belongs {as the 'Subject' suggests itself...} -- please drop 'bioconductor' from CC'ing further replies. >>>>> "michael" == michael watson (IAH-C) <michael.watson at bbsrc.ac.uk> >>>>> on Thu, 17 Jun 2004 09:16:59 +0100 writes: michael> OK, admittedly it
2011 Apr 01
2
hc2Newick is different than th hclust dendrogram
Hi R helpers... I am having troubles because of the discrepancy between the dendrogram plotted from hclust and what is wrote in the hc2Newick file. I've got a matrix C: > hc <- hclust(dist(C)) > plot(hc) with the: > write(hc2Newick(hc),file='test.newick') both things draw completely different "trees"... I have also tried with the raw distance matrix D and
2012 May 24
4
Manually modifying an hclust dendrogram to remove singletons
Dear R-Help, I have a clustering problem with hclust that I hope someone can help me with. Consider the classic hclust example: hc <- hclust(dist(USArrests), "ave") plot(hc) I would like to cut the tree up in such a way so as to avoid small clusters, so that we get a minimum number of items in each cluster, and therefore avoid singletons. e.g. in this example, you can see
2012 Mar 08
2
hierarchical clustering of large dataset
Hello All, i've a set of observations that is in the form : a, b, c, d, e, f 67.12, 4.28, 1.7825, 30, 3, 16001 67.12, 4.28, 1.7825, 30, 3, 16001 66.57, 4.28, 1.355, 30, 3, 16001 66.2, 4.28, 1.3459, 13, 3, 16001 66.2, 4.28, 1.3459, 13, 3, 16001 66.2, 4.28, 1.3459, 13, 3, 16001 66.2,
2010 May 25
1
Hierarchical clustering using own distance matrices
Hey Everyone! I wanted to carry out Hierarchical clustering using distance matrices i have calculated ( instead of euclidean distance etc.) I understand as.dist is the function for this, but the distances in the dendrogram i got by using the following script(1) were not the distances defined in my distance matrices. script: var<-read.table("the distance matrix i calculated",
2004 Feb 04
1
Clustering with 'agnes'
Hello, I had a question regarding clustering using the agnes() function from the 'cluster' package. I was wondering if anyone knew how I can identify cluster points after running the agnes function. For example, I created a dataset with points randomly scattered around (0,0), (0,1) and (1,0). After clustering, the dendrogram shows all the clustered points and I get the ordering and
2003 Dec 11
1
cutree with agnes
Hi, this is rather a (presumed) bug report than a question because I can solve my personal statistical problem by working with hclust instead of agnes. I have done a complete linkage clustering on a dist object dm with 30 objects with agnes (R 1.8.0 on RedHat) and I want to obtain the partition that results from a cut at height=0.4. I run > cl1a <- agnes(dm, method="complete")
2003 Dec 11
1
cutree with agnes
Hi, this is rather a (presumed) bug report than a question because I can solve my personal statistical problem by working with hclust instead of agnes. I have done a complete linkage clustering on a dist object dm with 30 objects with agnes (R 1.8.0 on RedHat) and I want to obtain the partition that results from a cut at height=0.4. I run > cl1a <- agnes(dm, method="complete")
2004 Feb 06
2
Converting a Dissimilarity Matrix
Hi all, I'm trying to perform a hierarchical clustering on some dissimilarity data that I have but the data matrix I have already contains the dissimilarity values. These values are calculated using a separate program. The dissimilarity matrix in complete with no missing values but the hclust, and agnes routines require it in the form produced by daisy or dist. Is there any of converting