Displaying 20 results from an estimated 800 matches similar to: "clustering problem"
2002 Mar 05
1
no labels when plotting dendrograms
I'd like to be able to cut dendrograms at a height I specify
and then plot the resulting subtrees.  I wanted to use the
dendrogram object for this purpose because there doesn't seem
to be a canned way to cut a hclust object and get a list of
hclust objects, but there is a function (cut) that does that
for dendrograms.  The problem I'm having is that when I plot
a dendrogram, I
2007 Oct 26
2
cut.dendrogram and cutree
Hi!
In the example:
hc <- hclust(dist(USArrests), "ave")
dend1 <- as.dendrogram(hc)
dend2 <- cut(dend1, h=70)
Do the branches "Branch 1", "Branch 2", "Branch 2"...in dend2$upper
  str(dend2$upper)
--[dendrogram w/ 2 branches and 4 members at h = 152]
   |--[dendrogram w/ 2 branches and 2 members at h = 77.6]
   |  |--leaf "Branch 1" (h=
2017 Jul 03
2
R memory limits on table(x, y) (and bigtabulate)
I have two character vectors x and y that have the following characteristics:
length(x)  # same as
length(y) # 872099
length(unique(x))  # 47740
length(unique(y)) # 52478
I need to crosstabulate them, which would lead to a table with
47740*52478 # 2505299720
cells, which is more than
2^31 # 2147483648
cells, which seems to be R's limit because I am getting the error message
Error in
2012 Mar 29
2
hclust and plot functions work, cutree does not
Hi,
I have the distance matrix computed and I feed it to hclust function. The
plot function produces a dense dendrogram as well. But, the cutree function
applied does not produce the desired list.
Here is the code
        x=data.frame(similarity_matrix)
colnames(x) = c(source_tags_vec)
rownames(x) = c(source_tags_vec)
clust_tree=hclust(as.dist(x),method="complete")
plot(clust_tree)
2012 Oct 11
2
extracting groups from hclust() for a very large matrix
Hello,
I'm having trouble figuring out how to see resulting groups (clusters) 
from my hclust() output. I have a very large matrix of 4371 plots and 29 
species, so simply looking at the graph is impossible. There must be a 
way to 'print' the results to a table that shows which plots were in 
what group, correct?
I've attached the matrix I'm working with (the whole thing
2002 Feb 08
0
packages for extracting subtrees
Hi.  I did write those functions, and sent them (I thought) to one of the R
maintainers to see whether they would be appropriate for inclusion (because
I'd seen some requests on the mailing lists). 
However, I'm happy to post them -- I should have thought of it before.
WARNING:  I've tested these functions on some data arising in my work and
also on the USArrests data that comes with
2017 Jul 03
0
R memory limits on table(x, y) (and bigtabulate)
Sorry, don't know enough to give you trustworthy answers,  but I can
say that crashes due to (or linked to) packages should usually be
reported to the package maintainer, who can be found by the
?maintainer function. That person may not monitor this list.
Cheers,
Bert
Bert Gunter
"The trouble with having an open mind is that people keep coming along
and sticking things into it."
2013 Nov 16
0
selecting optimal cluster validation score
Hi:
I have calculated the Silhouette score and Dunn score after
hierarchical clustering for 3 clusters:
#Distance measure
d <- dist(USArrests, method = "euclidean")
#Hierarchical clustering
hc <- hclust(dist(USArrests), "ave")
#calculating silhouette value for 3 clusters
sil<- silhouette(cutree(hc, k=3), d)
#calculating Dunn index for 3 clusters
clus <- cutree(hc,
2010 Jun 23
1
Clustering
Hi,
I use the following clustering methods and get the
corresponding dendrograms for single, complete, average, ward and
kmeans clustering.
This gives the dendrograms, but doesn't show the calculation-way.
My question: is there a possibility to show this calculation steps
(cluster steps) in matrix or graphical form?
Mit freundlichen Gr??en
Ralph Modjesch
2012 Aug 12
0
Different cluster orderings from cutree() and cut.dendrogram()
Hi!
I just discovered that cutree() and cut.dendrogram() do not assign the
same cluster numberings when called on the same tree. More specifically,
cutree() assigns cluster numbers by order of appearance in the data,
while cut.dendrogram() sorts clusters by height (see example below). I
guess this is for historical reasons?
I'm hit by this difference when I want to get a vector of cluster
2015 Jun 06
2
Request: making cutree S3 in R?
Hello all,
A question/suggestion:
I was wondering if there is a chance of changing stats::cutree to be S3 and
use cutree.hclust?
For example:
cutree <- function(tree, k = NULL, h = NULL,...)
{
  UseMethod("cutree")
}
cutree.hclust <- stats::cutree
# This will obviously need the actual content of stats::cutree
This would be nicer for people like me to add new methods to
2010 Nov 25
1
RODBC
Hi,
I am running the RODBC examples form the help guide. I am trying to
UPDATE a table in an Access data base but I am having an error.
library(RODBC)
library(termstrc)
path = getwd()
setwd(getwd())
dbName = "data.mdb"
pathdbname = paste(path,"/",dbName,sep="")
accesChannel = odbcConnectAccess(pathdbname, uid = "", pwd = "")
2003 Dec 11
1
cutree with agnes
Hi,
this is rather a (presumed) bug report than a question because I can solve
my personal statistical problem by working with hclust instead of agnes. 
I have done a complete linkage clustering on a dist object dm with 30
objects with agnes (R 1.8.0 on
RedHat) and I want to obtain the partition that results from a cut at
height=0.4.
I run
> cl1a <- agnes(dm, method="complete")
2003 Dec 11
1
cutree with agnes
Hi,
this is rather a (presumed) bug report than a question because I can solve
my personal statistical problem by working with hclust instead of agnes. 
I have done a complete linkage clustering on a dist object dm with 30
objects with agnes (R 1.8.0 on
RedHat) and I want to obtain the partition that results from a cut at
height=0.4.
I run
> cl1a <- agnes(dm, method="complete")
2014 Jul 02
0
How do I call a C++ function (for k-means) within R?
I am trying to call a C++ k-means function within R and I am struggling. I
know that the below code is used to call a C++ function for gbm but how do I
do it for k-means?
gbm.obj <- .Call("gbm",
                    Y=as.double(y),
                    Offset=as.double(offset),
                    X=as.double(x),
                    X.order=as.integer(x.order),
                   
2004 Nov 25
0
Problem with ODBC access to SQL database
I have the following problem in getting the "sqlSave" function from the
example code in the R package RODBC to work with MySQL as ODBC server:
 - a new database is created, but no data is written to it
 - the example code works just fine when I use MS Access as ODBC server.
 
----------- offending code and output -------------------
> library(RODBC);
> channel <-
2012 Feb 06
0
sqsSave() test using Test.R script returning error suggesting no values upon "insert"
Hi.
I attempted to use RODBC and decided to execute all the tests provided in "Test.r" under the unpacked package RODBC folder.
Beginning with the following:
library(RODBC)
library(MASS)
USArrests[1,2] <- NA
hills <- hills[1:15,]
row.names(hills)[12] <- "Dollar ('$')"
set.seed(1)
# MySQL
## testdb3 is ODBC/Connector 3.51.x, testdb5 is 5.1.x
channel <-
2009 Dec 23
1
prcomp : plotting only explanatory axis arrows
Dear all,
I have a very large dataset (1712351 , 20) and would like
to plot only the arrows that represent the
contribution of each variables.
On the sample below I woild like to plot
only the explanatory variables (Murder, Assault..)
and not the sites.
prcomp(USArrests)  # inappropriate
prcomp(USArrests, scale = TRUE)
prcomp(~ Murder + Assault + Rape, data = USArrests, scale = TRUE)
2011 Sep 13
2
help with hclust and cutree
Hello,
I would like to cut a hclust tree into several groups at a specific
similarity. I assume this can be achieved by specifying the "h" argument
with the specified similarity, e.g.:
clust<-hclust(dist,"average")
cut<-cutree(clust,h=0.65)
Now, I would like to draw rectangles around the branches of the
dendrogram highlighting the corresponding clusters, as is done by
2004 Jul 21
2
Cutting heatmap dendrogram
Hello,
I've been clustering my data using hclust and cutting the resulting tree
with cutree.  Separately, I visualize the clusterings with heatmap.  Is it
possible to have the dendrogram on the heatmap reflect the cutree results?
That is, instead of having one large dendrogram, it would have 4 or 25 in
the example below.  Any guidance on if that's possible or not, and what
kinds of