Displaying 20 results from an estimated 20000 matches similar to: "Save Cluster results to data frame"
2008 Jun 13
3
cluster.stats
Dear list,
I just tried to use the function cluster.stat in the package fpc.
I just have a couple of questions about the syntax:
cluster.stats(d,clustering,alt.clustering=NULL,
silhouette=TRUE,G2=FALSE,G3=FALSE)
1) the distance object (d) is an object obtained by the function dist() on
my own original matrix?
2) clustering is the clusters vector as result of one of the many clustering
methods?
2005 Sep 29
5
Regression slope confidence interval
Hi list,
is there any direct way to obtain confidence intervals for the regression
slope from lm, predict.lm or the like?
(If not, is there any reason? This is also missing in some other statistics
softwares, and I thought this would be quite a standard application.)
I know that it's easy to implement but it's for
explanation to people who faint if they have to do their own
programming...
2011 Jun 09
1
k-nn hierarchical clustering
Hi there,
is there any R-function for k-nearest neighbour agglomerative hierarchical
clustering?
By this I mean standard agglomerative hierarchical clustering as in hclust
or agnes, but with the k-nearest neighbour distance between clusters used
on the higher levels where there are at least k>1 distances between two
clusters (single linkage is 1-nearest neighbour clustering)?
Best regards,
2010 Apr 24
4
DICE Coefficient of similarity measure
Hi,
I wanted the DICE coefficient (similarity measure for binary variables)
to be calculated in R and found that the "igraph" package has the option
of "similarity.dice" to do this. But, for this command, the input object
should be an igraph object. But, I have a dataframe of columns
containing 1's and 0's. Can I convert this dataframe into an igraph
object, so that
2006 Nov 01
1
cluster analysis using Dmax
Dear All,
a long time ago I ran a cluster analysis where the dissimilarity matrix used
consisted of Dmax (or Kolmogorov-Smirnov distance) values. In other words
the maximum difference between two cumulative proportion curves. This all
worked very well indeed. The matrix was calculated using Dbase III+ and
took a day and a half and the clustering was done using MV-ARCH, with the
resultant
2005 Aug 08
2
selecting outliers
Hi everybody,
I'd like to know if there's an easy way for extracting
outliers record from a dataset, in order to perform
further analysis on them.
Thanks
Alessandro
2005 Jul 25
1
cluster
Dear listers:
Here I have a question on clustering methods available in R. I am
trying to down-sampling the majority class in a classification problem
on an imbalanced dataset. Since I don't want to lose information in
the original dataset, I don't want to use naive down-sampling: I think
using clustering on the majority class' side to select
"representative" samples might
2011 Aug 10
4
Clustering Large Applications..sort of
Hello all,
I am using the clustering functions in R in order to work with large
masses of binary time series data, however the clustering functions do not
seem able to fit this size of practical problem. Library 'hclust' is good
(though it may be sub par for this size of problem, thus doubly poor for
this application) in that I do not want to make assumptions about the number
of
2010 Jul 02
2
K-means result - variance between cluster
Hi,
I like to present the results from the clustering method k-means in
terms of variances: within and between Cluster. The k-means object
gives only the within cluster sum of squares by cluster, so the between
variance part is missing,for calculation the following table, which I
try to get.
Number of | Variance within | Var between | Var total | F-value
Cluster k | cluster | cluster
2006 Aug 09
2
R CMD check error
Dear list,
R CMD check on my updated package now generated the following error:
"LaTeX errors when creating DVI version.
This typically indicates Rd problems."
But the Rd files (and everything else) were checked as "OK" (I
removed the problem about which I asked the list some hours ago, but
answers are still appreciated because I rather created a rough
workaround than
2010 Sep 01
2
Rd-file error: non-ASCII input and no declared encoding
Dear list,
I came across the following error for three of my newly written Rd-files:
non-ASCII input and no declared encoding
I can't make sense of this.
Below I copied in one of the three files.
Can anybody please tell me what's wrong with it?
Thank you,
Christian
\name{tetragonula}
\alias{tetragonula}
\alias{tetragonula.coord}
\docType{data}
% \non_function{}
\title{Microsatellite
2005 Aug 08
2
computationally singular
Hi,
I have a dataset which has around 138 variables and 30,000 cases. I am
trying to calculate a mahalanobis distance matrix for them and my
procedure is like this:
Suppose my data is stored in mymatrix
> S<-cov(mymatrix) # this is fine
> D<-sapply(1:nrow(mymatrix), function(i) mahalanobis(mymatrix, mymatrix[i,], S))
Error in solve.default(cov, ...) : system is computationally
2009 Dec 11
1
cluster size
hi r-help,
i am doing kmeans clustering in stats. i tried for five clusters clustering using:
kcl1 <- kmeans(as1[,c("contlife","somlife","agglife","sexlife",
"rellife","hordlife","doutlife","symtlife","washlife",
2005 Sep 21
3
Clustering and bootstrap
Dear Listers,
I emailed the list a few days ago about how to bootstrap a community
matrix (species by sites) and get a consensus tree with node support. A
friend pointed out that a similar question remained unanswered in 2004.
I wish to re-word my question: is anyone aware of a package / method to
obtain a majority-rule consensus tree from x distance matrices ? Is
anyone using R to generate
2010 Oct 10
1
Package "prabclus" not available?
Hi there,
I just tried to install the package prabclus on a computer running Ubuntu
Linux 9.04 using install.packages from within R.
This gave me a message:
Warning message:
In install.packages("prabclus") : package ?prabclus? is not available
I tried to do this selecting two different CRAN mirrors (same result) and
with other packages (installing them works fine).
Looking up the
2005 Oct 19
1
clustering algorithm detail
Hi all,
I wanted to run the hclust (or any other clustering algorithm) on a distance matrix. I have formed the distance matrix as:
distmat:
a b c d e
a 0.00 0.96 1.60 1.60 1.68
b 0.96 0.00 0.96 1.80 2.64
c 1.60 0.96 0.00 0.84 1.80
d 1.60 1.80 0.84 0.00 0.96
e 1.68 2.64 1.80 0.96 0.00
2011 May 16
1
pam() clustering for large data sets
Hello everyone,
I need to do k-medoids clustering for data which consists of 50,000
observations. I have computed distances between the observations
separately and tried to use those with pam().
I got the "cannot allocate vector of length" error and I realize this
job is too memory intensive. I am at a bit of a loss on what to do at
this point.
I can't use clara(), because I
2006 Apr 30
1
Number of Clusters
Dear R users, I am interested in clustering in R. In SAS we have some criteria for determining the number of clusters using the PROC CLUSTER procedure, which are "CCC" cubic clustering criterion (Sarl 1981), Psuedo F (PSF), and Psuedo T square (PST). My question is do thsese criterion exists in R, I tried to search and got one hit (BIC) in Mclust, which I am aware of, any input is
2010 Jan 14
1
Clustering with clara
Hello everyone
I am trying to use CLARA method for finding clusters in my spatial surface
temperature data and noticed one problem. My data are in the form
lat,lon,temperature. I extract lat,lon and cluster number for each point in
the dataset. When I plotted a map of cluster numbers I found empty areas in
the map. The point is that the number of points that were assigned a cluster
number are
2009 Apr 15
1
clustering, don't understand this error
Hello,
I am using the dunn metric, but something is wrong and I dont understand
what or what that this error mean. Please can you help me with this?
The instructions are:
#Indice de Dunn
disbupa=dist(bupa[,1:6])
a=hclust(disbupa)
cluster.stats(disbupa,a,bupa[,7])$dunn
And the error is:
Error in max(clustering) : invalid 'type' (list) of argument
thank you so much.
Ana Maria