Displaying 20 results from an estimated 5000 matches similar to: "plot and validation in clustering"
2005 May 30
2
How to access to sum of dissimilarities in CLARA
Dear All ,
Since dissimilarity is one of quality measures in clustering , I'm trying to access to the sum of dissimilarity as a whole measure. But after running my data using CLARA I obtain :
1128 dissimilarities, summarized :
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.033155 0.934630 2.257000 2.941600 4.876600 8.943700
But I can not find the sum of dissimilarity.How can i
2009 Jun 11
1
Cluster analysis, defining center seeds or number of clusters
I use kmeans to classify spectral events in high and low 1/3 octave bands:
#Do cluster analysis
CyclA<-data.frame(LlowA,LhghA)
CntrA<-matrix(c(0.9,0.8,0.8,0.75,0.65,0.65), nrow = 3, ncol=2, byrow=TRUE)
ClstA<-kmeans(CyclA,centers=CntrA,nstart=50,algorithm="MacQueen")
This works well when the actual data shows 1,2 or 3 groups that are not
"too close" in a cross plot.
2008 Jul 29
2
About clustering techniques
Hello R users
It's some time I am playing with a dataset to do some cluster analysis. The
data set consists of 14 columns being geographical coordinates and monthly
temperatures in annual files
latitutde - longitude - temperature 1 -..... - temperature 12
I have some missing values in some cases, maybe there are 8 monthly valid
values at some points with four non valid. I don't want to
2011 Aug 10
4
Clustering Large Applications..sort of
Hello all,
I am using the clustering functions in R in order to work with large
masses of binary time series data, however the clustering functions do not
seem able to fit this size of practical problem. Library 'hclust' is good
(though it may be sub par for this size of problem, thus doubly poor for
this application) in that I do not want to make assumptions about the number
of
2008 Mar 03
1
silhouette plot for kmeans result
Dear All,
Is there any existing code for plotting silhouette for kmeans clustering
results?
Many thanks!
Linda
[[alternative HTML version deleted]]
2006 Apr 05
1
"partitioning cluster function"
Hi All,
For the function "bclust"(e1071), the argument "base.method" is
explained as "must be the name of a partitioning cluster function
returning a list with the same components as the return value of
'kmeans'.
In my understanding, there are three partitioning cluster functions in
R, which are "clara, pam, fanny". Then I check each of them to
2006 Feb 05
3
Cluster Analysis - Number of Clusters
Hello,
I'm playing around with cluster analysis, and am looking for methods to
select the number of clusters. I am aware of methods based on a 'pseudo
F' or a 'pseudo T^2'. Are there packages in R that will generate these
statistics, and/or other statistics to aid in cluster number selection?
Thanks,
John.
--
2008 Dec 17
1
bug (?!) in "pam()" clustering from fpc package ?
Hello all.
I wish to run k-means with "manhattan" distance.
Since this is not supported by the function "kmeans", I turned to the "pam"
function in the "fpc" package.
Yet, when I tried to have the algorithm run with different starting points,
I found that pam ignores and keep on starting the algorithm from the same
starting-points (medoids).
For my
2001 Apr 27
0
weithed clustering (was: Re: problems with a large data set)
kmeans and clara work great. Thank you for the tip.
I have another question:
Is it possible to weight the observations in a cluster analysis ? I haven't
found any mention of this in the kmeans of clara help texts.
Moritz Lennert
Charg? de recherche
IGEAT - ULB
t?l: 32-2-650.65.16
fax: 32-2-650.50.92
email: mlennert at ulb.ac.be
> On Wed, 25 Apr 2001, Moritz Lennert wrote:
>
2008 Mar 06
2
Clustering large data matrix
Hello,
I have a large data matrix (68x13112), each row corresponding to one
observation (patients) and each column corresponding to the variables
(points within an NMR spectrum). I would like to carry out some kind of
clustering on these data to see how many clusters are there. I have
tried the function clara() from the package cluster. If I use the matrix
as is, I can perform the clara
2010 Aug 18
1
Plotting K-means clustering results on an MDS
Hello All,
I'm having some trouble figuring out what the clearest way to plot my
k-means clustering result on an my existing MDS.
First I performed MDS on my distance matrix (note: I performed k-means on
the MDS coordinates because applying a euclidean distance measure to my raw
data would have been inappropriate)
canto.MDS<-cmdscale(canto)
I then figured out what would be my optimum
2009 Dec 11
1
cluster size
hi r-help,
i am doing kmeans clustering in stats. i tried for five clusters clustering using:
kcl1 <- kmeans(as1[,c("contlife","somlife","agglife","sexlife",
"rellife","hordlife","doutlife","symtlife","washlife",
2006 Jan 07
1
Clustering and Rand Index
Dear WizaRds,
I am trying to compute the (adjusted) Rand Index in order to comprehend
the variable selection heuristic (VS-KM) according to Brusco/ Cradit
2001 (Psychometrika 66 No.2 p.249-270, 2001).
Unfortunately, I am unable to correctly use
cl_ensemble and cl_agreement (package: clue). Here is what I am trying
to do:
library(clue)
## Let p1..p4 be four partitions of the kind
2012 Aug 28
1
K-Means clustering Algorithm
I was wondering if there was an R equivalent to the two phased approach that
MATLAB uses in performing the Kmeans algorithm. If not is there away that I
can determine if the kmeans in R and the kmeans in MATLAB are essentially
giving me the same clustering information within a small amount of error?
--
View this message in context:
2003 Apr 14
2
kmeans clustering
Hi,
I am using kmeans to cluster a dataset.
I test this example:
> data<-matrix(scan("data100.txt"),100,37,byrow=T)
(my dataset is 100 rows and 37 columns--clustering rows)
> c1<-kmeans(data,3,20)
> c1
$cluster
[1] 1 1 1 1 1 1 1 3 3 3 1 3 1 3 3 1 1 1 1 3 1 3 3 1 1 1 3 3 1 1 3 1 1 1 1 3
3
[38] 3 1 1 1 3 1 1 1 1 3 3 3 1 1 1 1 1 1 3 1 3 1 1 3 1 1 1 1 3 1 1 1 1 1 1 3
2008 May 09
2
K-Means Clustering
Hello,
I am hoping you can help me with a question concerning kmeans clustering
in R. I am working with the following data-set (abbreviated):
BMW Ford Infiniti Jeep Lexus Chrysler Mercedes Saab Porsche
Volvo
[1,] 6 8 2 8 4 5 4 4 7 7
[2,] 8 7 4 6 4 1 6 7 8 5
[3,] 8 2 4
2011 Mar 31
1
Cluster analysis, factor variables, large data set
Dear R helpers,
I have a large data set with 36 variables and about 50.000 cases. The
variabels represent labour market status during 36 months, there are 8
different variable values (e.g. Full-time Employment, Student,...)
Only cases with at least one change in labour market status is
included in the data set.
To analyse sub sets of the data, I have used daisy in the
cluster-package to create
2007 Jul 18
1
Can any one help me on format file data.
Hi all.
I'd like know what is the format file saved by Leica
Microsystems TCS SP2-AOBS equipped with a SP2-FCS2 Leica Microsystems
workstation its datas. Cause it save in *.fcs extention file but
ins't flow cytometry standart format file...
Tahnks Horacio.
2007 Jul 18
2
EM unsupervised clustering
Hi All,
I have a n x m matrix. The n rows are individuals, the m columns are variables.
The matrix is in itself a collection of 1s (if a variable is observed for an
individual), and 0s (is there is no observation).
Something like:
[,1] [,2] [,3] [,4] [,5] [,6]
[1,] 1 0 1 1 0 0
[2,] 1 0 1 1 0 0
[3,] 1 0 1 1 0 0
[4,] 0 1 0
2003 Jan 30
2
Validation of clustering
Hi,
I'm using the library cluster to cluster a set of figures (method CLARA).
Somebody that it work with clustering would know informs what I make to
evaluate the clustering?
Tks VM,
Francisco.
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Francisco Júnior,
Computer Science - UFPE-Brazil
"One life has more value that the
world whole"
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^