Displaying 20 results from an estimated 4000 matches similar to: "Clustering quality measure"
2003 Jun 12
3
Multiple imputation
Hi all,
I'm currently working with a dataset that has quite a few missing
values and after some investigation I figured that multiple imputation
is probably the best solution to handle the missing data in my case. I
found several references to functions in S-Plus that perform multiple
imputation (NORM, CAT, MIX, PAN). Does R have corresponding functions?
I searched the archives but was not
2003 Jul 24
3
Plotting math functions
Hi all,
I was wondering whether it is possible to plot math functions, for
example sin, cos or a Gaussian type function, in R, and if so, how to
do it. I have been searching through the archives and the R manual but
had no luck in finding any hints on how to go about this.
Any help is much appreciated!
Thanks, Jonck
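
One way to do this in base R is curve(), which evaluates an expression in x over an interval and plots the result. The following is only a minimal sketch: the helper name gauss and the plotting ranges are assumptions made for illustration, not part of the original post.

```r
# Plot built-in functions by passing an expression in x and a range.
curve(sin(x), from = -2 * pi, to = 2 * pi, ylab = "f(x)")
curve(cos(x), add = TRUE, lty = 2)   # overlay cosine on the same axes

# A Gaussian-type function (hypothetical helper; same values as dnorm).
gauss <- function(x, mu = 0, sigma = 1) {
  exp(-(x - mu)^2 / (2 * sigma^2)) / (sigma * sqrt(2 * pi))
}
curve(gauss(x, mu = 0, sigma = 1), from = -4, to = 4, ylab = "density")
```

Any vectorised function of x can be plotted the same way; plot(x, f(x), type = "l") on a fine grid of x values is an equivalent alternative.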
2003 Jun 11
3
Multiple match function?
Hi all,
I have (yet another) question about a function in R. What I would like
to do is test for the presence of a certain value in a vector, and have
the positions that this value is at returned to me.
For example, let's say I have a vector:
x <- c(1,1,2,2,3,3,4,4)
Now I would like a function that would return positions 3 and 4 should
I test for the value "2". Or 5 and 6
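
For the vector in this excerpt, a minimal sketch using base R's which() is shown here. The variable names pos_two and pos_three are illustrative assumptions.

```r
x <- c(1, 1, 2, 2, 3, 3, 4, 4)

# which() returns the indices at which the logical condition is TRUE.
pos_two   <- which(x == 2)   # positions 3 and 4
pos_three <- which(x == 3)   # positions 5 and 6
```

Unlike match(), which reports only the first matching position, which() returns every position where the comparison holds.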
2003 Apr 16
1
Question on SOM and clustering
Hello everyone,
I'm new to this list, so let me introduce myself: my name is Jonck van
der Kogel and I am a graduate student at the Erasmus University of
Rotterdam. I am currently working on my thesis which is on the use of
artificial intelligence for large data-sets.
To do an analysis of a certain data-set I want to use Kohonen's SOM
algorithm. However, as I understand it, the SOM
2006 Feb 05
3
Cluster Analysis - Number of Clusters
Hello,
I'm playing around with cluster analysis, and am looking for methods to
select the number of clusters. I am aware of methods based on a 'pseudo
F' or a 'pseudo T^2'. Are there packages in R that will generate these
statistics, and/or other statistics to aid in cluster number selection?
Thanks,
John.
--
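
The pseudo F statistic mentioned in this question (also known as the Calinski-Harabasz index) can be computed by hand from the sums of squares that kmeans() returns. The sketch below assumes toy data and a hypothetical helper named pseudo_f; it is not taken from any particular package.

```r
set.seed(1)  # kmeans uses random starts; fix the seed for reproducibility

# Toy data (assumed): three well-separated 2-D groups of 30 points each.
centers <- rbind(c(0, 0), c(5, 5), c(0, 5))
x <- centers[rep(1:3, each = 30), ] + matrix(rnorm(180, sd = 0.5), ncol = 2)

# Pseudo F = (between-cluster SS / (k - 1)) / (within-cluster SS / (n - k)).
pseudo_f <- function(x, k) {
  fit <- kmeans(x, centers = k, nstart = 20)
  n <- nrow(x)
  (fit$betweenss / (k - 1)) / (fit$tot.withinss / (n - k))
}

ks <- 2:6
f_values <- sapply(ks, function(k) pseudo_f(x, k))
best_k <- ks[which.max(f_values)]   # candidate number of clusters
```

Larger values indicate tighter, better-separated clusters, so the k that maximises the statistic is a common candidate for the number of clusters.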
2003 Jan 30
2
Validation of clustering
Hi,
I'm using the library cluster to cluster a set of figures (method CLARA).
Could somebody who works with clustering tell me what I should do to
evaluate the clustering?
Tks VM,
Francisco.
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Francisco Júnior,
Computer Science - UFPE-Brazil
"One life has more value that the
world whole"
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
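
One common way to evaluate a CLARA partition is the average silhouette width. The sketch below assumes toy data and uses clara() and silhouette() from the cluster package; the variable names are illustrative.

```r
library(cluster)  # recommended package shipped with standard R installs

set.seed(42)
# Toy data (assumed): two separated groups in 2-D, 100 points each.
x <- rbind(matrix(rnorm(200, mean = 0), ncol = 2),
           matrix(rnorm(200, mean = 6), ncol = 2))

fit <- clara(x, k = 2)

# Silhouette widths over the full data set, using Euclidean distances.
sil <- silhouette(fit$clustering, dist(x))
avg_width <- mean(sil[, "sil_width"])
```

An average width close to 1 suggests compact, well-separated clusters, while values near 0 suggest overlapping ones. Note that dist(x) needs memory quadratic in the number of rows, so for very large data it may only be feasible on a subsample.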
2003 Jun 09
2
Appending elements to an array
Hi all,
I am having a bit of trouble with the array structure of R. What I want
to do is dynamically add/remove elements to an array. For example:
Let's say I have created an array:
> myArray <- array(c(3,8), dim=c(1,2))
> myArray
     [,1] [,2]
[1,]    3    8
And I now want to, for example, push an element (5,6) on to this array
so it will read:
     [,1] [,2]
[1,]    3    8
2003 Dec 11
1
cutree with agnes
Hi,
this is rather a (presumed) bug report than a question because I can solve
my personal statistical problem by working with hclust instead of agnes.
I have done a complete linkage clustering on a dist object dm with 30
objects with agnes (R 1.8.0 on
RedHat) and I want to obtain the partition that results from a cut at
height=0.4.
I run
> cl1a <- agnes(dm, method="complete")
2011 Aug 10
4
Clustering Large Applications..sort of
Hello all,
I am using the clustering functions in R in order to work with large
masses of binary time series data, however the clustering functions do not
seem able to fit this size of practical problem. Library 'hclust' is good
(though it may be sub par for this size of problem, thus doubly poor for
this application) in that I do not want to make assumptions about the number
of
2008 Jun 13
3
cluster.stats
Dear list,
I just tried to use the function cluster.stats in the package fpc.
I just have a couple of questions about the syntax:
cluster.stats(d,clustering,alt.clustering=NULL,
silhouette=TRUE,G2=FALSE,G3=FALSE)
1) the distance object (d) is an object obtained by the function dist() on
my own original matrix?
2) clustering is the clusters vector as result of one of the many clustering
methods?
2011 Aug 25
1
question on silhouette colours
I'm fairly new to the silhouette functionality in the cluster package, so I apologize if I'm asking something naive.
If I run the 'agnes(ruspini)' example from the silhouette section of the cluster package vignette, and assign colours to clusters, two clusters have what appear to be incorrect colours in the silhouette plot.
library(cluster)
data(ruspini)
ar <- agnes(ruspini)
2011 Jun 09
1
k-nn hierarchical clustering
Hi there,
is there any R-function for k-nearest neighbour agglomerative hierarchical
clustering?
By this I mean standard agglomerative hierarchical clustering as in hclust
or agnes, but with the k-nearest neighbour distance between clusters used
on the higher levels where there are at least k>1 distances between two
clusters (single linkage is 1-nearest neighbour clustering)?
Best regards,
2003 Apr 24
1
estimating number of clusters ("Null or more")
Hi all,
once more about the old subj :-)
My data come from too many different distribution families, and for each
particular experiment I just need to decide whether the data are "quite
homogeneous" or have two or more clusters. I've revisited the following
libraries:
amap, clust, cclust, mclust, multiv, normix, survey.
And I didn't find any ready-to-use general
2006 Oct 17
4
cluster in R
hi,
is there a good summary of clustering methods in R? It seems there
are many packages involving it.
And I have two questions on clustering here:
1. Is there a way to evaluate the effectiveness (or separation) of a
clustering (rather than by visualization)?
2. Is there a search method (like genetic search) which can help find
the best subset of attributes, i.e. the one that gives the best separation?
Thanks,
2002 Aug 07
2
cluster-analysis and NA's
Hi,
are there special cluster-analysis algorithms
that can work with NAs?
A further "problem" is that I want to cluster
variables, not cases, to identify particular variable sets.
Is it common practice to transpose the data.frame and use
kmeans, because this works with NAs, or does anybody have another
method for finding "variable sets", other than factor analysis?
thanks for
2005 Mar 04
2
Clustering of Binary data in R
Good afternoon!
I would like to ask you about similarity measures and clustering in R for binary data.
Would you please kindly help me and let me know about the relevant commands in R?
Thanks in advance for your kind attention.
I look forward to hearing from you as soon as possible.
Best regards,
Sima
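
For binary data, base R's dist() offers method = "binary", which corresponds to the Jaccard distance, and the result can be passed to hclust(). The sketch below assumes toy data; the row names and the choice of average linkage are illustrative only.

```r
# Toy binary data (assumed): rows are objects, columns are 0/1 attributes.
bin <- rbind(r1 = c(1, 1, 0, 0),
             r2 = c(1, 0, 1, 0),
             r3 = c(0, 0, 1, 1),
             r4 = c(1, 1, 0, 1))

# method = "binary" gives the Jaccard distance: among attributes that are
# "on" in at least one of two rows, the share that are "on" in only one.
d <- dist(bin, method = "binary")

# Hierarchical clustering on that distance, cut into two groups.
hc <- hclust(d, method = "average")
groups <- cutree(hc, k = 2)
```

Other similarity coefficients for binary data (for example simple matching) would need a different distance function; the Jaccard distance ignores attributes that are absent in both objects.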
2003 May 07
1
-means, hybrid clustering or similar implementations on R
Hi,
I would like to know if someone knows an extended implementation of k-means in R to find an appropriate number of clusters for given k-dimensional data.
Also, I am working on clustering for forecasting. If someone is interested or has knowledge of implementation details, please mail me; I would appreciate it.
Regards
Skanda Kallur
"Cogito, ergo sum" (I think, therefore I
2004 May 11
1
stability measures for hierarchical clustering
Dear R users,
I'm interested in measuring the stability of a hierarchical clustering, of
the overall clustering and finding sub clusters (from cutting the
hierarchical clustering at different levels) which demonstrate stability.
I saw some postings on the R help from a while back about bootstrapping for
clustering (using sample and generating a consensus tree with a web based
tool CONSENSE)
2003 Jun 10
1
SOM random seed
Hi all,
I have a question about the SOM routine. You can either supply the
initial representatives for the lattice yourself or else they are
chosen randomly from the dataset. Is it possible to pass the
random-seed as an argument somehow, when choosing the random
initialisation of the lattice?
As it is now, each time I run a SOM on a dataset with the same settings
the resulting SOM will still