Displaying 20 results from an estimated 4000 matches similar to: "distance in the function kmeans"
2006 Jul 09
2
distance in kmeans algorithm?
Hello.
Is it possible to choose the distance in the kmeans algorithm?
I have m vectors of n components and I want to cluster them using kmeans
algorithm but I want to use the Mahalanobis distance or another distance.
How can I do it in R?
If I use kmeans, I have no option to choose the distance.
Thanks in advance,
Arnau.
2010 May 05
2
custom metric for dist for use with hclust/kmeans
Hi guys,
I've been using the kmeans and hclust functions for some time now and
was wondering if I could specify a custom metric when passing my data
frame into hclust as a distance matrix. Actually, kmeans doesn't even
take a distance matrix; it takes the data frame directly. I was
wondering if there's a way or if there's a package that lets you
create distance matrices from
2006 Aug 07
5
kmeans and incom,plete distance matrix concern
Hi there
I have been using R to perform kmeans on a dataset. The data is fed in using read.table and then a matrix (x) is created
i.e:
[
mat <- matrix(0, nlevels(DF$V1), nlevels(DF$V2),
dimnames = list(levels(DF$V1), levels(DF$V2)))
mat[cbind(DF$V1, DF$V2)] <- DF$V3
This matrix is then taken and a distance matrix (y) created using dist() before performing the kmeans clustering.
My query
2008 Mar 03
1
silhouette plot for kmeans result
Dear All,
Is there any existing code for plotting silhouette for kmeans clustering
results?
Many thanks!
Linda
[[alternative HTML version deleted]]
2006 Apr 07
2
cclust causes R to crash when using manhattan kmeans
Dear R users,
When I run the following code, R crashes:
require(cclust)
x <- matrix(c(0,0,0,1.5,1,-1), ncol=2, byrow=TRUE)
cclust(x, centers=x[2:3,], dist="manhattan", method="kmeans")
While this works:
cclust(x, centers=x[2:3,], dist="euclidean", method="kmeans")
I'm posting this here because I am not sure if it is a bug.
I've been searching
2011 Oct 03
4
distance coefficient for amatrix with ngative valus
Hi,
I need to run a PCoA (PCO) for a data set wich has both positive and negative values for variables. I could not find any distancecoefficient other than euclidean distace running for the data set. Are there any other coefficient works with negtive values.Also I cannot get summary out put (the eigen values) for PCO as for PCA.
Thanks.
Dilshan
[[alternative HTML version deleted]]
2007 Jun 21
1
MDS size limitations
What are the limitations on size of matrix for MDS functions?
steve
--
Steve Antos
Priva-Technologies
847-640-9020 x3114
cell (540)409-1231
[[alternative HTML version deleted]]
2013 May 02
2
Calculating distance matrix for large dataset
Dear R users
I wondered if any of you ever tried to calculate distance matrix with very
large data set, and if anyone out there can confirm this error message I
got actually mean that my data is too large for this task.
negative length vectors are not allowed
My data size and code used
dim(mydata_nor)[1] 365000 144> d <- dist(mydata_nor, method = "euclidean")
Here my
2003 Sep 12
2
partial mantel
Dear all,
Has anyone written R code for partial Mantel Tests- as described for
instance in Legtendre & Legendre (1998) ?
In other words, in a community ecology analysis, I would like to calculate
the correlation between two dissimilarity matrices, controlling for a third
distance matrix representing geographical distances between sites.
Thanks!
Christophe Bouget
Biodiversité et gestion des
2007 Apr 22
2
distance method in kmeans
I am trying to cluster some binary data using k-means . As the regular "kmeans" available from stats package in R does'nt provide the option to change the distance method. I was wondering there is any package available to specify type of distance measure to be used in k means clustering in R. Especially distances like "Jaccard" which is good for binary data.
2010 Aug 18
1
Plotting K-means clustering results on an MDS
Hello All,
I'm having some trouble figuring out what the clearest way to plot my
k-means clustering result on an my existing MDS.
First I performed MDS on my distance matrix (note: I performed k-means on
the MDS coordinates because applying a euclidean distance measure to my raw
data would have been inappropriate)
canto.MDS<-cmdscale(canto)
I then figured out what would be my optimum
2007 Jul 23
2
cmdscale question
Hi.
I know matrices that use distances between places works fine when using
cmdscale. However, what about matricies such as:
A B C D E
A 0 1 23 12 9
B 1 0 10 12 3
C 23 10 0 23 4
D 12 12 23 0 21
E 9 3 4 21 0
i.e. matrices which do not represent physical distances between places (as
they would not make sense for real distances such as the one above)
2004 Feb 26
2
Multidimensional scaling and distance matrices
Dear All,
I am in the somewhat unfortunate position of having to reproduce the
results previously obtained from (non-metric?) MDS on a "kinship" matrix
using Statistica. A kinship matrix measures affinity between groups, and
has its maximum values on the diagonal.
Apparently, starting with a nxn kinship matrix, all it was needed to do
was to feed it to Statistica flagging that the
2011 May 18
3
Help with 2-D plot of k-mean clustering analysis
Hi, all
I would like to use R to perform k-means clustering on my data which
included 33 samples measured with ~1000 variables. I have already used
kmeans package for this analysis, and showed that there are 4 clusters in my
data. However, it's really difficult to plot this cluster in 2-D format
since the "huge" number of variables. One possible way is to project the
2008 May 12
2
k means
Hi the devel list,
I am using K means with a non standard distance. As far as I see, the
function kmeans is able to deal with 4 differents algorithm, but not
with a user define distance.
In addition, kmeans is not able to deal with missing value whereas
there is several solution that k-means can use to deal with them ; one
is using a distance that takes the missing value in account, like a
2005 Mar 31
2
Using kmeans given cluster centroids and data with NAs
Hello,
I have used the functions agnes and cutree to cluster my data (4977
objects x 22 variables) into 8 clusters. I would like to refine the
solution using a k-means or similar algorithm, setting the initial
cluster centres as the group means from agnes. However my data matrix
has NA's in it and the function kmeans does not appear to accept this?
> dim(centres)
[1] 8 22
> dim(data)
2004 Sep 08
8
isoMDS
Dear List:
I have a question regarding an MDS procedure that I am accustomed to
using. I have searched around the archives a bit and the help doc and
still need a little assistance. The package isoMDS is what I need to
perform the non-metric scaling, but I am working with similarity
matrices, not dissimilarities. The question may end up being resolved
simply.
Here is a bit of substantive
2007 Mar 20
1
about hcluster
Hi all:
As to hcluster,how can I control the cluster is performed according to rows(genes for instance) or columns(samples for instance)?
I can't find the parameters for it.
Thanks a lot!
My best!
2011 Jul 08
1
Visualizing a dissimilarity matrix in Euclidean space
Hi,
I have a set of nodes and a dissimilarity matrix for them, as well as a csv
file in which the diss matrix has been converted to [node_1, node_2,
dissimilarity] format. I would like to visualize this as a graph in
Euclidean space (that is, similar nodes clumped together in clusters),
rather than the seriation visualization given by dissplot(). I am using
Network WorkBench for my
2008 Apr 27
1
An ANOVA test that uses a distance matrix like hierarchical cluster analysis?
Hi All,
I have a question which does not pertain directly to the use of R but comes
from my use of R!
I have data which can be described as 3-dimensional e.g. (x,y,z), with no
negative component. The suggested way to analyze this data is via
multivariate techniques or by calculating what amounts to a levene's test on
the data and then an ANOVA on the three components if the first test is