Displaying 20 results from an estimated 800 matches similar to: "classification algorithms with distance matrix"
2006 Apr 10
2
passing known medoids to clara() in the cluster package
Greetings,
I have had good success using the clara() function to perform a simple cluster
analysis on a large dataset (1 million+ records with 9 variables).
Since the clara function is a wrapper to pam(), which will accept known medoid
data, I am wondering if this too is possible with clara() ... The
documentation does not suggest that this is possible.
Essentially I am trying to
2008 Feb 22
2
Looping and Pasting
Hello R-community: Much of the time I want to use loops to look at graphs,
etc. For example,
I have 25 plots, for which the names are m.1$medoids, m.2$medoids, ...,
m.25$medoids.
I want to index the object number (1:25) as below (just to show concept).
for (i in 1:25){
plot(m.i$medoids)
}
I've tried the following, with negative results
for ...
2009 Mar 29
1
[cluster package question] What is the "sum of the dissimilarities" in the pam command ?
Hello Martin Maechler and All,
A simple question (I hope):
How can I compute the "sum of the dissimilarities" that appears in the pam
command (from the cluster package) ?
Is it the "manhattan" distance (such as the one implemented by "dist") ?
I am asking since I am running clustering on a dataset. I found 7 medoids
with the pam command, and from it I have the
2024 Sep 17
1
Getting individual co-ordinate points in k medoids cluster
Hello, I am using k-medoids in R to generate sets of clusters for datasets
through time. I can plot the individual clusters OK but what I cannot find
is a way of pulling out the co-ordinates of the individual points in the
cluster diagrams - none of the kmed$... info sets seems to be this.
Below is an example of a k-medoids program using the built-in US arrests
dataset - this is not the data I am
2008 Jul 02
1
Extracting regression coef. and p-values in JRClient
Hi there,
I am using JRClient to build logistic regression model in the
following manner :
Rconnection c = new Rconnection();
c.eval("KSN<-read.table(\"/Users/amine/Documents/Research/Tools/R/D2R1.txt\",header=T,sep=\",\")");
c.eval("result <- glm(Nausea ~ Kaletra*Sustiva, family = binomial(link = logit), data =KSN)");
2015 Apr 29
2
amount of data
Hello.
Instead of using cluster analyses that involve distances, I would
try k-means or pam (partition around medoids), but
using samples; the clara function in the cluster library can
help you. I am pasting the Details section of the help for 'clara'
Details
clara is fully described in chapter 3 of Kaufman and Rousseeuw (1990).
Compared to other partitioning methods such as pam,
2011 May 16
1
pam() clustering for large data sets
Hello everyone,
I need to do k-medoids clustering for data which consists of 50,000
observations. I have computed distances between the observations
separately and tried to use those with pam().
I got the "cannot allocate vector of length" error and I realize this
job is too memory intensive. I am at a bit of a loss on what to do at
this point.
I can't use clara(), because I
2010 Apr 24
4
DICE Coefficient of similarity measure
Hi,
I wanted the DICE coefficient (similarity measure for binary variables)
to be calculated in R and found that the "igraph" package has the option
of "similarity.dice" to do this. But, for this command, the input object
should be an igraph object. But, I have a dataframe of columns
containing 1's and 0's. Can I convert this dataframe into an igraph
object, so that
2015 Apr 29
2
amount of data
The drawback of k-means is that the number of segments has to be defined in advance, and that is something I do not have. Javier's solution seems to me to be the only option.
Regards,
Ricardo Alva Valiente
-----Original message-----
From: R-help-es [mailto:r-help-es-bounces en r-project.org] On behalf of javier.ruben.marcuzzi en gmail.com
Sent: Wednesday, April 29,
2008 Dec 17
1
bug (?!) in "pam()" clustering from fpc package ?
Hello all.
I wish to run k-means with "manhattan" distance.
Since this is not supported by the function "kmeans", I turned to the "pam"
function in the "fpc" package.
Yet, when I tried to have the algorithm run with different starting points,
I found that pam ignores them and keeps on starting the algorithm from the same
starting-points (medoids).
For my
2015 Apr 29
2
amount of data
Good contribution... excellent!!
Regards,
Ricardo Alva Valiente
From: Jose Luis Cañadas Reche [mailto:canadasreche en gmail.com]
Sent: Wednesday, April 29, 2015 12:51 PM
To: Alva Valiente, Ricardo (RIAV); 'javier.ruben.marcuzzi en gmail.com'; R-help-es en r-project.org
Subject: Re: [R-es] amount of data
You could run several k-means with different numbers of clusters and check how
2004 Jun 29
1
give PAM my own medoids
Hello,
When using PAM (partitioning around medoids), I would like to skip the
build-step and give the function my own medoids.
Do you know if it is possible, and how?
Thank you very much.
Isabel
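For what it is worth, current versions of the cluster package document a medoids argument to pam() that takes a length-k vector of observation indices and uses them in place of the build step. The following is only a minimal sketch under that assumption; the toy data and the chosen indices (init) are made up for illustration and are not from the original post.

```r
library(cluster)  # recommended package, shipped with standard R installations

set.seed(1)
# Toy data (illustrative only): two well-separated groups of 10 points each
x <- rbind(matrix(rnorm(20, mean = 0), ncol = 2),
           matrix(rnorm(20, mean = 10), ncol = 2))

# Hypothetical starting medoids: observation 1 and observation 11
init <- c(1, 11)

# Supplying `medoids` replaces the build phase; the swap phase still runs,
# so the final medoids may differ from the ones supplied
fit <- pam(x, k = 2, medoids = init)

# With do.swap = FALSE the swap phase is skipped as well, so the
# supplied medoids are kept and observations are only assigned to them
fixed <- pam(x, k = 2, medoids = init, do.swap = FALSE)
fixed$id.med  # indices of the medoid observations
```

If the seeds come from another analysis (for example a hierarchical clustering), they would first have to be mapped to row indices of the data, since the argument expects observation numbers rather than coordinates.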
2005 May 19
2
Bug#305932: rsync on a directory transfers the files of this directory
Hi,
I got the following report from a Debian user, about --files-from
transferring the contents of a dir (i.e. including the files in it)
specified in the input, even though the files aren't listed in the input.
This happens only when the dir name ends with a slash. I asked him to
cook up a script to reproduce this (as it wasn't quite clear to me at
first what happened exactly).
Any
2005 Jun 07
1
Specifying medoids in PAM?
I am using the PAM algorithm in the CLUSTER library.
When I allow PAM to seed the medoids using the default __build__
algorithm things work
well:
> pam(stats.table, metric="euclidean", stand=TRUE, k=5)
But I have some clusters from a Hierarchical analysis that I would
like to use as seeds for the PAM algorithm. I can't figure what the
medoid argument wants. When I put in the
2004 May 24
3
CVS pserver available again
I've built and installed 1.11.15-5 from the Fedora updates.
-Tom
--
Tom Eastep \ Nothing is foolproof to a sufficiently talented fool
Shoreline, \ http://shorewall.net
Washington USA \ teastep@shorewall.net
2009 Apr 24
2
Sound on PV Guest
Hello
I am running Xen 3.3.1 on Debian x86 with 2.6.26 Dom0. I have a PV
guest running another Debian with 2.6.18 kernel. All kernels are
precompiled Debian ones.
Everything works quite well. I have a modified hypervisor running to
intercept syscalls issued by the PV guest and analyze the behavior of a
given PID.
But this is not an issue, as it works fine. My concern is about the
applications
2006 Apr 07
1
fuzzy classification and dissimilarity matrix
Hello,
I want to make a fuzzy classification from a dissimilarity matrix
(calculated with daisy from package 'cluster'). I have tried to use
fanny (package cluster) but I have the same problems as described in a
previous message
(http://tolstoy.newcastle.edu.au/R/help/05/05/4546.html) i.e. it always
gives me two clusters in the results (even if k is different from 2)
with the same
2006 Apr 05
1
"partitioning cluster function"
Hi All,
For the function "bclust"(e1071), the argument "base.method" is
explained as "must be the name of a partitioning cluster function
returning a list with the same components as the return value of
'kmeans'".
In my understanding, there are three partitioning cluster functions in
R, which are "clara, pam, fanny". Then I check each of them to
2015 Apr 29
2
amount of data
Dear all,
I think there may be a problem with the operating system: being 32-bit, if I remember correctly it supports up to 4 GB, although I am not entirely sure.
The 292 GB that Carlos reports is enormous; those requirements are difficult.
What is the possibility of working with virtual memory in Windows? Although I think it would not be optimal, I would rather try Linux and R.
Su
2008 Aug 01
2
Exporting data to a text file
Hi R users
With the clara function I get a data frame (maybe this is not the exact word,
I'm new to R) with the following variables:
> names(myclara)
[1] "sample" "medoids" "i.med" "clustering" "objective"
[6] "clusinfo" "diss" "call" "silinfo" "data"
I want to