similar to: Clustering with clara

Displaying 20 results from an estimated 400 matches similar to: "Clustering with clara"

2008 Aug 01
2
Exporting data to a text file
HI R users With clara function I get a data frame (maybe this is not the exact word, I'm new to R) with the following variables: > names(myclara) [1] "sample" "medoids" "i.med" "clustering" "objective" [6] "clusinfo" "diss" "call" "silinfo" "data" I want to
2008 Jul 29
2
About clustering techniques
Hello R users It's some time I am playing with a dataset to do some cluster analysis. The data set consists of 14 columns being geographical coordinates and monthly temperatures in annual files latitutde - longitude - temperature 1 -..... - temperature 12 I have some missing values in some cases, maybe there are 8 monthly valid values at some points with four non valid. I don't want to
2008 Sep 30
1
CLARA and determining the right number of clusters
Hi everyone I have a question about clustering. I've managed using CLARA to get a clustering analysis of a large data set. But now I want to find which is the right number of clusters. The clara.object gives some information like the ratio between maximal and minimal dissimilarity that says (maybe if lower than 1??) if a cluster is well-separated from the other. I've also read something
2010 Jun 14
1
Install Rmpi
Hi everyone As I couldn't succeed with manual installation of Rmpi I decided to start again from the beginning. I removed R and MPICH in my Ubuntu Hardy installation. Then, to avoid any dependencies problems I have installed MPICH and R from synaptic, not from sources. But now I can't install Rmpi. An error message appears when trying to install Rmpi, you can find in
2010 Jun 08
1
Problem installing Rmpi
Hi everyone I want to install Rmpi to use R in parallel mode in a Linux cluster (Ubuntu, Hardy Heron). It seems to be properly installed but a problem appears when loading Rmpi library. R version 2.11.1 (2010-05-31) > library("Rmpi") Error: package 'Rmpi' was built before R 2.10.0: please re-install it Should I remove R-2.11 and install R-2.10? I have tried to
2007 Apr 25
1
Help with saptial analysis (cluster)
Hi R-users I'm a beginner with R and statistics, so I need some help to start my data analysis. I've been reading some docs and tutorials on R and cluster analysis. I've got a large dataset (102000 points) with values of longitude, latitude and temperature and want to see if I can find groups (clusters). Following some tutorials I can look for principal components but get an error
2008 Jun 23
2
Handle missing values
Hi everyone I am new to R and have a question about missing values. I am trying to do a cluster analysis of monthly temperatures and my data are 14 columns with spatial coordinates (lat,lon) and 12 monthly values: /lat - lon - temp1 - //temp2 - temp3 - .... - //temp12/ If I omit missing values (my missing values are 99.00) with /mydata <- na.omit(mydata)/ every row with a
2003 Dec 11
0
AW: Paper on PAM and Clara
Maybe the following article is of interest for you (co-authored by P.J. Rousseeuw, like the book you mentioned): http://www.jstatsoft.org/v01/i04/paper/clus.pdf -Heinrich. > -----Urspr?ngliche Nachricht----- > Von: r-help-bounces at stat.math.ethz.ch > [mailto:r-help-bounces at stat.math.ethz.ch] Im Auftrag von Fucang Jia > Gesendet: Donnerstag, 11. Dezember 2003 14:37 > An:
2005 Jun 02
0
Selecting input and output variables in clara (cluster-package)
Moin, is there any possibility to choose the input variables which are involved in the analysis, " ..., and each column correspondends to a variable. ..." (R-Reference-Manual) or do I have to delete them from the data.frame? An then I'd like to get the original dataset with the classification of the cluster, at the moment i do it this way clusterFrame <-
2004 Jan 14
1
Using pam, agnes or clara as prediction models?
Hello list, I am new to R, so if the question is rather silly, please ignore it. I was wondering wether it would be possible to use the models generated by pam, clara and the like as predictors? Scanning through the available documentation shed no light (for me) upon the subject. Regards, Renald
2010 Nov 15
0
[LLVMdev] LLVM Software engineer at Intel, CA(Santa Clara or Folsom)
Resumes can be sent to Sujatha.gurumurthy at intel.com<mailto:Sujatha.gurumurthy at intel.com> LLVM Software engineer at Intel,CA(Santa Clara or Folsom) In this position, you will be responsible for designing and developing highly competitive OpenCL (Open Compute Language, a new industry standard for heterogeneous data and task parallel computing across GPU's and CPU's). You will
2006 Apr 10
2
passing known medoids to clara() in the cluster package
Greetings, I have had good success using the clara() function to perform a simple cluster analysis on a large dataset (1 million+ records with 9 variables). Since the clara function is a wrapper to pam(), which will accept known medoid data - I am wondering if this too is possible with clara() ... The documentation does not suggest that this is possible. Essentially I am trying to
2012 Feb 01
1
Reminder: Xen Hackathon hosted by Oracle, March 6-8, Santa Clara, CA, USA
Hi everybody, just a quick reminder that the Xen Hackathon is in Oracle, March 6-8, Santa Clara, CA, USA. We have originally planned for about 25 people, but already we have 14 people signed up. If you are planning to attend please sign up quickly, such that I get a feeling for the number of attendees. I can then go back to Oracle and see whether we can accommodate more people than we
2006 May 23
1
standardization of values before call to pam() or clara()
Greetings, Experimenting with the cluster package, and am starting to scratch my head in regards to the *best* way to standardize my data. Both functions can pre-standardize columns in a dataframe. according to the manual: Measurements are standardized for each variable (column), by subtracting the variable's mean value and dividing by the variable's mean absolute deviation. This
2007 Sep 12
3
Xen Summit #5 : 14/16 Nov 2007 Santa Clara
Folks, I''m happy to report that dates have been finalized for the next Xen Summit: November 14-16 2007; Santa Clara CA, USA Sun Microsystems have kindly agreed to host the event at their Santa Clara campus. The plan is to start at 1pm on Wednesday the 14th, then run until lunchtime on Friday. The next steps are to put together a program committee and send out a formal Call For
2007 Sep 12
3
Xen Summit #5 : 14/16 Nov 2007 Santa Clara
Folks, I''m happy to report that dates have been finalized for the next Xen Summit: November 14-16 2007; Santa Clara CA, USA Sun Microsystems have kindly agreed to host the event at their Santa Clara campus. The plan is to start at 1pm on Wednesday the 14th, then run until lunchtime on Friday. The next steps are to put together a program committee and send out a formal Call For
2005 May 30
2
How to access to sum of dissimilarities in CLARA
Dear All , Since dissimilarity is one of quality measures in clustering , I'm trying to access to the sum of dissimilarity as a whole measure. But after running my data using CLARA I obtain : 1128 dissimilarities, summarized : Min. 1st Qu. Median Mean 3rd Qu. Max. 0.033155 0.934630 2.257000 2.941600 4.876600 8.943700 But I can not find the sum of dissimilarity.How can i
2003 Nov 17
1
CLARA
I need informations about the clara routine. The on-line doc say that the argument stand is a logical, indicating if the measurements in x are standardized before calculating the dissimilarities. Measurements are standardized for each variable (column), by subtracting the variable's mean value and dividing by the variable's mean absolute deviation. If we note STAND = TRUE, I suppose that
2005 Aug 03
3
clara - memory limit
Dear all, I'm trying to estimate clusters from a very large dataset using clara but the program stops with a memory error. The (very simple) code and the error: mydata<-read.dbf(file="fnorsel_4px.dbf") my.clara.7k<-clara(mydata,k=7) >Error: cannot allocate vector of size 465108 Kb The dataset contains >3,000,000 rows and 15 columns. I'm using a windows computer
2005 Nov 14
1
Tidiest way of modifying S4 classes?
I wish to make modifications to the plot.pedigree function in the kinship package. My attempts to contact the maintainer have been unsuccessful, but my question is general, so specifics of the kinship package might not be an issue. My first attempt was to make a new function Plot.pedigree in the .GlobalEnv which mostly achieved what I wanted to. However, I'm sure that's not the tidiest