similar to: clv/cluster - cluster id vector format

Displaying 20 results from an estimated 40000 matches similar to: "clv/cluster - cluster id vector format"

2009 Jun 29
0
Naive knn question
Dear list, I have two dissimilarity matrices, one for a training data set which I then clustered using PAM. The second is a diss matrix for a validation data set (an independent field sample). I have been trying to use knn to distinguish distances between the validation data set and the 6 mediods of the training data defined by using PAM. I continue to get error messages in regards to either the
2002 Oct 23
0
Obtaining covariance matrices for kmeans output clusters
I am having trouble getting a covariance matrix for each cluster which is output by kmeans(). My input looks like: > imagedat <- read.table("table", header=TRUE) > imagedat Red Green Blue 0_0 5 7 8 1_0 5 5 18 2_0 7 8 49 3_0 22 8 76 4_0 54 10 67 5_0 50 9 28 6_0 18 10 15 7_0 9 7 6
2002 Aug 23
2
Surprising result from integrate
Hi all, sorry for this extensive question, but I think that I'm missing something fundamental. I stepped into a surprising result with the integrate function and I would be glad if someone could put some light onto this. I try to integrate over an s-shaped growth-function. The results from this calculation seem to be correct for small values of t. Just for fun I tried some large values and
2012 Jun 06
1
Data scientist // Berlin-based startup using probabilistic models in ecommerce
*Fluidshopping is a Berlin-based startup working on a customer analytics tool for online retailers. Customer Lifefitime Value (CLV) is the mythical 'magic number', the amount of money a particular customer will ever bring in. Knowing your CLV makes it trivial to: - optimize marketing spend for different inbound channels. - identify your highest value customers, - identify those in danger
2009 Mar 03
0
clust with "hclustWard"
Hello everyone I tried to use clust function using "hclustWard" clustering method The distance mesaured I have used is "rf" The data used is in the form of data frame (it is not a distance matrix) - sd.df The piece of code I have used is: c1 <- clust(sd.df,k=6,method="hclustWard",distMethod="rf") But I get the following error message, Error in if (n
2007 Nov 28
2
Clustering
Hello all! I am performingsome clustering analysis on microarray data using agnes{cluster} and I have created my own dissimilarity matrix according to a distance measure different from "euclidean" or "manhattan" etc. My question is, if I choose for example method="complete", how are the distances between the elements calculated? Are they taken form the dissimilarity
2010 Apr 11
0
cluster analysis :: urgent
hi, how can i do cluster analysis on spatial data? (longitude & latitude) Because i used the function clust of the clustTool package and it did'nt work at all: cl <- clust(dv,3,method="hclustAverage",distMethod="euclidean") thanks a lot Karine HEERAH Master 2 , océanographie et environnements marins Université Pierre et Marie Curie (Paris 6)
2009 Jul 24
1
Making rq and bootcov play nice
I have a quick question, and I apologize in advance if, in asking, I expose my woeful ignorance of R and its packages. I am trying to use the bootcov function to estimate the standard errors for some regression quantiles using a cluster bootstrap. However, it seems that bootcov passes arguments that rq.fit doesn't like, preventing the command from executing. Here is an example:
2009 Nov 15
1
Problem building package for R 2.10.0 on Mac OS X
Hi I have submitted a package (rioja) to CRAN. It checks OK for all R versions and OS's except r-release-macosx-ix86 where it fails when checking the examples. Specifically, it fails because R can't find the package vegan which is needed in a function. Here is the snippet from the check results: ### Begin snippet checking examples ... ERROR Running examples in 'rioja-Ex.R'
2000 Mar 16
1
stepAIC and coxph objects with cluster(id)
Is it appropriate to use stepAIC (library MASS) with coxph objects (from library survival5) that use "cluster(id)"? It is my understanding that, when using "cluster(id)", we can test for sets of terms by using the methods in Wei et al., (1989; JASA, 84: 1065-1073), or as explained in pp. 53 and ff. of the survival.ps document. But if we use a likelihood ratio test instead
2004 Jun 10
1
question about similarities cluster using hierclust
my major is bioinformatics, and i'm trying to cluster ( agglomerate the closest pari of observations ) in R. i have already got my own similarities metric, but do not know how to clust it based on similarities instead of dissimilarities. since the help document of hierclust mentions the parameter "sim", which seems good to me, but it doesn't appear in the code of hierclust()
2008 Mar 26
0
out of colors in Mclust with 12 clusters
I'm running Mclust 3.0 in R-2.7.0 and have run into a situation where the optimal number of clusters, 12, seems to be greater than the number of colors available to Mclust. The code below, which demonstrates errors, does not if the max no of clusters is set at 10. sessionInfo, after a restart of R and loading of packages, follows the output. Is this a known problem with Mclust? Can I
2006 Sep 07
0
counting process form of a cox model (cluster(id))?
Hi, I am currently analysising a counting process form of a cox model allowing for the inclusion of time dependent covariates. An example model I have fitted is modlqol<-coxph(Surv(Tstart,Tstop,cens.time)~tmt.first+risk +lqol+cluster(id),data=cat) summary(modlqol) My question is quick. I am looking at 1 event (death), and repeated measurements (the time dependent covariate 'lqol')
2009 Oct 21
1
Use of the command 'replicate'
Dear R users, I'd like to ask u whether you know how to sort out the following: I'm trying to reproduce a dataset of clusters, and for that I need to build up a cluster index (inside a function) using the command "replicate" as follows: dataset<- function( clusters=100, cluster.size=50, outcome.mean=list(mean=3, sd=1), outcome.var=list(mean=0, sd=1), treat=1, unbal =
2007 Jul 04
0
Kmeans performance difference
Hi All, A question from a newbie using R 2-5-0 on windows XP. Why is it that kmeans clustering with apparently the exact same parameters behaves so differently between the two following examples : > cl1 <- kmeans(subset(pointsUXO15555, select = c(2:4)), 10) Takes about 2 seconds to deliver a result > cl1 <- clust(subset(pointsUXO15555, select = c(2:4)), k=10,
2011 Sep 13
2
help with hclust and cutree
Hello, I would like to cut a hclust tree into several groups at a specific similarity. I assume this can be achieved by specifying the "h" argument with the specified similarity, e.g.: clust<-hclust(dist,"average") cut<-cutree(clust,h=0.65) Now, I would like to draw rectangles around the branches of the dendrogram highlighting the corresponding clusters, as is done by
2009 Apr 09
1
Random Cluster Generation Question
Hello, I am fairly new to R, but I am not new to programming at all. I want to generate random clusters in a 1,000x1,000 box such that I end up with a total of about 2,000 points. Once done, I need to export the X,Y coordinates of the points. I have looked around, and it seems that the spatstat package has what I need. The rMatClust() function can generate random clusters, but I have run into
2006 Jan 17
0
Cannot convert from phylo to hclust , error!!???
Hello, The following code does'nt work for me. The last command reports an error. I have created a consensus tree using the consensus comand from phylo but cannot manipulate the phylo object afterwards to create a dendogram , by transforming the phylo object into a hclust object and then into a dendogram ?? Thanks for any help library(ade4) library(cluster) library(stats) library(ape)
2009 Jul 24
1
Fwd: Making rq and bootcov play nice
John, You can make a local version of bootcov which either: deletes these arguments from the call to fitter, or modify the switch statement to include rq.fit, the latter would need to also modify rq() to return a fitFunction component, so the first option is simpler. One of these days I'll incorporate clustered se's into summary.rq, but meanwhile this seems to be a good alternative.
2003 Apr 24
1
estimating number of clusters ("Null or more")
Hi all, once more about the old subj :-) My data has too much various distribution families and for every particular experiment I need just to decide whether the data is "quite homogeneous" or it has two or more clusters. I've revisited the following libraries: amap, clust, cclust, mclust, multiv, normix, survey. And I didn't find any ready-to-use general