thr3ads.net - similar to: "k-nn hierarchical clustering"

Displaying 20 results from an estimated 3000 matches similar to: "k-nn hierarchical clustering"

2010 Aug 06

Latex errors when checking package

Dear listers, I just ran R CMD check on an update of one of my packages. All seems fine, but after it has gone through all the Rd-file and example checking and so on, I get the following kind of errors: LaTeX errors when creating PDF version. This typically indicates Rd problems. LaTeX errors found: ! Font T1/ptm/m/n/10=ptmr8t at 10.0pt not loadable: Metric (TFM) file not found . <to be read

2005 Sep 29

Regression slope confidence interval

Hi list, is there any direct way to obtain confidence intervals for the regression slope from lm, predict.lm or the like? (If not, is there any reason? This is also missing in some other statistics softwares, and I thought this would be quite a standard application.) I know that it's easy to implement but it's for explanation to people who faint if they have to do their own programming...
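
One possible answer, as a minimal sketch: in current versions of R, `confint()` applied to an `lm` fit returns confidence intervals for the coefficients, including the slope. The data below are simulated purely for illustration and are not from the original post.

```r
# Simulated data for illustration only (an assumption, not from the post)
set.seed(1)
x <- 1:20
y <- 2 + 0.5 * x + rnorm(20)

fit <- lm(y ~ x)

# 95% confidence interval for the slope (the coefficient named "x")
slope_ci <- confint(fit, parm = "x", level = 0.95)
print(slope_ci)

# The same interval computed by hand from the coefficient table
est <- coef(summary(fit))["x", "Estimate"]
se  <- coef(summary(fit))["x", "Std. Error"]
manual_ci <- est + c(-1, 1) * qt(0.975, df = df.residual(fit)) * se
```

The manual version is only there to show what `confint()` computes for a linear model: estimate plus or minus a t quantile times the standard error.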

2006 Aug 09

R CMD check error

Dear list, R CMD check on my updated package now generated the following error: "LaTeX errors when creating DVI version. This typically indicates Rd problems." But the Rd files (and everything else) were checked as "OK" (I removed the problem about which I asked the list some hours ago, but answers are still appreciated because I rather created a rough workaround than

2010 Oct 10

Package "prabclus" not available?

Hi there, I just tried to install the package prabclus on a computer running Ubuntu Linux 9.04 using install.packages from within R. This gave me a message: Warning message: In install.packages("prabclus") : package 'prabclus' is not available I tried to do this selecting two different CRAN mirrors (same result) and with other packages (installing them works fine). Looking up the

2006 Aug 18

R-update - what about packages and ESS?

Hi there, it seems that if I update R, it doesn't find previously installed packages anymore and is also not found by ESS. Actually the update has been done by our system administrator who assumed that there would be no problems with these things (I don't have root access to this system) and will perhaps not be too keen on installing everything else again. Is there any simple way how

2010 Sep 01

Rd-file error: non-ASCII input and no declared encoding

Dear list, I came across the following error for three of my newly written Rd-files: non-ASCII input and no declared encoding I can't make sense of this. Below I copied in one of the three files. Can anybody please tell me what's wrong with it? Thank you, Christian \name{tetragonula} \alias{tetragonula} \alias{tetragonula.coord} \docType{data} % \non_function{} \title{Microsatellite

2012 Aug 21

R CMD build error with data files

Dear list, I want to update my prabclus package which I haven't done for quite a while. In the previous version, I had .dat files in my data subdirectory, which I read using .R files. Now R CMD check gives me a warning that .dat files are no longer accepted there. So I changed my filenames to .txt, but actually some of these files are only there in order to be read by .R, not in order

2008 Jun 13

cluster.stats

Dear list, I just tried to use the function cluster.stats in the package fpc. I just have a couple of questions about the syntax: cluster.stats(d,clustering,alt.clustering=NULL, silhouette=TRUE,G2=FALSE,G3=FALSE) 1) Is the distance object (d) an object obtained by applying the function dist() to my own original matrix? 2) Is clustering the vector of cluster labels resulting from one of the many clustering methods?
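
The usage being asked about could look roughly like the sketch below. The data set (`USArrests`), the average-linkage method and `k = 3` are assumptions chosen for illustration. The call to `cluster.stats()` is guarded because fpc may not be installed.

```r
# Example data: built-in USArrests (an assumption for illustration)
x <- scale(USArrests)

d  <- dist(x)                          # distance object from dist()
hc <- hclust(d, method = "average")    # any clustering method would do
clustering <- cutree(hc, k = 3)        # integer vector of cluster labels 1..3

# Only run the fpc call if the package is available
if (requireNamespace("fpc", quietly = TRUE)) {
  stats <- fpc::cluster.stats(d, clustering)
  print(stats$avg.silwidth)            # average silhouette width
}
```

The point of the sketch is the shape of the two inputs: `d` is a `dist` object and `clustering` has one integer label per observation.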

2010 Jan 19

Sampling theory

Hi there, are there any R packages for computations required in sampling theory (such as confidence intervals under random, stratified, cluster sampling; I'd be particularly interested in confidence intervals for the population variance, which is difficult enough to find even in books)? Thanks, Christian *** --- *** Christian Hennig University College London, Department of Statistical

2006 Aug 02

Summary method needed?

Hi list, I'm updating my fpc package at the moment and will add some new functions. I learned that there should be print and summary methods for the key functions. The purpose of the summary methods seems to be to reduce the possibly incredibly complex information in the function's output and the print method (print.summary.foo) should print an overview of the result. But in some
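
The convention described here can be sketched as follows. The class name `foo` and its fields are hypothetical and stand in for any key function of a package; this is not code from fpc.

```r
# Hypothetical key function returning an object of class "foo"
foo <- function(x) {
  structure(list(data = x, mean = mean(x), sd = sd(x)), class = "foo")
}

# summary method: reduce the full result to the essentials
summary.foo <- function(object, ...) {
  structure(list(n = length(object$data), mean = object$mean, sd = object$sd),
            class = "summary.foo")
}

# print method for the summary object: show a short overview
print.summary.foo <- function(x, ...) {
  cat("foo result:", x$n, "observations\n")
  cat("mean =", format(x$mean), " sd =", format(x$sd), "\n")
  invisible(x)
}

res <- foo(c(1, 2, 3, 4))
s <- summary(res)   # computes the reduced object
print(s)            # dispatches to print.summary.foo
```

Keeping computation in `summary.foo` and display in `print.summary.foo` lets users store and reuse the summary object without printing it.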

2007 Nov 05

order a matrix

Dear list, order(x,y,z) returns a permutation to order x, ties broken by y, remaining ties broken by z. (And so on.) What I'd like to do is order(X), where X is a matrix (or a list or data frame if necessary) of unspecified size, which orders X[,1], ties broken by X[,2], remaining ties broken by X[,3] and so on - without having to know and to write down how many columns X has. Any
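
One common idiom for this, shown as a sketch rather than as the answer given on the list, is to pass the columns to `order()` through `do.call()`. The matrix `X` below is made up for illustration.

```r
# Small example matrix with ties in the first and second columns
X <- matrix(c(2, 1, 2, 1,
              5, 3, 5, 3,
              9, 8, 7, 6), ncol = 3)

# as.data.frame() turns the columns into a list, and do.call() passes
# them to order() as separate arguments, however many there are
perm <- do.call(order, as.data.frame(X))

X[perm, ]
```

For this `X` the result matches `order(X[,1], X[,2], X[,3])`, without the number of columns being written out.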

2008 Sep 19

intToUtf8

Hi there, any explanation for this?
> intToUtf8(66)
Error in intToUtf8(66) : argument 'x' must be an integer vector
> intToUtf8(c(66,55))
Error in intToUtf8(c(66, 55)) : argument 'x' must be an integer vector
> intToUtf8(c(66,55),multiple=TRUE)
Error in intToUtf8(c(66, 55)) : argument 'x' must be an integer vector
Errr... 66 and c(66,55) are as integer vectorish
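
A likely explanation, stated as an assumption because the replies are not shown: numeric literals such as `66` are stored as doubles in R, and the version used in the post apparently checked the storage type. Converting explicitly avoids the question altogether, as in this sketch:

```r
x <- c(66, 55)        # stored as double, not integer
is.integer(x)         # FALSE

xi <- as.integer(x)   # explicit conversion; c(66L, 55L) would also work
single <- intToUtf8(xi)                    # one string built from both code points
chars  <- intToUtf8(xi, multiple = TRUE)   # one string per code point
```

Here `single` is `"B7"` and `chars` is `c("B", "7")`, since 66 and 55 are the code points of `B` and `7`.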

2003 Dec 03

non-uniqueness in cluster analysis

Hi, I'm clustering objects defined by categorical variables with a hierarchical algorithm - average linkage. My distance matrix (general dissimilarity coefficient) includes several distances with exactly the same values. As I see, a standard agglomerative procedure ignores this problem, simply selecting, among equal distances, the one that comes first. For this reason the analysis in output

2003 Dec 11

cutree with agnes

Hi, this is rather a (presumed) bug report than a question because I can solve my personal statistical problem by working with hclust instead of agnes. I have done a complete linkage clustering on a dist object dm with 30 objects with agnes (R 1.8.0 on RedHat) and I want to obtain the partition that results from a cut at height=0.4. I run > cl1a <- agnes(dm, method="complete")
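
In current versions of the cluster package, one way to cut an `agnes` tree at a given height is to convert it with `as.hclust()` first. The sketch below assumes that route. The `dm` object from the post is not available, so distances on built-in data and a cut height of 2 are used as stand-ins.

```r
library(cluster)  # provides agnes()

# Stand-in distances on built-in data (dm in the post is not available)
dm <- dist(scale(USArrests[1:30, ]))

cl1a <- agnes(dm, method = "complete")

# Convert to an hclust object, then cut the tree at a given height
h_cut <- 2
part_agnes <- cutree(as.hclust(cl1a), h = h_cut)

# For comparison: the same kind of clustering via hclust
part_hclust <- cutree(hclust(dm, method = "complete"), h = h_cut)

table(part_agnes, part_hclust)
```

The cross-table is a quick check that both routes lead to the same partition for the data at hand.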

2003 Dec 11

cutree with agnes

2011 May 11

hierarchical clustering within a size limit

Hello List, I am trying to implement a hierarchical cluster using the hclust method agglomerative single linkage method with a small wrinkle. I would like to cluster a set of numbers on a number line only if they are within a distance of 500. I would then like to print out the members of this list. So far I can put a vector: > x<-c(2,10,200,300,600,700) into a distance matrix: >
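
A sketch of how the steps described might continue, using the vector from the post. How "within a distance of 500" is meant is not clear from the truncated text, so two readings are shown. The complete-linkage variant is an addition for comparison and not part of the original question.

```r
x <- c(2, 10, 200, 300, 600, 700)
d <- dist(x)

# Single linkage, as in the post: merges continue while the nearest
# gap between two clusters is at most the cut height
groups_single <- cutree(hclust(d, method = "single"), h = 500)

# Complete linkage: cutting at 500 keeps every within-cluster distance <= 500
groups_complete <- cutree(hclust(d, method = "complete"), h = 500)

# Print the members of each cluster
split(x, groups_single)
split(x, groups_complete)
```

With these six numbers the largest gap between neighbours is 300, so single linkage cut at 500 puts everything into one cluster, while complete linkage separates 600 and 700 from the rest.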

2010 Apr 24

DICE Coefficient of similarity measure

Hi, I wanted the DICE coefficient (similarity measure for binary variables) to be calculated in R and found that the "igraph" package has the option of "similarity.dice" to do this. But, for this command, the input object should be an igraph object. But, I have a dataframe of columns containing 1's and 0's. Can I convert this dataframe into an igraph object, so that
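
As an alternative that avoids the conversion to an igraph object, the Dice coefficient 2|A∩B| / (|A| + |B|) can be computed directly from the 0/1 columns. The data frame and the helper name `dice_matrix` below are hypothetical, and the sketch assumes that the columns are the items to compare.

```r
# Hypothetical binary data frame; columns are the variables to compare
df <- data.frame(a = c(1, 1, 0, 1, 0),
                 b = c(1, 0, 0, 1, 1),
                 c = c(0, 0, 1, 0, 1))

dice_matrix <- function(df) {
  m <- as.matrix(df)
  both <- crossprod(m)     # entry [i, j]: number of rows where both columns are 1
  ones <- colSums(m)       # number of 1's per column
  2 * both / outer(ones, ones, "+")
}

D <- dice_matrix(df)
print(D)
```

To compare rows instead of columns, `dice_matrix(t(df))` would apply. A column consisting only of zeros gives a division by zero on the diagonal and would need separate handling.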

2005 Aug 08

selecting outliers

Hi everybody, I'd like to know if there's an easy way of extracting outlier records from a dataset, in order to perform further analysis on them. Thanks Alessandro
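
What counts as an outlier depends on the analysis, so the following is only one possible sketch. It uses the boxplot rule (values more than 1.5 times the interquartile range beyond the hinges) on a made-up data frame.

```r
# Illustrative data frame with two artificially extreme values
dat <- data.frame(id = 1:12,
                  value = c(10, 12, 11, 13, 12, 11, 10, 13, 12, 11, 40, -20))

# Values flagged by the boxplot rule (beyond 1.5 * IQR from the hinges)
out_values <- boxplot.stats(dat$value)$out

# Extract the full records for further analysis
outliers <- dat[dat$value %in% out_values, ]
print(outliers)
```

The extracted rows keep all columns of the original record, so they can be passed on to any further analysis.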

2010 Jul 02

K-means result - variance between cluster

Hi, I would like to present the results of the k-means clustering method in terms of variances within and between clusters. The k-means object gives only the within-cluster sum of squares for each cluster, so the between-cluster part needed to calculate the following table, which I am trying to produce, is missing.
Number of | Variance within | Var between | Var total | F-value
Cluster k | cluster | cluster
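
The missing part follows from the decomposition total SS = within SS + between SS. The sketch below works with sums of squares rather than variances. The data (`iris`), `k = 3` and the ANOVA-style F ratio are assumptions for illustration.

```r
# Example data and k are assumptions for illustration
x <- as.matrix(iris[, 1:4])
k <- 3
set.seed(42)
km <- kmeans(x, centers = k, nstart = 10)

n <- nrow(x)
within  <- sum(km$withinss)                 # pooled within-cluster sum of squares
total   <- sum(scale(x, scale = FALSE)^2)   # total sum of squares about the grand mean
between <- total - within                   # between-cluster sum of squares

# One-way ANOVA style ratio of mean squares
f_value <- (between / (k - 1)) / (within / (n - k))

data.frame(k = k, within = within, between = between, total = total, F = f_value)
```

Recent versions of R also return `totss` and `betweenss` in the `kmeans` object, which can be used to cross-check the manual calculation.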

2006 Jun 27

Random numbers negatively correlated?

Dear list, I did simulations in which I generated 10000 independent Bernoulli(0.5)-sequences of length 100. I estimated p for each sequence and I also estimated the conditional probability that a one is followed by another one (which should be p as well). However, the second probability is significantly smaller than 0.5 (namely about 0.494, see below) and of course smaller than the direct
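
The simulation described can be sketched as follows. The original code is not shown, so the estimator used here (the proportion of ones among the successors of ones within each sequence, averaged over sequences) is an assumption about what was done.

```r
set.seed(123)
n_seq <- 10000   # number of sequences, as in the post
len   <- 100     # length of each sequence

sims <- replicate(n_seq, {
  x <- rbinom(len, size = 1, prob = 0.5)
  prev <- x[-len]          # positions 1..99
  nxt  <- x[-1]            # positions 2..100
  c(p_hat    = mean(x),
    cond_hat = sum(prev == 1 & nxt == 1) / sum(prev == 1))
})

# Averages of the per-sequence estimates
avg <- rowMeans(sims, na.rm = TRUE)
print(avg)
```

If the effect reproduces, it need not point to a flaw in the random number generator: the per-sequence conditional estimate is a ratio with a random denominator, and averaging such ratios over short sequences is not guaranteed to give 0.5 even for independent draws.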