Displaying 20 results from an estimated 3000 matches similar to: "k-nn hierarchical clustering"
2010 Aug 06
1
Latex errors when checking package
Dear listers,
I just ran R CMD check on an update of one of my packages.
All seems fine but after having gone through all the Rd-file and example 
checking and so on, I get the following kind of errors:
LaTeX errors when creating PDF version.
This typically indicates Rd problems.
LaTeX errors found:
! Font T1/ptm/m/n/10=ptmr8t at 10.0pt not loadable: Metric (TFM) file not 
found
.
<to be read
2005 Sep 29
5
Regression slope confidence interval
Hi list,
is there any direct way to obtain confidence intervals for the regression
slope from lm, predict.lm or the like?
(If not, is there any reason? This is also missing in some other statistics
software packages, and I thought this would be quite a standard application.)
I know that it's easy to implement but it's for
explanation to people who faint if they have to do their own
programming...
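For the slope interval asked about here, base R's confint() applied to an lm fit is one direct route. The sketch below uses the built-in cars data set purely as a stand-in; the model and variable names are illustrative assumptions, not part of the original question.

```r
# Illustrative data: the built-in 'cars' data set (an assumption, not from the post)
fit <- lm(dist ~ speed, data = cars)

# 95% confidence interval for the slope coefficient only
slope_ci <- confint(fit, parm = "speed", level = 0.95)
print(slope_ci)

# The same interval by hand: estimate +/- t quantile * standard error
est <- coef(summary(fit))["speed", "Estimate"]
se  <- coef(summary(fit))["speed", "Std. Error"]
manual_ci <- est + c(-1, 1) * qt(0.975, df.residual(fit)) * se
```

The manual version is included only to show what confint() computes for a linear model; in practice the one-line call should be enough.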
2006 Aug 09
2
R CMD check error
Dear list,
R CMD check on my updated package now generated the following error:
"LaTeX errors when creating DVI version.
This typically indicates Rd problems."
But the Rd files (and everything else) were checked as "OK" (I 
removed the problem about which I asked the list some hours ago, but
answers are still appreciated because I rather created a rough 
workaround than
2010 Oct 10
1
Package "prabclus" not available?
Hi there,
I just tried to install the package prabclus on a computer running Ubuntu 
Linux 9.04 using install.packages from within R.
This gave me a message:
Warning message:
In install.packages("prabclus") : package 'prabclus' is not available
I tried to do this selecting two different CRAN mirrors (same result) and 
with other packages (installing them works fine).
Looking up the
2006 Aug 18
2
R-update - what about packages and ESS?
Hi there,
it seems that if I update R, it doesn't find previously installed packages 
anymore and is also not found by ESS.
Actually the update has been done by our system administrator who assumed 
that there would be no problems with these things (I don't have root 
access to this system) and will perhaps not be too keen on installing
everything else again.
Is there any simple way how
2010 Sep 01
2
Rd-file error: non-ASCII input and no declared encoding
Dear list,
I came across the following error for three of my newly written Rd-files:
non-ASCII input and no declared encoding
I can't make sense of this.
Below I copied in one of the three files.
Can anybody please tell me what's wrong with it?
Thank you,
Christian
\name{tetragonula}
\alias{tetragonula}
\alias{tetragonula.coord}
\docType{data}
% \non_function{}
\title{Microsatellite
2012 Aug 21
1
R CMD build error with data files
Dear list,
I want to update my prabclus package which I haven't done for quite a 
while.
In the previous version, I had .dat files in my data subdirectory, which I 
read using .R files. Now R CMD check gives me a warning that .dat files 
are no longer accepted there.
So I changed my filenames to .txt, but actually some of these files are 
only there in order to be read by .R, not in order
2008 Jun 13
3
cluster.stats
Dear list,
I just tried to use the function cluster.stats in the package fpc.
I just have a couple of questions about the syntax:
cluster.stats(d,clustering,alt.clustering=NULL,
silhouette=TRUE,G2=FALSE,G3=FALSE)
1) the distance object (d) is an object obtained by the function dist() on
my own original matrix?
2) clustering is the vector of cluster labels produced by one of the many clustering
methods?
2010 Jan 19
1
Sampling theory
Hi there,
are there any R-packages for computations required in sampling theory 
(such as confidence intervals under random, stratified, cluster sampling; 
I'd be particularly interested in confidence intervals for the population 
variance, which is difficult enough to find even in books)?
Thanks,
Christian
*** --- ***
Christian Hennig
University College London, Department of Statistical
2006 Aug 02
1
Summary method needed?
Hi list,
I'm updating my fpc package at the moment and will add some new functions. 
I learned that there should be print and summary methods for the key
functions.
The purpose of the summary methods seems to be to reduce the 
possibly incredibly complex information in the function's output and the 
print method (print.summary.foo) should print an overview of the result.
But in some
2007 Nov 05
1
order a matrix
Dear list,
order(x,y,z) returns a permutation to order x, ties broken by y, remaining 
ties broken by z. (And so on.)
What I'd like to do is
order(X), where X is a matrix (or a list or data frame if necessary) of 
unspecified size, which orders X[,1], ties broken by X[,2], remaining ties 
broken by X[,3] and so on - without having to know and to write down how 
many columns X has.
Any
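One way to get this behaviour, sketched under the assumption that X is a matrix or data frame with at least one column, is to hand all columns to order() through do.call(). The helper name order_rows and the example matrix are hypothetical.

```r
# Order the rows of a matrix by column 1, ties broken by column 2, and so on,
# without hard-coding the number of columns.
order_rows <- function(X) {
  # Build an unnamed list holding one vector per column...
  cols <- lapply(seq_len(ncol(X)), function(j) X[, j])
  # ...and pass the whole list to order() as its arguments.
  do.call(order, cols)
}

# Hypothetical example matrix with ties in the first two columns
X <- rbind(c(2, 1, 3),
           c(1, 2, 2),
           c(1, 1, 9),
           c(2, 1, 1))
perm <- order_rows(X)
X[perm, ]
```

The list is left unnamed on purpose, so that no column name can be mistaken for one of order()'s own arguments such as decreasing.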
2008 Sep 19
1
intToUtf8
Hi there,
any explanation for this?
> intToUtf8(66)
Error in intToUtf8(66) : argument 'x' must be an integer vector
> intToUtf8(c(66,55))
Error in intToUtf8(c(66, 55)) : argument 'x' must be an integer vector
> intToUtf8(c(66,55),multiple=TRUE)
Error in intToUtf8(c(66, 55)) : argument 'x' must be an integer vector
Errr... 66 and c(66,55) are as integer vectorish
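One plausible reading of these errors is that a bare literal such as 66 is stored as a double in R, so a version of intToUtf8 that checks the storage type strictly would reject it. Assuming that is the cause, integer literals or as.integer() sidestep the problem (more recent R versions may coerce integer-valued doubles on their own).

```r
is.integer(66)     # FALSE: a bare numeric literal is a double
is.integer(66L)    # TRUE: the L suffix creates an integer

# Single code point
one <- intToUtf8(66L)
# One string built from two code points
both <- intToUtf8(as.integer(c(66, 55)))
# One string per code point
split <- intToUtf8(as.integer(c(66, 55)), multiple = TRUE)
```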
2003 Dec 03
3
non-uniqueness in cluster analysis
Hi,
I'm clustering objects defined by categorical variables with a hierarchical
algorithm - average linkage.
My distance matrix (general dissimilarity coefficient) includes several
distances with exactly the same values.
As I see, a standard agglomerative procedure ignores this problem, simply
selecting, among equal distances, the one that comes first.
For this reason the analysis in output
2003 Dec 11
1
cutree with agnes
Hi,
this is rather a (presumed) bug report than a question because I can solve
my personal statistical problem by working with hclust instead of agnes. 
I have done a complete linkage clustering on a dist object dm with 30
objects with agnes (R 1.8.0 on
RedHat) and I want to obtain the partition that results from a cut at
height=0.4.
I run
> cl1a <- agnes(dm, method="complete")
2011 May 11
2
hierarchical clustering within a size limit
Hello List,
I am trying to implement hierarchical clustering using hclust with the
agglomerative single linkage method, with a small wrinkle. I would like to
cluster a set of numbers on a number line only if they are within a distance
of 500. I would then like to print out the members of this list.
So far I can put a vector:
> x<-c(2,10,200,300,600,700)
into a distance matrix:
>
2010 Apr 24
4
DICE Coefficient of similarity measure
Hi,
 
I wanted the DICE coefficient (similarity measure for binary variables)
to be calculated in R and found that the "igraph" package has the option
of "similarity.dice" to do this. But, for this command, the input object
should be an igraph object. But, I have a dataframe of columns
containing 1's and 0's. Can I convert this dataframe into an igraph
object, so that
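For a data frame of 0/1 columns, the Dice coefficient can also be computed directly in base R, without converting to an igraph object. This is a minimal sketch: the helper name dice and the example data frame are hypothetical, and it assumes the coefficient is wanted between columns.

```r
# Dice coefficient for two binary (0/1) vectors: 2 * |A and B| / (|A| + |B|)
dice <- function(a, b) {
  2 * sum(a == 1 & b == 1) / (sum(a == 1) + sum(b == 1))
}

# Hypothetical example data frame with 0/1 columns
df <- data.frame(v1 = c(1, 1, 0, 1, 0),
                 v2 = c(1, 0, 0, 1, 1),
                 v3 = c(0, 0, 1, 0, 0))

# All pairwise coefficients at once
m <- as.matrix(df)
common <- crossprod(m)   # common[i, j] = number of rows where both columns are 1
sizes  <- colSums(m)     # number of 1's per column
dice_matrix <- 2 * common / outer(sizes, sizes, "+")
```

To compare rows instead of columns, the same matrix computation can be applied to t(m).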
2005 Aug 08
2
selecting outliers
Hi everybody,
I'd like to know if there's an easy way of extracting
outlier records from a dataset, in order to perform
further analysis on them.
Thanks
Alessandro
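What counts as an outlier depends on the analysis, but one common convention is the boxplot rule, which base R exposes through boxplot.stats(). The sketch below assumes a single numeric variable is being screened; the data frame dat and its columns are hypothetical.

```r
# Hypothetical data set; 'value' is the variable screened for outliers
dat <- data.frame(id = 1:6, value = c(1, 2, 3, 4, 5, 100))

# boxplot.stats() flags points lying more than 1.5 times the hinge spread
# beyond the lower or upper hinge
out_values <- boxplot.stats(dat$value)$out

# Keep the full records of the flagged observations for further analysis
outliers <- dat[dat$value %in% out_values, ]
```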
2010 Jul 02
2
K-means result - variance between cluster
Hi,
I would like to present the results from the k-means clustering method in
terms of variances: within and between clusters. The k-means object
gives only the within-cluster sum of squares by cluster, so the
between-cluster part is missing for calculating the following table, which
I am trying to produce.
Number of | Variance within | Var between | Var total | F-value
Cluster k | cluster         | cluster    
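The missing between-cluster part can be recovered from the identity total SS = within SS + between SS. The following is a sketch under assumptions: the iris measurements stand in for the poster's data, and the F-value is taken to be the usual pseudo-F ratio, which may not be the definition the poster had in mind.

```r
set.seed(1)                     # k-means starts from random centres
x <- as.matrix(iris[, 1:4])     # illustrative data (an assumption)
km <- kmeans(x, centers = 3)

# Total sum of squares around the overall mean
total_ss <- sum(scale(x, center = TRUE, scale = FALSE)^2)
# Pooled within-cluster sum of squares
within_ss <- sum(km$withinss)
# Between-cluster sum of squares, by subtraction
between_ss <- total_ss - within_ss

k <- length(km$size)
n <- nrow(x)
# Pseudo-F: between and within sums of squares over their degrees of freedom
f_value <- (between_ss / (k - 1)) / (within_ss / (n - k))
```

Repeating this for several values of k gives one row of the table per k.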
2006 Jun 27
2
Random numbers negatively correlated?
Dear list,
I did simulations in which I generated 10000
independent Bernoulli(0.5)-sequences of length 100. I estimated
p for each sequence and I also estimated the conditional probability that 
a one is followed by another one (which should be p as well).
However, the second probability is significantly smaller than 0.5 (namely
about 0.494, see below) and of course smaller than the direct