search for: euclidian

Displaying 20 results from an estimated 39 matches for "euclidian".

Did you mean: euclidean
2010 Jul 20
1
p-values pvclust maximum distance measure
.... I uploaded a PDF showing the results Here is the code which produces the PDF file: ------------------------------------------------------------------------------------- s <- matrix(runif(1600,0,1000), nrow=20) a.res1 <- pvclust(t(s), method.hclust="complete", method.dist="euclidian", nboot=500) a.res2 <- pvclust(t(s), method.hclust="complete", method.dist="maximum", nboot=500) a.res3 <- pvclust(t(s), method.hclust="complete", method.dist="canberra", nboot=500) a.res4 <- pvclust(t(s), method.hclust="ward", method....
2001 Dec 13
2
k-means with euclidian distance but no coordinates
...sure about that -- I would have preferred a k dimensional space with a binary cluster in each dimension so a word can belong to 0..k clusters, but I digress... I can measure the strength of correlation between words fairly easily by counting cooccurance divided by frequency of each word, giving a euclidian distance, although this doesn't work especially well for rare words. However I don't have coordinates as such, and deriving them given distance is non-trivial. Now, as I understand k-means, it uses euclidian distance rather than coordiantes, the first step given in texts is to derive the...
2016 May 05
2
GSoC 2016 - Introduction
...rough the previous clustering API to understand how it worked and it seems like the the approach for construction of the termlists which are used for distance metrics use TF-IDF weighting with cosine similarity, which is very similar to the approach I would need for this project. Just in this case, euclidian distance would be the metric. Would it be good to structure it in a way similar to the previous API with a few changes? For example, the Xapian::DocSimCosine::similarity( ) function in itself calculates the tf idf vectors and calculates the similarity. Instead would it be possible to have a custo...
2011 Apr 18
3
how to extract options for a function call
..."CG", "L-BFGS-B", "SANN") in the optim function. So I would like to have a vector with c("Nelder-Mead", "BFGS", "CG", "L-BFGS-B", "SANN") Or for instance the 'method' in the dist function can be supplied with euclidian,maximum,manhatten, canberra,binary,minkowski. Is there someway to extract this so I would have a vector with c("euclidian","maximum","manhatten", "canberra","binary","minkowski"). Thanks
2007 Sep 02
1
buglet in dist() ?
the first line of dist() says if (!is.na(pmatch(method, "euclidian"))) shouldn't that be "euclidean" ? --------------------- R version 2.5.1 (2007-06-27) i486-pc-linux-gnu locale: LC_CTYPE=en_US.UTF-8;LC_NUMERIC=C;LC_TIME=en_US.UTF-8;LC_COLLATE=en_US.UTF-8;LC_MONETARY=en_US.UTF-8;LC_MESSAGES=en_US.UTF-8;LC_PAPER=en_US.UTF-8;LC_NAME=C;LC_ADD...
2005 Sep 12
4
Document clustering for R
I'm working on a project related to document clustering. I know that R has clustering algorithms such as clara, but only supports two distance metrics: euclidian and manhattan, which are not very useful for clustering documents. I was wondering how easy it would be to extend the clustering package in R to support other distance metrics, such as cosine distance, or if there was an API for custom distance metrics. Best regards, Raymond Pon pon3 at llnl.go...
2016 Jul 26
3
K MEANS clustering
...ich I'm not able to find a solution to. Since we are representing documents as Tf-idf vectors, they are really sparse vectors (a usual corpus can have around 5000 terms). So it gets really difficult to represent these sparse vectors in a way that would be computationally efficient to calculate euclidian distances. I had implemented a K-Medioids algorithm using PAM just to try it out, after modifying the API for whatever more was required, and that seems fine, since we are dealing with document vectors and not arbitrary vectors. But with KMeans, I am not able to figure out how to represent these ce...
2016 Jul 27
2
K MEANS clustering
Hey Parth, Thanks for the reply. I am considering implementing a cosine distance metric too, along with euclidian distance because of the dimensionality issue that comes in with K-Means and euclidian distance metric. That does help when we deal with sparse vectors for documents. The particular problem I'm having is representing centroids in an efficient way. For example, when we find the mean vector of a c...
2006 Apr 03
2
about arguments in "bclust"
Hi All, Just want to make sure, in function "bclust", do the following argument only have one option? argument "dist.method" has one option "Euclidian"; argument "hclust.method" has one option "average"; argument "base.method" has one option "kmeans". Thank you! [[alternative HTML version deleted]]
2008 Sep 16
1
Spatial join – optimizing code
Hi, Few days ago I have asked about spatial join on the minimum distance between 2 sets of points with coordinates and attributes in 2 different data frames. Simon Knapp sent code to do it when calculating distance on a sphere using lat, long coordinates and I've change his code to use Euclidian distances since my data had UTM coordinates. Typically one data frame has around 30 000 points and the classification data frame has around 4000 points, and the aim is to add to each point from the first data frame all the attributes from the second data frame of the point that is closest to it....
2016 Jun 09
2
2nd week progress
Hello devs, I have filled out the repo link on TRAC as suggested. I'll also keep the journal updated on TRAC from now on. I am almost done with defining all the base classes required for the clusterer and have started coding the euclidian distance metric. This should be completed by tomorrow after which I'll be spending one day to test and make sure everything functions as expected, so that I can have feedback from the community on areas to improve on. Thanks -------------- next part -------------- An HTML attachment was scrubb...
1999 Jan 20
2
dist function suggestion
This message is in MIME format. The first part should be readable text, while the remaining parts are likely unreadable without MIME-aware tools. Send mail to mime@docserver.cac.washington.edu for more info. ---559023410-162216788-916833047=:29339 Content-Type: TEXT/PLAIN; charset=US-ASCII On my R installation (0.62.4) there is no dist() function, so I attach one possibility. It provides
2013 May 21
1
keep the centre fixed in K-means clustering
...-0.3974660 0.1541685 1 [2,] 2 0.5321347 0.2497867 1 [3,] 3 0.2550276 0.1691720 1 [4,] 4 -0.1162162 0.6754874 1 ... [55,] 55 1.1570996 1.1175119 2 [56,] 56 1.4816195 1.6836226 2 p.s. I use Euclidian to obtain/calculate distance matrix. Many thanks in advance HJ [[alternative HTML version deleted]]
2007 Apr 01
4
Abundance data ordination in R
Um texto embutido e sem conjunto de caracteres especificado associado... Nome: n?o dispon?vel Url: https://stat.ethz.ch/pipermail/r-help/attachments/20070401/33921c2a/attachment.pl
2008 Oct 09
2
vectorization instead of using loop
...e x with n people as rows and k variables as columns. Now, for each person (i.e., each row) we want to calculate a distance between him/her and EACH other person in x. In other words, we want to create a n x n matrix with distances (with zeros in the diagonal). However, we do not want to calculate Euclidian distances. We want to calculate Mahalanobis distances, which take into account the covariance among variables. Below is the piece of code we wrote ("covmat" in the function below is the variance-covariance matrix among variables in Data that has to be fed into mahalonobis function we are...
2010 Jun 25
1
best way to plot a evolution in time
Hi everyone, I have the following question: given three objects let's say: a <- c( 2 , 5, 15, 16) b <- c(1 ,1, 8 , 8) c <- c (10, 10 11 ,11) m<-matrix(c(a,b,c),byrow=T,nrow=3) rownames(m)<-c("gene a", 'gene b','gene c') m gene.dist<-dist(m,method='euclidian') gene.dist which is the best way to plot their evolution in time? shoul I use a levelplot or just a normal plot? if I use a normal plot how do I plot evolution in time? -- View this message in context: http://r.789695.n4.nabble.com/best-way-to-plot-a-evolution-in-time-tp2267993p2267993.html...
2020 Feb 22
2
The AnghaBench collection of compilable programs
...* We can check how many programs can be compiled by different high-level synthesis tools into FPGAs. We have tried LegUp and Vivado, for instance. * Our webpage contains a search box, so that you can get the closest programs to a given input program. Currently, we measure program distance as the Euclidian distance on Namolaru feature vectors. We do not currently provide inputs for those programs. It's possible to execute the so called "leaf-functions", e.g., functions that do not call other routines. We have thousands of them. However, we do not guarantee the absence of undefined beha...
2005 Oct 06
1
Compare two distance matrices
Hi all, I am trying to compare two distance matrices with R. I would like to create a XY plot of these matrices and do some linear regression on it. But, I am a bit new to R, so i have a few questions (I searched in the documentation with no success). The first problem is loading a distance matrix into R. This matrix is the output of a the Phylip program Protdist and lookes like this: 5
2016 Mar 06
3
GSOC-2016 Project : Clustering of search results
...n terms of speed!). > > Thanks for the reply James! I went through the code in a little more detail and there are a few things I noticed and a few questions I have. First off, the distance metric used in the current implementation is the cosine measure. Though useful, K-means implicitly uses Euclidian distance as a measure of document similarity between two document term vectors. Hence, simply creating one more class for a distance metric by just inheriting the DocSim base class will be good. Using the tf-idf weights, we can find term weights and instead of using these vectors for cosine similar...
2010 May 05
2
Using statistical test to distinguish two groups
Hi R friends, I am posting this question even though I know that the nature of it is closer to general stats than R. Please let me know if you are aware of a list for general statistical questions: I am looking for a simple method to distinguish two groups of data in a long vector of numbers: list <- c(1,2,3,2,3,2,3,4,3,2,3,4,3,2,400,340,3,2,4,5,6,4,3,6,4,5,3) I would like to