thr3ads.net - similar to: "exploring dist()"

Displaying 20 results from an estimated 3000 matches similar to: "exploring dist()"

stats 'dist' euclidean distance calculation

2018 Mar 15

stats 'dist' euclidean distance calculation

Hello, I am working with a matrix of multilocus genotypes for ~180 individual snail samples, with substantial missing data. I am trying to calculate the pairwise genetic distance between individuals using the stats package 'dist' function, using euclidean distance. I took a subset of this dataset (3 samples x 3 loci) to test how euclidean distance is calculated: 3x3 subset used

Clustering question \ dist(datmat)

2006 Mar 27

Clustering question \ dist(datmat)

Hello everybody. I am trying to cluster circular data (data points which are angles), thus i can not use the "dist" function in "mclust" to generate my distance matrix, I am using the function " Dij = 0.5*( 1 - cos(theta_i - theta_j)). The thing is "hclust" will not accept this distance matrix, i tried to put it in a data frame, but again i get an error message

BIO-ENV procedure

2004 May 13

BIO-ENV procedure

I've been unable to find a R package that provides the means of performing Clarke & Ainsworth's BIO-ENV procedure or something comparable. Briefly, they describe a method for comparing two separate sample ordinations, one from species data and the second from environmental data. The analysis includes selection of the 'best' subset of environmental variables for explaining

euclidean dist. between matrices

2012 Nov 18

euclidean dist. between matrices

Dear Users,I have two matrices A=15*365 and B=1*365. i want to calculate "Euclidean Distance" between these matrices in such a way that i should have euclidean distance of matrix B against all the columns of matrix A. More precisely, first i want euclidean dist. of column 1 of A against B, then column 2 against B, 3rd column of A against B and so on.is there a way in r to do it?your help

partitioning variation using the Vegan CCA routine?

2007 Apr 27

partitioning variation using the Vegan CCA routine?

Hello I am using Jari Oksanen's CCA routine from the Vegan package on some estuary data, following a technique applied in (Anderson, M.J. & Gribble, N.A., 1998, Partitioning the variation among spatial, temporal and environmental components in a multivariate data set, Australian Journal of Ecology 23, 158-167). Some steps in the process require that the dependent matrix be constrained by

dist function in R is very slow

2017 Jun 18

dist function in R is very slow

Hi Stefan, Thank you very much for pointing me to the wordspace package. It does the job a bit faster than my C code but is 100 times more convenient. By the way, since the tcrossprod function in the Matrix package is so fast, the Euclidean distance can be computed very fast: euc_dist <- function(m) {mtm <- Matrix::tcrossprod(m); sq <- rowSums(m*m);? sqrt(outer(sq,sq,"+") -

cluster of points

2012 Jul 30

cluster of points

Hello: What I want to do is quite simple, but I can't find a way. I have a data frame with several points (x and y coords). I want to add another column with cluster membership. For example aggregate all the points that stand within a distance of 40 from each other. I've tried using "nncluster" from the package nnclust, but the results are not correct, for some

Abundance data ordination in R

2007 Apr 01

Abundance data ordination in R

Um texto embutido e sem conjunto de caracteres especificado associado... Nome: n?o dispon?vel Url: https://stat.ethz.ch/pipermail/r-help/attachments/20070401/33921c2a/attachment.pl

buglet in dist() ?

2007 Sep 02

buglet in dist() ?

the first line of dist() says if (!is.na(pmatch(method, "euclidian"))) shouldn't that be "euclidean" ? --------------------- R version 2.5.1 (2007-06-27) i486-pc-linux-gnu locale:

NAs introduced by coercion in dist()

2007 May 02

NAs introduced by coercion in dist()

I work with Windows and use R version 2.4.1. I am JUST starting to learn this program... I get this warning message 'NAs introduced by coercion' while trying to build a distance matrix (to be analyzed with NMDS later) from a 336 x 100 data matrix. The original matrix has lots of zeros and no missing values, but I don't think this should matter. I searched this forum and people have

OT: (quasi-?) separation in a logistic GLM

2008 Dec 15

OT: (quasi-?) separation in a logistic GLM

Dear List, Apologies for this off-topic post but it is R-related in the sense that I am trying to understand what R is telling me with the data to hand. ROC curves have recently been used to determine a dissimilarity threshold for identifying whether two samples are from the same "type" or not. Given the bashing that ROC curves get whenever anyone asks about them on this list (and

run a slef-written function

2011 Mar 15

run a slef-written function

How could one get started with a self-written function? I have a function written in .txt format, but can not find the way to import it to the R space. Would be very appreciated for help. -- View this message in context: http://r.789695.n4.nabble.com/run-a-slef-written-function-tp3356537p3356537.html Sent from the R help mailing list archive at Nabble.com.

dist label names

2007 Apr 25

dist label names

Hello, I am trying to do a multi-dimensional scaling of the World Bank's quality of governance indicators for the Balkan region. I am having trouble labelling my plot. Could some kind person help me out. How do I set the attribute Label by a variable (say, "Code")? At present I get this: >qog.dist<-dist(Balkans.data, method = "euclidean", diag = FALSE, upper =

dist() {"mva" package} bug: treats +/- Inf as NA

2002 Oct 21

dist() {"mva" package} bug: treats +/- Inf as NA

Vince Carey found this (thank you!). Since the fix to the problem is not entirely obvious, I post this to R-devel as RFC: help(dist) says: >> Missing values are allowed, and are excluded from all computations >> involving the rows within which they occur. If some columns are >> excluded in calculating a Euclidean, Manhattan or Canberra >> distance, the sum is

Non-metric multidimensional scaling (NMDS) help

2010 Jan 12

Non-metric multidimensional scaling (NMDS) help

Hi, I am currently working on some data and feel that NMDS would return an excellent result. With my current data set however I have been experiencing some problems and cannot carry out metaMDS. I have tried with a few smaller data sets which I created for practice sake and this has worked fine. I think it is the set up of my data set that is causing me trouble. I have 18 columns and 18 rows,

stats 'dist' euclidean distance calculation

2018 Mar 15

stats 'dist' euclidean distance calculation

> 3x3 subset used > Locus1 Locus2 Locus3 > Samp1 GG <NA> GG > Samp2 AG CA GA > Samp3 AG CA GG > > The euclidean distance function is defined as: sqrt(sum((x_i - y_i)^2)) My > assumption was that the difference between

custom metric for dist for use with hclust/kmeans

2010 May 05

custom metric for dist for use with hclust/kmeans

Hi guys, I've been using the kmeans and hclust functions for some time now and was wondering if I could specify a custom metric when passing my data frame into hclust as a distance matrix. Actually, kmeans doesn't even take a distance matrix; it takes the data frame directly. I was wondering if there's a way or if there's a package that lets you create distance matrices from

distance matrices

2009 Dec 03

distance matrices

i'm working on some distance matrices and i was wondering if there is a way to export the matrices from R to excel. OG thanks [[alternative HTML version deleted]]

metaMDS Error, Nan similar or negative values

2013 Jun 22

metaMDS Error, Nan similar or negative values

H ello R-experts, I want to do ordination plots using vegan metaMDS. I have a where many cells have zero values. Data structure: X[1:10,1:14] Height.1 Height.2 Height.3 Height.4 Height.5 Height.6 Height.7 Height.8 Height.9 Height.10 Height.11 Height.12 Height.13 D30I1A 46 0 0 0 0 0 0 0 0 0 39 0 98 D30I1B

How to subset based on column name that is a number ?

2011 Jun 09

How to subset based on column name that is a number ?

Hi, I have a data frame with column names "1", "2", "3", ... and I'd like to extract a subset based on the values in the first column. None of the methods I tried worked (below). x <- subset(dframe, 1 = = "My Text") x <- subset(dframe, "1" = = "My Text") x <- subset(dframe, names(dframe)[1] = = "My Text") Q

similar to: exploring dist()