thr3ads.net - similar to: "Canberra distance"

2010 Feb 06

Canberra distance

Hi the list, According to what I know, the Canberra distance between X and Y is: sum[ (|x_i - y_i|) / (|x_i|+|y_i|) ] (with | | denoting the 'absolute value' function). In the source code of the Canberra distance in the file distance.c, we find: sum = fabs(x[i1] + x[i2]); diff = fabs(x[i1] - x[i2]); dev = diff/sum; which corresponds to the formula: sum[ (|x_i - y_i|) /
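The two denominators mentioned in this post, |x_i| + |y_i| and |x_i + y_i|, agree when every pair of coordinates shares a sign and can differ otherwise. A minimal R sketch of that comparison follows; the helper names `canberra_abs_sum` and `canberra_sum_abs` are hypothetical and are not part of R or of distance.c.

```r
# Hypothetical helpers for illustration only; not the code used by dist().

# Denominator |x_i| + |y_i| (the definition quoted in the post)
canberra_abs_sum <- function(x, y) sum(abs(x - y) / (abs(x) + abs(y)))

# Denominator |x_i + y_i| (the form read off the C snippet in the post)
canberra_sum_abs <- function(x, y) sum(abs(x - y) / abs(x + y))

y <- c(2, 4, 1)

# All coordinates positive: both forms give the same value
x_same <- c(1, 2, 3)
same_sign <- c(canberra_abs_sum(x_same, y), canberra_sum_abs(x_same, y))

# One coordinate of x negative: the two forms give different values
x_mixed <- c(1, -2, 3)
mixed <- c(canberra_abs_sum(x_mixed, y), canberra_sum_abs(x_mixed, y))

print(same_sign)
print(mixed)
```

The difference only shows up for data with mixed signs, so it may go unnoticed on count or abundance data, which are non-negative.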

2001 Mar 05

Canberra dist and double zeros

Canberra distance is defined in function `dist' (standard library `mva') as sum(|x_i - y_i| / |x_i + y_i|) Obviously this is undefined for cases where both x_i and y_i are zeros. Since double zeros are common in many data sets, this is a nuisance. In our field (from which the distance is coming), it is customary to remove double zeros: contribution to distance is zero when both x_i
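The double-zero convention described in this post can be sketched in R as follows. This is an illustration under the post's stated formula, sum(|x_i - y_i| / |x_i + y_i|), and the helper name `canberra_drop_double_zeros` is hypothetical; it is not the implementation in `dist`.

```r
# Hypothetical helper for illustration only; not the implementation in dist().
canberra_drop_double_zeros <- function(x, y) {
  terms <- abs(x - y) / abs(x + y)   # 0/0 yields NaN for double zeros
  both_zero <- x == 0 & y == 0
  terms[both_zero] <- 0              # a double zero contributes nothing
  sum(terms)
}

x <- c(0, 1, 3)
y <- c(0, 2, 1)

naive <- sum(abs(x - y) / abs(x + y))        # NaN because of the 0/0 term
adjusted <- canberra_drop_double_zeros(x, y) # 1/3 + 2/4

print(naive)
print(adjusted)
```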

2007 Oct 16

Canberra distance

Hi, I misunderstand the definition of Canberra distance in R. On the Internet and in the function description pages of dist() from stats and Dist() from amap, the Canberra distance between vectors x and y, d(x,y), is: d(x,y) = sum(abs(x-y)/(x+y)) But in use, through simple examples, we find that the formula is: d(x,y) = (NZ + 1)/NZ * sum(abs(x-y)/(x+y)) with NZ = number of pairs of coordinates that are

2018 Jan 17

mgcv::gam is it possible to have a 'simple' product of 1-d smooths?

I am trying to test out several mgcv::gam models in a scalar-on-function regression analysis. The following is the 'hierarchy' of models I would like to test:
(1) Y_i = a + integral[ X_i(t)*Beta(t) dt ]
(2) Y_i = a + integral[ F{X_i(t)}*Beta(t) dt ]
(3) Y_i = a + integral[ F{X_i(t),t} dt ]
equivalents for discrete data might be:
(1) Y_i = a + sum_t[ L_t * X_it * Beta_t ]
(2) Y_i

2010 Apr 25

function pointer question

Hello, I have the following function that receives a "function pointer" formal parameter name "fnc":
loocv <- function(data, fnc) {
    n <- length(data.x)
    score <- 0
    for (i in 1:n) {
        x_i <- data.x[-i]
        y_i <- data.y[-i]
        yhat <- fnc(x=x_i,y=y_i)
        score <- score + (y_i - yhat)^2
    }
    score <- score/n

2004 Apr 18

lm with data=(means,sds,ns)

Hi Folks, I am dealing with data which have been presented as, at each x_i:
mean m_i of the y-values at x_i,
sd s_i of the y-values at x_i,
number n_i of the y-values at x_i,
and I want to linearly regress y on x. There does not seem to be an option to 'lm' which can deal with such data directly, though the regression problem could be algebraically

2003 Oct 23

Variance-covariance matrix for beta hat and b hat from lme

Dear all, Given a LME model (following the notation of Pinheiro and Bates 2000) y_i = X_i*beta + Z_i*b_i + e_i, is it possible to extract the variance-covariance matrix for the estimated beta_i hat and b_i hat from the lme fitted object? The reason for needing this is because I want to have interval prediction on the predicted values (at level = 0:1). The "predict.lme" seems to

2007 Mar 01

covariance question which has nothing to do with R

This is a covariance calculation question so nothing to do with R but maybe someone could help me anyway. Suppose, I have two random variables X and Y whose means are both known to be zero and I want to get an estimate of their covariance. I have n sample pairs (X1,Y1) (X2,Y2) . . . . . (Xn,Yn) , so that the covariance estimate is clearly 1/n *(sum from i = 1 to n of ( X_i*Y_i) ) But,
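The estimate described in this post, 1/n times the sum of X_i*Y_i for variables whose means are known to be zero, can be sketched in R as below. The helper name `cov_zero_mean` and the sample values are hypothetical. The comparison with `cov()` is only there to show that the built-in function centers on the sample means and divides by n - 1 instead.

```r
# Hypothetical helper for illustration only.
# Covariance estimate when both means are known to be zero:
# no centering, divisor n.
cov_zero_mean <- function(x, y) mean(x * y)

# Made-up sample pairs (both sample means happen to be zero here)
x <- c(1, -1, 2, -2)
y <- c(2, -1, 1, -2)

known_mean_estimate <- cov_zero_mean(x, y)  # sum(x * y) / n
sample_cov <- cov(x, y)                     # centered, divisor n - 1

print(known_mean_estimate)
print(sample_cov)
```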

2008 Dec 01

linear functional relationships with heteroscedastic & non-Gaussian errors - any packages around?

Hi, I have a situation where I have a set of pairs of X & Y variables for each of which I have a (fairly) well-defined PDF. The PDF(x_i)'s and PDF(y_i)'s are unfortunately often rather non-Gaussian although most of the time not multi-modal. For these data (estimates of gas content in galaxies), I need to quantify a linear functional relationship and I am trying to do this as

2018 Mar 15

stats 'dist' euclidean distance calculation

> 3x3 subset used
>       Locus1 Locus2 Locus3
> Samp1 GG     <NA>   GG
> Samp2 AG     CA     GA
> Samp3 AG     CA     GG
>
> The euclidean distance function is defined as: sqrt(sum((x_i - y_i)^2)) My
> assumption was that the difference between

2009 Sep 24

basic cubic spline smoothing (resending because not sure about pending)

Hello, I come from a non-statistics background, but R is available to me, and I needed to test an implementation of smoothing spline that I have written in C++, so I would like to match the results with R (for my unit tests). I am following Smoothing Splines, D.G. Pollock (available online) where we have a list of points (xi, yi), the yi points are random such that: y_i = f(x_i) + e_i

2002 Aug 20

Re: SVM questions

> > So i guess from your prev. email the svmModel$coefs correspond to the
> > "Alpha" .
yes (times the sign of y!).
> > Why do I see three columns in the coefs?( Is this the number of classes -1
> > = Number of hyperplanes)
yes, but in a packed format which is not trivial. I attach some explanation I sent to R-help some time ago (the guy wanted to write his own

2009 Sep 24

basic cubic spline smoothing

Hello, I come from a non-statistics background, but R is available to me, and I needed to test an implementation of smoothing spline that I have written in C++, so I would like to match the results with R (for my unit tests). I am following http://www.nabble.com/file/p25569553/SPLINES.PDF where we have a list of points (xi, yi), the yi points are random such that: y_i = f(x_i) +

2010 Feb 18

lme - incorporating measurement error with estimated V-C matrix

I have data (each Y_i is a vector) in the form of Y_i = X_i \beta_i + Z_i b_i + epsilon_i Were it not for the measurement error (the epsilon_i) it's a very simple model --- nice and balanced, compound symmetry, and I'd just use lme(y ~ x1 + x2, random=~1|subj, ...) but the measurement error is throwing me off. Because the Y_i are actually derived from other data, I am able

2007 Feb 01

Help with efficient double sum of max (X_i, Y_i) (X & Y vectors)

Greetings. For R gurus this may be a no brainer, but I could not find pointers to efficient computation of this beast in past help files. Background - I wish to implement a Cramer-von Mises type test statistic which involves double sums of max(X_i,Y_j) where X and Y are vectors of differing length. I am currently using ifelse pointwise in a vector, but have a nagging suspicion that there is a
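One vectorized way to compute a double sum of max(X_i, Y_j) over vectors of different lengths is `outer()` with `pmax`. The sketch below is an assumption about what would fit the question, not the answer given in the thread, and the helper name `double_sum_max` is hypothetical. It builds a length(x) by length(y) matrix, so memory use grows with the product of the two lengths.

```r
# Hypothetical helper for illustration only.
# outer() applies pmax to every (x_i, y_j) pair in one vectorized call.
double_sum_max <- function(x, y) sum(outer(x, y, pmax))

x <- c(1, 4)
y <- c(2, 3, 5)

# Reference computation with explicit loops, for comparison
loop_total <- 0
for (xi in x) {
  for (yj in y) {
    loop_total <- loop_total + max(xi, yj)
  }
}

vectorized_total <- double_sum_max(x, y)

print(vectorized_total)
print(loop_total)
```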

2005 Jun 15

need help on computing double summation

Dear helpers in this forum, This is a clarified version of my previous questions in this forum. I really need your generous help on this issue.
> Suppose I have the following data set:
>
> id  x y
> 023 1 2
> 023 2 5
> 023 4 6
> 023 5 7
> 412 2 5
> 412 3 4
> 412 4 6
> 412 7 9
> 220 5 7
> 220 4 8
> 220 9 8
> ......
> Now I want to compute the

2010 Feb 05

metafor package: effect sizes are not fully independent

In a classical meta-analysis model y_i = X_i * beta_i + e_i, data {y_i} are assumed to be independent effect sizes. However, I'm encountering the following two scenarios: (1) Each source has multiple effect sizes, thus {y_i} are not fully independent of each other. (2) Each source has multiple effect sizes, and each effect size from a source can be categorized as one of the factor levels

2004 Dec 15

how to fit a weighted logistic regression?

I tried lrm in library(Design) but there is always some error message. Is this function really doing the weighted logistic regression as maximizing the following likelihood: \sum w_i*(y_i*\beta*x_i-log(1+exp(\beta*x_i))) Does anybody know a better way to fit this kind of model in R? FYI: one example of getting error message is like:
> x=runif(10,0,3)
> y=c(rep(0,5),rep(1,5))
>

2018 Mar 15

stats 'dist' euclidean distance calculation

Hello, I am working with a matrix of multilocus genotypes for ~180 individual snail samples, with substantial missing data. I am trying to calculate the pairwise genetic distance between individuals using the stats package 'dist' function, using euclidean distance. I took a subset of this dataset (3 samples x 3 loci) to test how euclidean distance is calculated: 3x3 subset used
