similar to: simprof test using jaccard distance

Displaying 20 results from an estimated 300 matches similar to: "simprof test using jaccard distance"

2008 Feb 11
2
Viable Approach to Parallel R?
All, We are researching approaches to parallel R with the end goal of running R in a distributed manner on a Linux cluster. We expect of course to do some work decomposing our problems to be task-parallel or data-parallel, but wouldn't mind getting an initial boost working with "embarrassingly parallel" code sections and one of the approaches below. Incidentally our environment
2009 Jun 07
1
Inf in nnet final value for validation data
Hi, I use nnet for my classification problem and have a problem concerning the calculation of the final value for my validation data.(nnet only calculates the final value for the training data). I made my own final value formula (for the training data I get the same value as nnet): # prob-matrix pmatrix <- cat*fittedValues tmp <- rowSums(pmatrix) # -log likelihood
2001 Nov 05
1
Why doesn't outer work?
Hello I'm a population ecologist and use R for all my stats and modelling. Recently I have been using R to numerically solve integral projection models. This involves constructing several large matrices. The current code by Easterling (Size-specific sensitivity: Applying a new structured population model. Ecology, 2000, 81, 694-708) uses nested loops to construct the matrices. To speed up the
2008 Feb 15
2
help on using try() to catch an error
Dear R Users, I have the following glm, which I am running several times in a loop (I am not including the full code): reduced_model <- NULL; full_model <- NULL; reduced_model <- try(glm.fit(X4,n,family=poisson(link="log"))) full_model <- try(glm.fit(X5,n,family=poisson(link="log"))); On some occasions, an error is produced, which is why I have attempted to work
2008 Jan 25
1
increasing speed for permutations of glm
Dear R Programmers, I am trying to run a Poisson regression on all pairs of variables in a data set and obtain the permutation distribution. The number of pairs is around 100000. It seems my code will take weeks to run, unless I try something else. Could you give me any suggestions on how to improve the speed of the code below, or any general suggestions on how I may accomplish this task. Thanks
2010 Jun 15
2
Integration problem: error in invoking an outside function
Dear all, Currently I am trying to integrate a function which depends on four variables, two of which are given, one is given in the integrate function, so there is one variable to integrate on. The code is as follows: Pmatrix = function(th) { P = matrix(nrow=6, ncol=6, data=0) P[1,1] = P[2,1]=P[3,2]=P[4,3]=P[5,4]=P[6,5]= exp(-th) P[,6] = 1-exp(-th) return(P)} lim.verd =
2010 Dec 28
3
Jaccard dissimilarity matrix for PCA
Hi I have a large dataset, containing a wide range of binary variables. I would like first of all to compute a jaccard matrix, then do a PCA on this matrix, so that I finally can do a hierarchical clustering on the principal components. My problem is, that I don't know how to compute the jaccard dissimilarity matrix in R? Which package to use, and so on... Can anybody help me? Alternatively
2009 Mar 25
1
how to calcualte Jaccard Coefficient
Does anyone have a good method for calculating Jaccard coefficients now that the dissimilarity() function is no longer an option? Wen Gu John Jay College of Criminal Justice445 West 59 StreetNew York, NY 10029 wgu@gc.cuny.edu _________________________________________________________________ Express your personality in color! Preview and select themes for Hotmail®.
2009 Nov 03
1
hierarchical clustering with Jaccard index
hi, I want to do hierarchical clustering with Jaccord index. I tried to do with vegan package for finding index and hierarchical clustering with hclust function. While doing clustering it is showing an error message as "invalid distance method". I would be grateful if anyone tells how to rectify the error. Thanks in advance,   kind regards, Ms.Karunambigai M PhD Scholar Dept. of
2004 Nov 05
0
R check passes code and docs that don't match
I have code and documentation that don't match, but R CMD check didn't flag it. in mspath.R mspath <- function(formula, # formula with observed Markov states ~ observation times (required) qmatrix, # matrix of 1s and 0s with indices of allowed transitions (diagonal is ignored) (required) misc = FALSE, ematrix = NULL, # matrix
2010 Jan 15
3
optimization problem
Dear R-experts, this is not a direct R-problem but I hope you can help me anyway. I would like to minimize || PG-I || over P, where P is a p x p permutation matrix (obtained by permuting the rows and/or columns of the identity matrix), G is a given p x p matrix with full rank and I the identity matrix. ||.|| is the frobenius norm. Does anyone know an algorithm to solve such a problem? And if
2009 Sep 28
0
msm and pmatrix
Dear All, I?m using R package ?msm? to fit a multi state model to infection history data (counts of infections per month upto diagnosis of a particular disease (sink state is state 11). The observed transitions are as follows: to from 1 2 3 4 5 6 7 8 10 11 1 35192 3806 899 233 46 11 3 0 1 534 2 3801 790 249 69 15
2006 Dec 28
1
LU bug in Matrix package
There is a bug in Matrix package, please check it, thanks! Matlab result: x = 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 >> lu(x) ans = 21.0000 22.0000 23.0000 24.0000 25.0000 0.0476 0.9524 1.9048 2.8571 3.8095 0.7619
2012 Oct 29
2
naming datasubsets in a loop
Hello everbody, I want to generate different subsets of my data-set and safe this subsets with names listet in a vector. Because in reality I have got about 70 subsets I want to realize this in a loop Somehow like this: names=c("nam1",
2002 Dec 18
6
Can I build an array of regrssion model?
Hi, I am trying to use piecewise linear regression to approximate a nonlinear function. Actually, I don't know how many linear functions I need, therefore, I want build an array of regression models to automate the approximation job. Could you please give me any clue? Attached is ongoing code: rawData = scan("c:/zyang/mass/data/A01/1.PRN", what=list(numeric(),numeric())); len =
2012 Dec 10
1
Can somebody suggest how to achieve following data manipulation?
Dear all, Let say I have following data: RawData <- matrix(1:101, nr = 1); colnames(RawData) <- c("ASD", as.character(as.yearmon(seq(as.Date("2012-03-01"), length.out = 100, by = "1 month")))); rownames(RawData) <- "XYZ" CutOffDate <- as.Date("2012-09-01") NewDateSeries <- as.character(as.yearmon(seq(CutOffDate, to =
2012 Dec 06
1
clustering of binary data
Good morning, I am analyzing a dataset composed by 364 subjects and 13 binary variables (0,1 = absence,presence). I am testing possible association (co-presence) of my variables. To do this, I was trying with cluster analysis. My main interest is to check for the significance of the obtained clusters. First, I tried with the pvclust() function, by using method.hclust="ward" and
2008 Mar 23
1
mapply
In an earlier post, a person wanted to divide each of the rows of rawdata by the row vector sens so he did below but didn't like it and asked if there was a better solution. rawdata <- data.frame(rbind(c(1,2,2), c(4,5,6))) sens <- c(2,4,6) temp <- t(rawdata)/sens temp <- t(temp) print(temp) Gabor sent three other solutions and I understood 2 of them but not the
2005 Aug 04
2
Avoiding for loop
I understand that in R, for loops are not used as often as other languages, and am trying to learn how to avoid them. I am wondering if there is a more efficient way to write a certain piece of code, which right now I can only envision as a for loop. I have a data file that basically looks like: 1,55 1,23 2,12 ... that defines a matrix. Each row of the data file corresponds to a row of the
2009 Jul 14
2
How to provide list as an argument for the data.frame()
Hi R -users, i've a table as describe below. I'm reading the numeric value presented in this table to populate a list. #table #============ #X    A    B    C #x1    2    3    4 #x2    5    7    10 #x4    2    3    5 #============ rawData <- read.table("raw_data.txt",header=T, sep="\t") myList=list() counter=0 for (i in c(1:length(rawData$X))) {     print (i)