thr3ads.net - search: "similarity"

Displaying 20 results from an estimated 55611 matches for "similarity".

2004 Dec 08

similarity matrix conversion to dissimilarity

I have a matrix of similarity scores that I want to convert into a matrix of dissimilarity scores so that I can apply some clustering methods to the data. That is, high values in my matrix signify similarity and low values (zero being the lowest) signify no similarity. What functions/options in R or its packages are avail...

tuning SVM's

2004 Dec 01

tuning SVM's

Hi I am doing this sort of thing: POLY: > > obj = best.tune(svm, similarity ~., data = training, kernel = "polynomial") > summary(obj) Call: best.tune(svm, similarity ~ ., data = training, kernel = "polynomial") Parameters: SVM-Type: eps-regression SVM-Kernel: polynomial cost: 1 degree: 3 gamma: 0.04545455 c...

[RFC] Framework for Finding and Using Similarity at the IR Level

2020 Sep 01

[RFC] Framework for Finding and Using Similarity at the IR Level

...that perform nearly the same operation. By copying and pasting code throughout a code base, or using a piece of code as a reference to create a new piece of code that has nearly the same structure redundancies are inadvertently introduced throughout programs. Furthermore, compilers can introduce similarity through sets of instructions that require the same code generation strategies, or optimizing to the same patterns. For example these two pieces of code: int fn(const std::vector<int> &myVec) { for (auto it = myVec.begin(), et = myVec.end(); it != et; ++it) { if (*it &a...

isoMDS

2004 Sep 08

isoMDS

Dear List: I have a question regarding an MDS procedure that I am accustomed to using. I have searched around the archives a bit and the help doc and still need a little assistance. The package isoMDS is what I need to perform the non-metric scaling, but I am working with similarity matrices, not dissimilarities. The question may end up being resolved simply. Here is a bit of substantive background. I am working on a technique where individuals organize items based on how similar they perceive the items to be. For example, assume there are 10 items. Person 1 might group items...

Similarity matrix

2001 Apr 10

Similarity matrix

I frequently use hclust on a similarity matrix. In R only a distance matrix is allowed. Is there a simple reliable transformation of a similarity matrix that will result in a distance matrix making hclust work the same as S-Plus with a similarity matrix? Venables & Ripley 3rd edition implies that a simple reversal of values will s...

Clustering with R - efficient processing of large sparse data sets (text data)

2009 Sep 27

Clustering with R - efficient processing of large sparse data sets (text data)

I checked the R procedure HCLUST (hierarchical clustering) but it looks like it requires a full triangular n x n similarity matrix as input, where n = number of observations. The number of variables is 200. My data set has n = 50,000 observations (keywords), and I use ad-hoc similarity measures, not available in R, to measure keyword similarity. Here, the vast majority of the n x n similarities are equal to zero. So I...

Measuring Goodness of a Matrix

2008 Jun 24

Measuring Goodness of a Matrix

Hi all, Suppose I have 2 matrices A and B. And I want to measure how good each of this matrix is. So I intend to compare A and B with another "gold standard" matrix X. Meaning the more similar a matrix to X the better it is. What is the common way in R to measure matrix similarity (ie. A vs X, and B vs X) ? - Gundala Viswanath Jakarta - Indonesia

How to measure level of similarity of two data frames

2012 May 26

How to measure level of similarity of two data frames

...ow if there's a way of summarizing the overall distance by some kind of metrics. If anyone know a proper way of doing so and/or a package I would greatly appreciate your suggestions. Thanks very much! Kel -- View this message in context: http://r.789695.n4.nabble.com/How-to-measure-level-of-similarity-of-two-data-frames-tp4631466.html Sent from the R help mailing list archive at Nabble.com.

R equivalent of proc varclus

2011 Oct 07

R equivalent of proc varclus

Dear List What is the R package equivalent of Proc Varclus or Information Value. ANy assistance in determining R equivalents of f Oblique Component Analysis (PROC VARCLUS), Information Value (IV) and Weight Of Evidence (WOE) analysis, and business intelligence http://www.nesug.org/proceedings/nesug06/an/da23.pdf Regards, Ajay Websites- http://decisionstats.com [[alternative HTML version

similarity matrix

2011 Dec 04

similarity matrix

Hello R-users, I've got a file with individuals as colums and the clusters where they occur in as rows. And I wanted a similarity matrix which tells me how many times each individual occurs with another. My eventual goal is to make Venn-diagrams from the occurence of my individuals. So I've this: cluster ind1 ind2 ind3 etc. 1 0 1 2 2 3 0 1 3 1 1 1 And I wan...

question about similarities cluster using hierclust

2004 Jun 10

question about similarities cluster using hierclust

my major is bioinformatics, and i'm trying to cluster ( agglomerate the closest pari of observations ) in R. i have already got my own similarities metric, but do not know how to clust it based on similarities instead of dissimilarities. since the help document of hierclust mentions the parameter "sim", which seems good to me, but it doesn't appear in the code of hierclust()

2011 Jul 12

apply (or similar preferred) for multiple columns

Dear all, I would like to use the apply or a similar function belonging to this family, but applying for each column (or row) but let say for each q columns. For example I would like to apply a function FUN for the first q columns of matrix X then for q+1:2*q and so on. If I do apply (X, 2, FUN) it applies for each column and not for every q columns. Is that possible with any similar function?

2008 Aug 24

similarity between two gene lists with varied length

Dear listers, a little off-topic: I am looking for and compare algorithms which can calculate "distance" or "similarity" between two gene lists with different lengths. Any paper, any implementation in R and any suggestion is welcome! Thanks, -- Weiwei Shi, Ph.D Research Scientist GeneGO, Inc. "Did you always know?" "No, I did not. But I believed..." ---Matrix III [[alternative HTML ve...

similarity measure for binary data

2009 Oct 29

similarity measure for binary data

I am doing hierarchical clustering with cluster package. I couldnot find similarity measures like matching coefficient, Jaccard coefficient and sokal and sneath. Could anyone please tell package with similarity measures for binary data? kind regards, Ms.Karunambigai M PhD Scholar Dept. of Biostatistics NIMHANS Bangalore India From cricket scores to your friends. Try the Y...

taking the median across similar data

2010 Feb 25

taking the median across similar data

Dear All, I am analyzing the miRNA data set in which I have 817 unique probes for each they have 20 features each . I have to group the similar features and take the median across them so that I have a data with no repeats to perform invariant analysis . My data looks something similar format probename sample1 sample2 sample3 A 2.3 2.4 2.5 A

help with hclust

2011 Sep 13

help with hclust

Hello, how can I get the similarity value (i.e., the inner cluster similarity) that was used to cut a hierarchical tree at a specific height? I would appreciate your help! Best regards, Madeleine

Xapian document matching

2007 Apr 30

Xapian document matching

Hi, i'm wondering is there a possibility to do like ABCSok do (http://nyheter.abcsok.no/), to make "Main article" and "Same articles" collapsed to it. Like on http://news.google.com/?hl=en the same thing. "Parent" and "same article on other sites" (they do differ from each other a little bit). Maybe somebody know how to do that thing or where to read

DWARF info in readobj

2015 Sep 15

DWARF info in readobj

Hi All, I see that llvm-readobj displays information similar to GNU readelf does except DWARF data. I also see llvm-dwarfdump dumps all DWARF data in user readable format. Is there a plan for readobj to incorporate similar options? This will make readobj more feature complete for reading objects similar to readelf. If this is not the plan, will llvm-dwarfdump be a tool that regular user

OpenRico LiveGrid or similar

2006 Feb 08

OpenRico LiveGrid or similar

Hi Has anyone used OpenRico''s "on-demand listbox" LiveGrid or something similar in a Rails app already? How well does it behave? I''m asking this because I need a scrollable list but the number of records in the table could be well above 5000. In the past I''ve used similar "on-demand fetchings" in desktop apps and it was a real blessing (the

rsync performance on large files strongly depends on file's (dis)similarity

2014 Apr 11

rsync performance on large files strongly depends on file's (dis)similarity

...sum, exactly because the files are very different from each other. rsync ends up computing lots of strong checksums, which do not result in a match. Is this a valid/reasonable explanation? Can someone else confirm this relationship between rsync's computational overhead and the file's (dis)similarity? Thanks, Thomas.

search for: similarity