Displaying 20 results from an estimated 600 matches similar to: "[LLVMdev] Spell Correction Efficiency"
2011 Feb 03
0
[LLVMdev] Spell Correction Efficiency
On Jan 15, 2011, at 8:31 AM, Matthieu Monrocq wrote:
> Hello Doug,
>
> *putting llvmdev in copy since they are concerned too*
>
> I've finally got around to finish a working implementation of the typical Levenshtein Distance with the diagonal optimization.
>
> I've tested it against the original llvm implementation and checked it on a set of ~18k by randomly
2009 Oct 21
2
[LLVMdev] 2.6 pre-release2 testing ends today
Just a reminder that today is the last day for pre-release2 testing.
Thanks,
Tanya
2009 Oct 22
0
[LLVMdev] 2.6 pre-release2 testing ends today
On Oct 21, 2009, at 3:44 PM, Tanya Lattner wrote:
> Just a reminder that today is the last day for pre-release2 testing.
>
Hi Tanya,
Attached are the log files for Mac OS X 10.5.8 on PowerPC. There are
these failures for clang:
********************
Failing Tests (20):
/Volumes/SandBox/2.6/llvm-2.6/tools/clang/test/CodeGen/
always_inline.c
2012 Jun 29
0
Adding Bi-gram in the QueryParser and Object.
Hi all,
I have jotted down a plan for how to handle or add Bi-gram in Query Object
though QueryParser.
PFA as a sequence diagram which depicts what i got to know about how parser
works and query is build from tokens provided by the lexer.I
have highlighted some area in blue where i think there is possibility of
having bi-grams.While Integrating bi-gram in the Parser ,Query our aim is
to generate
2002 Nov 17
1
SVD for reducing dimensions
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hi all, this is probably simple and I'm just doing something stupid, sorry
about that :-)
I'm trying to convert words (strings of letters) into a fairly small
dimensional space (say 10, but anything between about 5 and 50 would be ok),
which I will call a feature vector. The the distance between two words
represents the similarity of the
2016 Apr 12
0
Xapian 1.3.5 snapshot performance and index size
On Mon, Apr 11, 2016 at 09:54:36AM +0200, Jean-Francois Dockes wrote:
> This way, "to be or not to be" gets from 11 S to 0.6 S, and "to be of
> the" gets from 12 S to 0.9 S. Which is of course brilliant !
>
> I think that I can dump my plan of indexing compound terms for runs of
> common words :)
We had been experimenting with bigrams to accelerate phrases, and
2000 Nov 15
2
loess documentation
Hi all,
I 've got a question about the usage of loess in the modreg package.
The documentation (loess.html) states that the smoothing window is
either set by span or enp.target. If span is used, the details section
of the docs state...
<SNIP>
DETAILS
Fitting is done locally. That is, for the fit at point x, the fit is
made using points in a neighbourhood of x, weighted by their
2000 Nov 15
2
loess documentation
Hi all,
I 've got a question about the usage of loess in the modreg package.
The documentation (loess.html) states that the smoothing window is
either set by span or enp.target. If span is used, the details section
of the docs state...
<SNIP>
DETAILS
Fitting is done locally. That is, for the fit at point x, the fit is
made using points in a neighbourhood of x, weighted by their
2004 Feb 11
6
AGREP
Hi all, I have two questions
1 - I have the version 1.4.1 of R, and it doesn't have the 'agrep'
function in the base library. Is there a way to make this funcion
avaliable in R 1.4.1? I mean, how to 'copy' it from R 1.8.1 and 'paste'
it in R 1.4.1?
2 - The AGREP function doesn't give me the Levenshtein distance (edit
distance). Is there a function in R that does
2011 Oct 21
2
Change column/row-name
Hi,
I am very happy. My problems are solved without one little thing:
(Iske <- matrix(c(1, 1, 1, 2, 2, 2, 1, 1, 1, 5, 1, 2, 2, 2, 1, 1, 1,
4, 4, 4, 4, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 5, 4, 4, 4, 4, 4, 4, 2,
2, 2, 2, 4, 5, 5, 5, 5, 5, 5, 5, 5, 5, 1, 2), ncol = 5)) #My Matrix
Iske<- Iske+33 #I want see the letters
(Iske.char<-apply(Iske, 1, function(x) rawToChar(as.raw(x)))) #Numbers to
2008 Aug 26
2
String search: Return "closest" match
Hi,
I have to match names where names can be recorded with errors or additions.
Now I am searching for a string search function which returns always the "closest" match. E.g. searching for "Washington" it should return only Washington but not Washington, D.C. But it also could be that the list contains only "Hamburg" but the record I am searching for is
2012 Jun 03
0
Proposal for Integration of Bi-gram in Xapian Architecture
Hi,
I have made a proposal for changes to integrate bi-grams in Xapian
Architecture on Wiki page.
Bigram Integration Proposal:
http://trac.xapian.org/wiki/GSoC2012/Bi-gram%20Language%20Modeling/Bi-gram%20Integration%20Proposal
Since Bi-gram integration will make some difference in how data is accessed
from the back-end so its better to get review from whole comunity.Moreover
i also have some
2008 Oct 01
0
MiscPsycho 1.3 posted to CRAN
An updated version of the Miscellaneous Psychometrics package has been
updated to CRAN. The following updates are included in the package:
1) An implementation of the Stocking-Lord procedure for linking test
scales.
2) An implementation of the Levenshtein algorithm for comparing
character strings
3) stringProbs, a function for computing the probability of a given
Levenshtein Distance
4) Three
2009 Jan 22
4
text vector clustering
Hi,
I am a new user of R using R 2.8.1 in windows 2003. I have a csv file with
single column which contain the 30,000 students names. There were typo
errors while entering this student names. The actual list of names is <
1000. However we dont have that list for keyword search.
I am interested in grouping/cluster these names as those which are
similar letter to letter. Are there any
2010 Jan 07
2
Find by looping thru array
Hello all,
I''m in my third month of Ruby on Rails. I think it''s coming along
fairly well.
My current issue is this.
I''m do a search based on string data some of which is very long. So I''m
doing a similar match using "amatch" gem.
When I have a 50% or greater match I store the id to an array.
How can I do a find looping thru the array then send the
2016 Mar 04
2
GSOC 2016 project on Ranking
Hello Sir,
I am a third-year student at the Department of mathematics at IIT
Kharagpur. I have good experience in Information Retrieval and Machine
Learning. I have read many chapters of the book Introduction to Information
Retrieval. Recently I am doing a project on tagging a question on a Q&A
Forum using ranking the tags and probabilistic inference. I also have
software development
2010 Jan 18
4
Index indexed words
Hello,
We would like to create Google or Firefox like "search hints".
If someone types "abc", the search system should name
some possible hints.
I think, Firefox does it by indexing 3-characters of the domain
name. If you enter parts, you get some hints.
Thank you very much
Marcus
2008 Nov 07
0
[LLVMdev] [cfe-commits] r58509 - in /cfe/trunk: include/clang/Basic/DiagnosticKinds.def lib/AST/Expr.cpp lib/Sema/Sema.h lib/Sema/SemaExprCXX.cpp lib/Sema/SemaInherit.cpp lib/Sema/SemaInherit.h lib/Sema/SemaOverload.cpp test/SemaCXX/condition.cpp
On 2008-11-04, at 08:12, Sebastian Redl wrote:
> Mike Stump wrote:
>> On Nov 3, 2008, at 1:39 PM, Sebastian Redl wrote:
>>>> svn help cl
>>>>
>>>> [ runs away, much the same way someone that throws a lighted match
>>>> into a pool of gasoline ]
Probably wise.
>>> "cl": unknown command.
>>
>> From svn 1.5:
2017 May 30
2
Communication between Clang Sema and the Clang Codegen...
Hi All,
We have populated info in the Clang Sema i.e class Sema
(include/clang/Sema/Sema.h) and like to propagate the same to Clang
Codegen .
Currently we are propagating through ASTContext ,where we have
duplicating fields info and operation in the Sema and ASTContext .
Any better way of doing the same ?
Thank you
~Umesh
2006 Jun 19
2
fuzzy search
This may be offtopic to Rails, but what are people doing to find records
based on fuzzy string matches? For example, if you wanted to find a
Person with name "David Heinemeier Hansson" but searched using the
string "Dave Hansson".
Currently I am find_by_sql that calls the PostgreSQL function
"levenshtein(string1, string2)" which returns results with a score