similar to: Add an example to the community page and contribute more code

Displaying 20 results from an estimated 1000 matches similar to: "Add an example to the community page and contribute more code"

2014 Mar 31
2
Paice-Husk Stemmer
Hi everyone, I was working on the Paice-Husk Stemmer, which is a Bite Size Project for Xapian, and I have created a C++ as well as a Snowball version of it. I read the algorithm and picked the rules from here: http://www.comp.lancs.ac.uk/computing/research/stemming/paice/descript.htm The C++ code takes rules as input from a file and generates the stem of a given word, whereas the Snowball version
2013 Jan 24
1
Integrating a PaiceHusk stemmer into the library
Hey guys, hi :) I've implemented a PaiceHusk stemmer externally. What I am doing right now is passing a pointer to my StemPaiceHusk class (which in turn has been subclassed from StemImplementation) to the Stem::Stem(StemImplementation *p) constructor. So basically, I have to include "paicehusk.h" in my indexer. However, I now want to make it a part of the Xapian library so that I
2014 Dec 01
2
Adding Support for Krovetz Stemmer Algo in Xapian
> On 30 Nov 2014, at 17:51, Abhishek Singh Kushwah <abhishek18kushwah at gmail.com> wrote: > > > Two of the implementation of algorithms has already been rejected > > previously due to licenses both being the implementation of porter > > but our xapian use implementation in snowball which i assume is > > under GPL. The only cases I can think you might be
2014 Nov 29
4
Adding Support for Krovetz Stemmer Algo in Xapian
Hello, As mentioned on the project ideas page (adding support for more stemming algorithms), I found an implementation of the Krovetz stemmer algorithm in C++, but before working on merging it into Xapian, I need help identifying the license information associated with the source code. To avoid further licensing issues, could someone kindly check the link
2011 Oct 14
1
stemming an irregular forms?
Dear All, I could not find the irregular forms table in xapian. Please, could you tell me how to define/add words to the irregular forms table in xapian? Thank you a lot. Sascha
2008 Aug 16
1
python how do i stem words in python?
Hi, I am a newbie to Xapian and am trying to get started with it in Python. There is no stemmer.stem_word method in the latest Python library. How do I stem words before doing doc.add_posting? Is there any sample hello-world code in Python that I can use? Thanks a lot! >>> stemmer = xapian.Stem('english') >>> stemmer. stemmer.__call__
2007 Jan 21
2
A few questions: Tweaking StemFilter, indexes, ...
Hello all, I am new to the list, but I have been using ferret for a little bit already. I would first like to thank Dave for all his work on ferret. I had a few questions that I haven't been able to figure out after messing around with ferret and going through the documentation. StemFilter ------ I am trying to improve the quality of my searches in context of the content of my
2005 Oct 12
2
Stemmer Modifications
I'm using Xapian as a search back-end on a website. My client has certain search terms that the stemmer does not stem in a way they would like. For example "continuity" stems to "continu", which produces undesirable results in their application. Is there a way to override the stemming of certain words in a way that is compatible with the indexing stemmer and the query
2015 Feb 15
3
Bitsize project: Krovetz Stemmer
Hello Xapian devs, I had shown interest in writing a Krovetz stemmer for Xapian and spoke to James Aylett about it. Since it was hard to code the stemmer in Snowball, I came up with a C++ implementation of the stemmer. But since it is a dictionary-based stemmer, I'm having problems deciding how to create the dictionary. I did check out some of the implementations of the Krovetz stemmer online
2007 Dec 19
1
noun-verb vs verb-noun aka dogs black vs black dogs
Wow. I wasn't expecting such a voluminous reply -- some I agree with and some I don't. My apologies for an equally voluminous reply. On Wed, 19 Dec 2007, Tony Plack wrote: > > We're not discussing code or the inner workings of Asterisk or even > > changing the functionality of Asterisk, just what the proper order of > > the words should be. > > > >
2007 Oct 01
3
How to beat Google aka Xapian & Natural Language Processing.
Xapians! If tomorrow the Xapian search engine achieved the same performance and search results as Google, we would still not be able to beat Google, because we would only have created a copy of the search that Google already offers. However, there is a way to beat anyone, and there is a way to beat Google successfully as well: just do not give up. Some see it as implementing Ajax, or
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all, I have some trouble using the stemming algorithm provided by the tm (text mining) + Snowball packages. Here is my config: MacOS 10.5 R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions) I have installed all the needed packages (tm, rJava, rWeka, Snowball) + dependencies. I have deactivated AWT (as written in
2006 Mar 29
1
Using boolean terms in PHP bindings
OK, I'm indexing my data with scriptindex. I want to be able to restrict the search by the category field. Do I need to do anything to the data itself? Like, literally prefix it with the characters "XC"? Below is my indexer for scriptindex and my PHP code... document_id : field=ref unique=Q boolean=Q search_id : field=document_id index=S document_title : field=title
2009 Jun 05
2
Blacklist stemming
Hi, I need to modify the stemming for a couple of words (a blacklist) and have all the others use the usual Snowball stemmer. The "natural" way of doing it would be to derive from Stem and override operator ()... but I am using *python-bindings*. Would this be possible? If not I have two other solutions in mind: - add a custom stemmer to Xapian - write custom index & search
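One way to approach the blacklist idea above without subclassing Stem is to wrap the stemmer in a small callable that checks the blacklist first. The following is only a minimal sketch under stated assumptions: `toy_stem` is a hypothetical stand-in for a real stemmer (it is not Snowball and not part of Xapian), and `BlacklistStemmer` is a name invented for illustration. Since a Python `xapian.Stem` object is callable, it could presumably be passed in place of `toy_stem`, but that is not verified here.

```python
from typing import Callable, Iterable


def toy_stem(word: str) -> str:
    """Hypothetical stand-in for a real stemmer: strips a few common suffixes."""
    for suffix in ("ing", "ity", "s"):
        # Only strip when a reasonably long stem would remain.
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


class BlacklistStemmer:
    """Callable wrapper: blacklisted words pass through unstemmed."""

    def __init__(self, base: Callable[[str], str], blacklist: Iterable[str]) -> None:
        self._base = base
        # Lowercase once so lookups are case-insensitive.
        self._blacklist = frozenset(w.lower() for w in blacklist)

    def __call__(self, word: str) -> str:
        lowered = word.lower()
        if lowered in self._blacklist:
            # Blacklisted: keep the whole (lowercased) word as the term.
            return lowered
        # Everything else goes through the usual stemmer.
        return self._base(lowered)


# Assumed usage: the same wrapper instance (or an identically configured one)
# has to be applied both when indexing and when building queries.
stemmer = BlacklistStemmer(toy_stem, ["continuity"])
```

The wrapper only helps if the code that generates terms calls it directly; whether an indexing or query-parsing helper will accept such an object in place of a built-in stemmer depends on the bindings and is not assumed here.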
2013 Feb 19
2
Implementing tf-idf weighting scheme in Xapian
Hello guys. I just read up about tf-idf schemes and want to implement it in Xapian (with some frequently used normalizations), as it will also give me a good hang of implementing a weighting scheme before I start working on implementing DFR schemes. I read the following as references and I think I've understood it well and can write the hack :- 1.)
2013 Sep 02
2
having trouble with prefixes
I've got a small test database setup with one record. $ delve -r 1 -V /tmp/1/ Values for record #1: 0:DD4F2162FFFF0E43741A4A1C2B8EC0E7 1:./Text_page_scan_2.jpg 2:jpg 3:.jpg Term List for record #1: E:.jpg P:./Text_page_scan_2.jpg Q:DD4F2162FFFF0E43741A4A1C2B8EC0E7 T:jpg The terms were added with lines like this: doc.add_term(string("P:") + path); Problem is, I can't seem to
2016 Aug 09
1
Man page for idmap_rid
On 08/09/2016 03:18 PM, Rowland Penny via samba wrote: > On Tue, 9 Aug 2016 16:04:04 -0300 > francis picabia via samba <samba at lists.samba.org> wrote: > >> On Tue, Aug 9, 2016 at 3:35 PM, Rowland Penny via samba < >> samba at lists.samba.org> wrote: >> >>> On Tue, 9 Aug 2016 15:21:53 -0300 >>> francis picabia via samba <samba at
2007 Jan 09
2
non-snowball stemmer
Hi! I am going to use a non-Snowball Russian stemmer with Xapian. There is a good one at http://www.aot.ru. I've found that the current implementation of Xapian::Stem does not allow it (there is no public interface for Xapian::Stem::Internal). Do you apply patches? Are there any recommendations for writing patches? Regards, Oleg Obolenskiy highpower at mail.ru
2006 Aug 09
1
[LLVMdev] Re: llvm 1.8 release notes draft
On Tue, 08 Aug 2006 23:19:52 -0700, Bil wrote: > > * The following Unix system functionality has not been tested and may > not work: > > "functionalities have" > "functionality" seems more of a substance noun than a thing noun. More like "water" than like "cup". So I think the singular is appropriate here. -- hendrik
2008 Nov 05
1
Polish stemmer?
Hi, is there a Xapian stemmer suitable for the Polish or Czech languages? Thanks, Torsten