similar to: Adding Support for Krovetz Stemmer Algo in Xapian

Displaying 20 results from an estimated 900 matches similar to: "Adding Support for Krovetz Stemmer Algo in Xapian"

2014 Dec 01
2
Adding Support for Krovetz Stemmer Algo in Xapian
> On 30 Nov 2014, at 17:51, Abhishek Singh Kushwah <abhishek18kushwah at gmail.com> wrote: > > > Two of the implementation of algorithms has already been rejected > > previously due to licenses both being the implementation of porter > > but our xapian use implementation in snowball which i assume is > > under GPL. The only cases I can think you might be
2015 Feb 10
3
Bitsize project - Krovetz stemmer
Hello Xapian devs, -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20150210/c848e9b7/attachment-0002.html>
2015 Feb 15
3
Bitsize project: Krovetz Stemmer
Hello xapian devs, I had shown interest in writing a krovetz stemmer for xapian and spoke to James Aylett about it. Since it was hard to code the stemmer in snowball, I came up with a C++ implementation of the stemmer. But since it is a dictionary based stemmer, im having problems on deciding how to create the dictionary. I did check out some of the implementations of the Krovetz stemmer online
2007 Jan 09
2
non-snowball stemmer
Hi! I am going to use non-snowball russian stemmer with Xapian. There is a good one at http://www.aot.ru. I've found that current implementation of Xapian::Stem does not allow it (there is no public interface for Xapian::Stem::Internal). Do you apply patches? Are there any recommendations for writing patches? Regards, Oleg Obolenskiy highpower at mail.ru
2008 Nov 05
1
Polish stemmer?
Hi, is there a xapian stemmer suitable for polish or czech languages? Thanks, Torsten
2005 Oct 12
2
Stemmer Modifications
I'm using Xapian as a search back-end on a website. My client has certain search terms that the stemmer does not stem in a way they would like. For example "continuity" stems to "continu", which produces undesirable results in their application. Is there a way to override the stemming of certain words in a way that is compatible with the indexing stemmer and the query
2013 Jan 24
1
Integrating a PaiceHusk stemmer into the library
Hey guys Hi :) I've implemented a PaiceHusk stemmer externally So what I am doing right now is passing a pointer to my StemPaiceHusk class(which in turn has been subclassed from Stemimplementation) to the Stem::Stem(StemImplementation *p) constructor .So basically,I have to include "paicehusk.h" in my indexer .However,I now want to make it a part of the Xapian library so that I
2010 Feb 02
1
How to use a custom stemmer from Python bindings?
Hi, I'm using Xapian bindings for Python in my project. How could I use a custom stemmer instead of the included one (Snowball)? The one I'm looking at right now is Hunspell (http://hunspell.sourceforge.net/) which has Python bindings (http://code.google.com/p/pyhunspell/). Thanks in advance, Eugene
2011 Mar 24
2
Problem with Snowball & RWeka
Dear Forum, when I try to use SnowballStemmer() I get the following error message: "Could not initialize the GenericPropertiesCreator. This exception was produced: java.lang.NullPointerException" It seems to have something to do with either Snowball or RWeka, however I can't figure out, what to do myself. If you could spend 5 minutes of your valuable time, to help me or give me a
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all, I have some troubles using the stemming algorithm provided by the tm (text mining) + Snowball packages. Here is my config: MacOS 10.5 R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions) I have installed all the needed packages (tm, rJava, rWeka, Snowball) + dependencies. I have desactivated AWT (like written in
2010 Jul 13
1
Czech stemming
Hello, I just find Xapian project when looking for some indexing engine in Ruby and was quite impressed. Is there any change for Czech stemming? I found that it is already written in Java as part of Lucene here: http://svn.apache.org/viewvc/lucene/dev/trunk/modules/analysis/common/src/ja va/org/apache/lucene/analysis/cz/CzechStemmer.java?view=markup Sadly, I have no experience with C++, but
2014 Mar 31
2
Paice-Husk Stemmer
Hi everyone, I was working on the Paice-Husk Stemmer, which is a Bite Size Project for Xapian, and I have created a C++ as well as Snowball version of it. I read the algorithm, and picked the rules from here: http://www.comp.lancs.ac.uk/computing/research/stemming/paice/descript.htm The C++ code takes rules as input from a file and generates the stem of given word, whereas the Snowball version
2007 Mar 29
1
stemtest failing with romanian
On Tuesday, I replaced the romanian1 and romanian2 stemmers in Xapian-core with Martin's new romanian stemmer. At the time, I also updated the stemming test data (by re-generating the output file using snowball's "stemwords" utility), and I clearly remember re-running the testsuite at the time and checking that all tests passed. Now, when I run make check, stemtest fails
2015 Jan 03
3
Xapian-discuss Digest, Vol 127, Issue 1
Hey Richhiey, Most probably Xapian is used with CYGWIN in Windows and Windows Specific Code in Xapian is based on CYGWIN, However we would be able to help you out with this issue, if you could pastebin whole 'gnu-make' generated report. Regards, Abhishek On Sat, Jan 3, 2015 at 5:30 PM, <xapian-discuss-request at lists.xapian.org> wrote: > Send Xapian-discuss mailing list
2009 Jun 05
2
Blacklist stemming
Hi, I need to modify the stemming for a couple of words (a blacklist) and for all the other to use the usual snowball stemmer. The "natural" way of doing it would be to derive from Stem and override operator ()... but I am using *python-bindings*. Would this be possible? If not I have two other solutions in mind: - add a custom stemmer to Xapian - write custom index & search
2010 Apr 11
1
A Hebrew stemmer based on libhspell
Hello. I'm interested in creating a Hebrew stemmer to use with Xapian. Hebrew is a complicated language to stem, as it uses the semitic "root" system, rather than prefixes and suffixes, and has many irregularities in accidence (morphology). Fortunately, two bright fellows from the Technion University in Israel have already created a Hebrew morphological analyzer as part of their
2011 Oct 14
1
stemming an irregular forms?
Dear All, I could not find the irregular forms table in xapian. Please, could you tell me how to define/add words to the irregular forms table in xapian? Thank you a lot. Sascha
2007 Jul 04
3
Stemming problem
Does anyone know if xapian stemming support suffix -er? I tried -s and -ing both work, but not -er. _________________________________________________________________ ?????????????? MSN Messenger: http://messenger.msn.com/cn
2005 Dec 09
1
QueryParser and utf-8 strings
Hi all, I am rather new to xapian, I just recently tried to include it in my application, so bear with me if this has already been discussed. I was playing with QueryParser and noticed that it expects input to be in ISO8859_1 encoding - characters above 0x80 are transliterated, and are not considered letters. For example, using single word (in utf-8 encoding) "bo?e" as input for
2010 Mar 13
1
Stemmer in Xapian, some questions from student.
Hello, I am student from University of Science and Technology in Cracow. I would like to ask you some questions about Xapian, because it is one of few search engines I am testing. 1. What type of stemmer is used in Xapian? Are You able to write some important features applied to Xapian? 2. We must use stemming for polish language. I read on Your webpage what languages include Xapian. There is no