Displaying 20 results from an estimated 2000 matches similar to: "stemming an irregular forms?"
2013 Jan 24
1
Integrating a PaiceHusk stemmer into the library
Hey guys Hi :) I've implemented a PaiceHusk stemmer externally So what I
am doing right now is passing a pointer to my StemPaiceHusk class(which in
turn has been subclassed from Stemimplementation) to the
Stem::Stem(StemImplementation *p) constructor .So basically,I have to
include "paicehusk.h" in my indexer .However,I now want to make it a part
of the Xapian library so that I
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all,
I have some troubles using the stemming algorithm provided by the tm
(text mining) + Snowball packages.
Here is my config:
MacOS 10.5
R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions)
I have installed all the needed packages (tm, rJava, rWeka, Snowball)
+ dependencies. I have desactivated AWT (like written in
2009 Jun 05
2
Blacklist stemming
Hi,
I need to modify the stemming for a couple of words (a blacklist) and for
all the other to use the usual snowball stemmer.
The "natural" way of doing it would be to derive from Stem and override
operator ()... but I am using *python-bindings*. Would this be possible?
If not I have two other solutions in mind:
- add a custom stemmer to Xapian
- write custom index & search
2013 Jan 10
1
Add an example to the community page and contribute more code
Hi guys.I've finished an example indexer which acts like a grep replacement
for a file.It indexes each line containing a proper noun in a given text
file.The line containing the proper noun will be displayed upon searching
for that noun.I would like to add it to the community code examples.I'm
planning to write more examples which demonstrate some advanced features of
Xapian along similar
2010 Jul 13
1
Czech stemming
Hello,
I just find Xapian project when looking for some indexing engine in Ruby and
was quite impressed. Is there any change for Czech stemming? I found that it
is already written in Java as part of Lucene here:
http://svn.apache.org/viewvc/lucene/dev/trunk/modules/analysis/common/src/ja
va/org/apache/lucene/analysis/cz/CzechStemmer.java?view=markup
Sadly, I have no experience with C++, but
2008 Mar 12
1
how can i use stopwords?
Hi,
I do not understand the stopword function...
I've set the termgenerator like this:
$self->{'Stemmer'} = new Search::Xapian::Stem(german2);
$self->{'Stopper'} = new Search::Xapian::SimpleStopper();
$self->{'TermGenerator'} = new Search::Xapian::TermGenerator;
$self->{'TermGenerator'}->set_stemmer( $self->{'Stemmer'} );
2007 Jul 04
3
Stemming problem
Does anyone know if xapian stemming support suffix -er? I tried -s and -ing
both work, but not -er.
_________________________________________________________________
?????????????? MSN Messenger: http://messenger.msn.com/cn
2005 Oct 12
2
Stemmer Modifications
I'm using Xapian as a search back-end on a website. My client has
certain search terms that the stemmer does not stem in a way they would
like. For example "continuity" stems to "continu", which produces
undesirable results in their application. Is there a way to override the
stemming of certain words in a way that is compatible with the indexing
stemmer and the query
2015 Feb 10
3
Bitsize project - Krovetz stemmer
Hello Xapian devs,
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20150210/c848e9b7/attachment-0002.html>
2014 Nov 29
4
Adding Support for Krovetz Stemmer Algo in Xapian
Hello,
As mentioned on the project ideas page, Adding more support for stemmer
algorithm,
i found an implementation of Krovetz Stemmer Algo in C++ but before
working on it to merge it into xapian, i needed help in recognizing the
license information associated with the source code.
To avoid further licensing issues kindly someone check the link
2007 Jan 09
2
non-snowball stemmer
Hi!
I am going to use non-snowball russian stemmer with Xapian. There is a
good one at http://www.aot.ru. I've found that current implementation of
Xapian::Stem does not allow it (there is no public interface for
Xapian::Stem::Internal). Do you apply patches? Are there any
recommendations for writing patches?
Regards,
Oleg Obolenskiy
highpower at mail.ru
2007 Mar 29
1
stemtest failing with romanian
On Tuesday, I replaced the romanian1 and romanian2 stemmers in
Xapian-core with Martin's new romanian stemmer. At the time, I also
updated the stemming test data (by re-generating the output file using
snowball's "stemwords" utility), and I clearly remember re-running the
testsuite at the time and checking that all tests passed.
Now, when I run make check, stemtest fails
2011 Mar 24
2
Problem with Snowball & RWeka
Dear Forum,
when I try to use SnowballStemmer() I get the following error message:
"Could not initialize the GenericPropertiesCreator. This exception was
produced: java.lang.NullPointerException"
It seems to have something to do with either Snowball or RWeka, however I
can't figure out, what to do myself. If you could spend 5 minutes of your
valuable time, to help me or give me a
2008 Nov 05
1
Polish stemmer?
Hi,
is there a xapian stemmer suitable for polish or czech languages?
Thanks,
Torsten
2010 Feb 02
1
How to use a custom stemmer from Python bindings?
Hi,
I'm using Xapian bindings for Python in my project. How could I use a
custom stemmer instead of the included one (Snowball)? The one I'm
looking at right now is Hunspell (http://hunspell.sourceforge.net/)
which has Python bindings (http://code.google.com/p/pyhunspell/).
Thanks in advance,
Eugene
2011 Sep 23
2
understanding stemming and synonyms
I am working with version 1.2.7 and want to use stemming and synonyms.
I use the perl-bindings and get some problems.
First of all: the perl-bindings dont allow the QueryParser a third
argument when calling parse_query! So i cannot set a default prefix
(which perhaps is the solution to my problem, but later more)
i have a simple testcase:
3 documents, every document only has one word:
2014 Mar 31
2
Paice-Husk Stemmer
Hi everyone,
I was working on the Paice-Husk Stemmer, which is a Bite Size Project for
Xapian, and I have created a C++ as well as Snowball version of it.
I read the algorithm, and picked the rules from here:
http://www.comp.lancs.ac.uk/computing/research/stemming/paice/descript.htm
The C++ code takes rules as input from a file and generates the stem of
given word, whereas the Snowball version
2005 Jun 09
1
Query parser and stemming of norwegian letters
Hello, can I get an explanation of the following.
Running the following code:
....
pqp=new QueryParser();
Stem stem("norwegian");
cout << "DEBUG " << stem.stem_word(_sXapian)<< endl;
pqp->set_stemmer(stem);
pqp->set_database(*_pdatabase);
pqp->set_default_op(Query::OP_AND);
//Set the
2006 Aug 18
1
Portuguese Stemming
Today while compiling ferret I noticed there was a Portuguese stemmer
being compiled. How do I enable it''s use for my index?
Pedro.
2006 Mar 31
3
undefined method `<=>'' for :id:Symbol
Upgrading to 0.9.0, I have the following error. Anybody?
c:/ruby/lib/ruby/gems/1.8/gems/ferret-0.9.0/lib/ferret/index/term.rb:35:in
`<=>'': undefined method `<=>'' for :id:Symbol
(NoMethodError)
from
c:/ruby/lib/ruby/gems/1.8/gems/ferret-0.9.0/lib/ferret/index/term_infos_io.rb:263:in
`get_index_offset''
from