similar to: python how do i stem words in python?

Displaying 20 results from an estimated 400 matches similar to: "python how do i stem words in python?"

2009 Jun 05
2
Blacklist stemming
Hi, I need to modify the stemming for a couple of words (a blacklist) and use the usual Snowball stemmer for all the others. The "natural" way of doing it would be to derive from Stem and override operator ()... but I am using *python-bindings*. Would this be possible? If not, I have two other solutions in mind: - add a custom stemmer to Xapian - write custom index & search
2006 Apr 20
1
Odd stemmer behavior
I've noticed some strange results from the stemmer in the Ruby port: irb(main):003:0> @stem.stem_word("anybody") => "anybodi" irb(main):004:0> @stem.stem_word("swimmingly") => "swim" irb(main):005:0> @stem.stem_word("fiercely") => "fierc" irb(main):006:0> @stem.stem_word("fraudulently") =>
2010 Apr 11
1
A Hebrew stemmer based on libhspell
Hello. I'm interested in creating a Hebrew stemmer to use with Xapian. Hebrew is a complicated language to stem, as it uses the semitic "root" system, rather than prefixes and suffixes, and has many irregularities in accidence (morphology). Fortunately, two bright fellows from the Technion University in Israel have already created a Hebrew morphological analyzer as part of their
2010 Feb 02
1
How to use a custom stemmer from Python bindings?
Hi, I'm using Xapian bindings for Python in my project. How could I use a custom stemmer instead of the included one (Snowball)? The one I'm looking at right now is Hunspell (http://hunspell.sourceforge.net/) which has Python bindings (http://code.google.com/p/pyhunspell/). Thanks in advance, Eugene
2005 Jun 09
1
Query parser and stemming of norwegian letters
Hello, can I get an explanation of the following. Running the following code: .... pqp=new QueryParser(); Stem stem("norwegian"); cout << "DEBUG " << stem.stem_word(_sXapian)<< endl; pqp->set_stemmer(stem); pqp->set_database(*_pdatabase); pqp->set_default_op(Query::OP_AND); //Set the
2008 Sep 16
1
Some Questions From the beginner of Xapian
Dear guys, I am a beginner with Xapian, and while reading the documentation I encountered the following questions. (1) I see that Xapian::Document has a method void add_value (Xapian::valueno valueno, const std::string &value) What's the purpose of this method? A Document is related to its terms, but what's the purpose of this? (2) The add_posting method will add a term to a document. void
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all, I have some trouble using the stemming algorithm provided by the tm (text mining) + Snowball packages. Here is my config: MacOS 10.5 R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions) I have installed all the needed packages (tm, rJava, rWeka, Snowball) + dependencies. I have deactivated AWT (like written in
2007 Apr 17
3
Deprecation
Yesterday I updated the deprecation.rst document such that I think it contains all the items which are slated for deprecation (and also contains full details of the upgrade suggestions, as Olly suggested). I've marked several things which have been deprecated for a long period of time (ie, since 0.9.0 or earlier) for removal in 1.0.0, in line with the policy outlined in the document. If
2006 May 15
1
term / posting question
Hi guys Sorry to take up your time with this, I have just been stuck on a little problem with xapian for a few days here and I can't seem to figure it out for myself. I have created an xapian index (using the php bindings). I have added documents to it, with values, terms and postings. I can successfully search in this index on anything that is in a posting, but if I search on a word that
2015 Jan 20
2
Question on "single writer, multiple reader"
On Sun, Jan 18, 2015 at 04:25:29PM +0000, James Aylett wrote: > That's exactly how it's supposed to work. "Eventually" (once the > writer gets sufficiently far ahead of the reader), the reader will get > a DatabaseModifiedError and will have to re-open the database, but > until then it's up to it when it does so. You may wish to do it every > N requests, or every K seconds, or only
2008 Sep 27
3
Query::MatchAll
Why is there still a rank when using Query::MatchAll()?
2006 May 17
3
QueryParser lowercase / uppercase and stemming
Hello. There are several problems for which I couldn't find a solution. 1. QueryParser does not perform stemming I am working with PHP5 and use the xapian wrapper written by Daniel Ménard. I build a query using parseQuery. Output of the parsed query shows that terms are not stemmed, although a stemmer is set (see code snippet) # create a XapianDatabase object to search in $db = new
2006 May 31
1
Phrase Query vs AND Query? Why don't these find the same things?
PHRASE QUERY Xapian::Query((MBOX:12345678-1234-1234-1234-1234567890ab AND (LP:backup:(pos=1) PHRASE 6 LP:c::(pos=2) PHRASE 6 LP:program files:(pos=3) PHRASE 6 LP:Mozilla Firefox:(pos=4) PHRASE 6 LP:res:(pos=5) PHRASE 6 LP: table-add-column-after-hover.gif:(pos=6)))) AND QUERY Xapian::Query((MBOX:12345678-1234-1234-1234-1234567890ab AND LP:backup:(pos=1) AND LP:c::(pos=2) AND LP:program
2015 Jan 25
2
TestCases Failure
Hi, I am a student in Pakistan willing to contribute to Xapian. I have been able to successfully build the code. But when I run the existing testcases (to verify everything is working fine), it says 3 out of the six testcases fail. Here is the report of the test cases. "... make check-TESTS make[5]: Entering directory `/home/saad/GitProjects/xapian/build/xapian-core/tests' Running
2015 Sep 11
0
xapian-core-1.0.23 compiled on Interix / Eric Lindblad
Report by Eric Lindblad 11-09-2015 http://www.ericlindblad.blogspot.com I compiled xapian-core-1.0.23 today on Interix. 'gmake check' summary PASS: btreetest, quartztest, stemtest, termgentest FAIL: apitest, internaltest, queryparsertest MS XP SP3 HOME modified SFU Interix 3.5 w/ getopt.h, inttypes.h, & stdint.h copied over from SUA Interix for Vista (and newer) into
2015 Sep 11
1
xapian 1.0.x compiled on Interix / Eric Lindblad
Report by Eric Lindblad 11-09-2015 http://www.ericlindblad.blogspot.com I compiled xapian-core-1.0.23 today on Interix. 'gmake check' summary PASS: btreetest, quartztest, stemtest, termgentest FAIL: apitest, internaltest, queryparsertest MS XP SP3 HOME modified SFU Interix 3.5 w/ getopt.h, inttypes.h, & stdint.h copied over from SUA Interix for Vista (and newer) into
2011 Sep 23
2
understanding stemming and synonyms
I am working with version 1.2.7 and want to use stemming and synonyms. I use the perl-bindings and have run into some problems. First of all: the perl-bindings don't allow the QueryParser a third argument when calling parse_query! So I cannot set a default prefix (which perhaps is the solution to my problem, but more on that later). I have a simple testcase: 3 documents, every document only has one word:
2006 Aug 18
1
Portuguese Stemming
Today while compiling ferret I noticed there was a Portuguese stemmer being compiled. How do I enable its use for my index? Pedro.
2006 Dec 16
1
Changing weights per field
I'm not entirely sure if the following is possible with xapian (didn't see much in the way of documentation that says it is). I have four fields which reference a PK in our DB that I'd like to index (in most weighted to least weighted order): full name, e-mail, title and location. I see how I can use add_document() to add a concatenated string of the 4 fields, but how would I
2014 Sep 05
0
stemmer or no-stemmer..
Hi, In the documentation of FTS lucene, I need little more clarification about this section: -------------------------------------------------- --with-lucene --with-stemmer The second switch is only required if you have compiled libstemmer yourself or if it's included in the CLucene you are using. -------------------------------------------------- I am totally unsure but as I understand,