Emmanuel Engelhart
2010-Apr-05 17:13 UTC
[Xapian-discuss] Problem with stop words by indexing
Hi,
I am trying to remove stop words during indexing, and I am not using stemming.
I tried a simple example, but it does not work at all.
I have my writableDatabase and my termGenerator (indexer), and they work
well together: I can index texts and search through the database
correctly.
But if I add (before indexing my texts):
Xapian::SimpleStopper stopper;
stopper.add("testword");
indexer.set_stopper(&stopper);
... the result is exactly the same as before. I have checked with delve,
and "testword" is still indexed.
Am I using SimpleStopper correctly?
Regards
Emmanuel
On Mon, Apr 05, 2010 at 07:13:02PM +0200, Emmanuel Engelhart wrote:
> I try to remove stop words during the index process and I have no stemming.
> I have tried with a simple example but it does not work at all.
>
> I have my writableDatabase and my termGenerator (indexer) and they work
> well both together: I can index texts and search trough the database
> correctly.
>
> But if I add (before indexing my texts):
> Xapian::SimpleStopper stopper;
> stopper.add("testword");
> indexer.set_stopper(&stopper);
>
> ... the result is exactly the same as before. I have checked with delve
> and "testword" is indexed.

http://article.gmane.org/gmane.comp.search.xapian.general/7571

Looks like I failed to add that note to the API docs - now done.

This ought to be more configurable, as should some other things in
TermGenerator. I'm thinking we should look at how to improve
TermGenerator in 1.3.x.

Cheers,
Olly
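
Since the reply treats this as documented behaviour rather than a bug, one possible workaround is to drop the stop words from the text yourself before handing it to the indexer. The following is only a sketch under stated assumptions: strip_stop_words is a hypothetical helper (it is not part of Xapian), it splits on whitespace only, it does not strip punctuation attached to a word, and it compares in ASCII lowercase.

```cpp
#include <algorithm>
#include <cctype>
#include <set>
#include <sstream>
#include <string>

// Hypothetical helper (not part of Xapian): returns `text` with every
// whitespace-separated word that appears in `stop_words` removed.
// The comparison is ASCII case-insensitive, so `stop_words` is assumed
// to contain lowercase entries only.
std::string strip_stop_words(const std::string& text,
                             const std::set<std::string>& stop_words) {
    std::istringstream in(text);
    std::string word;
    std::string out;
    while (in >> word) {
        // Lowercase a copy for the lookup; keep the original word for output.
        std::string lower = word;
        std::transform(lower.begin(), lower.end(), lower.begin(),
                       [](unsigned char c) {
                           return static_cast<char>(std::tolower(c));
                       });
        if (stop_words.count(lower) != 0) {
            continue;  // skip stop words entirely
        }
        if (!out.empty()) {
            out += ' ';
        }
        out += word;
    }
    return out;
}
```

With the indexer from the question, this could be used as indexer.index_text(strip_stop_words(text, {"testword"})). Note that removing words before indexing also shifts the positions of the remaining terms, which may affect phrase searches.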