Emmanuel Engelhart
2010-Apr-05 17:13 UTC
[Xapian-discuss] Problem with stop words by indexing
Hi,

I try to remove stop words during the index process and I have no stemming. I have tried with a simple example but it does not work at all.

I have my writableDatabase and my termGenerator (indexer) and they work well together: I can index texts and search through the database correctly.

But if I add (before indexing my texts):

Xapian::SimpleStopper stopper;
stopper.add("testword");
indexer.set_stopper(&stopper);

... the result is exactly the same as before. I have checked with delve and "testword" is indexed.

Am I using the SimpleStopper the right way?

Regards
Emmanuel
On Mon, Apr 05, 2010 at 07:13:02PM +0200, Emmanuel Engelhart wrote:
> I try to remove stop words during the index process and I have no stemming.
> I have tried with a simple example but it does not work at all.
>
> I have my writableDatabase and my termGenerator (indexer) and they work
> well together: I can index texts and search through the database
> correctly.
>
> But if I add (before indexing my texts):
> Xapian::SimpleStopper stopper;
> stopper.add("testword");
> indexer.set_stopper(&stopper);
>
> ... the result is exactly the same as before. I have checked with delve
> and "testword" is indexed.

http://article.gmane.org/gmane.comp.search.xapian.general/7571

Looks like I failed to add that note to the API docs - now done.

This ought to be more configurable, as should some other things in TermGenerator. I'm thinking we should look at how to improve TermGenerator in 1.3.x.

Cheers,
Olly
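
For context, the Xapian API documentation for TermGenerator::set_stopper() notes that stemmed forms of stopwords aren't indexed, but unstemmed forms still are, so that phrase searches including stop words keep working. With no stemmer set, the stopper therefore has no visible effect. If the goal is to keep a word out of the index entirely, one possible workaround is to filter the text before handing it to index_text(). The sketch below is only an illustration under that assumption: strip_stop_words() is a hypothetical helper, not part of Xapian, and it splits on ASCII alphanumerics only, which is cruder than TermGenerator's own word splitting.

```cpp
#include <cctype>
#include <set>
#include <string>

// Hypothetical helper (NOT part of Xapian): returns a copy of `text` with
// every word listed in `stop_words` removed. A "word" here is a maximal run
// of ASCII alphanumeric characters; matching is case-insensitive, so
// `stop_words` is expected to hold lowercase entries. Separators
// (spaces, punctuation, non-ASCII bytes) are copied through unchanged.
std::string strip_stop_words(const std::string& text,
                             const std::set<std::string>& stop_words) {
    std::string out;
    std::string word;

    // Emit the pending word unless its lowercase form is a stop word.
    auto flush = [&]() {
        if (word.empty()) return;
        std::string lower;
        for (unsigned char c : word) {
            lower += static_cast<char>(std::tolower(c));
        }
        if (stop_words.count(lower) == 0) out += word;
        word.clear();
    };

    for (unsigned char c : text) {
        if (std::isalnum(c)) {
            word += static_cast<char>(c);
        } else {
            flush();
            out += static_cast<char>(c);
        }
    }
    flush();
    return out;
}
```

The filtered string would then be passed to indexer.index_text() in place of the original text. Note that dropping words this way also shifts the positions of the words that follow, which can affect phrase and proximity searches. Later Xapian releases (1.3.1 onwards, as far as I know) added TermGenerator::set_stopper_strategy(), where the STOP_ALL strategy is the more direct route if your version has it.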