On Tue, Dec 14, 2004 at 12:26:41PM -0300, Georges Dupret wrote:
> I would like to use the lists of stopwords provided with Xapian. Is
> there some standard way to remove stopwords automatically, or should I
> implement it myself in the indexer?
There's the Xapian::Stopper class which Xapian::QueryParser uses. But
that's just there as a way of passing a stopping algorithm to the
QueryParser.
If you're stopping at index time it's simplest to just store the list in
a std::set (or if you're using Xapian from another language, whatever
the appropriate data structure is in that language - e.g. you'd use a
hash in Perl).
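As a rough sketch of the index-time approach in C++, something like the following would do. The helper names load_stopwords and remove_stopwords are made up for illustration and are not part of Xapian. It also assumes the stopword list is plain whitespace-separated words already in the same case as your terms, so check that against the list you actually use.

```cpp
#include <istream>
#include <set>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper (not a Xapian API): read whitespace-separated
// stopwords from any input stream into a std::set.
// Assumption: the list needs no further normalisation (case, comments).
std::set<std::string> load_stopwords(std::istream& in) {
    std::set<std::string> stopwords;
    std::string word;
    while (in >> word) {
        stopwords.insert(word);
    }
    return stopwords;
}

// Hypothetical helper (not a Xapian API): split text on whitespace and
// keep only the terms that are not in the stopword set.
std::vector<std::string> remove_stopwords(const std::string& text,
                                          const std::set<std::string>& stopwords) {
    std::vector<std::string> kept;
    std::istringstream tokens(text);
    std::string term;
    while (tokens >> term) {
        if (stopwords.find(term) == stopwords.end()) {
            kept.push_back(term);
        }
    }
    return kept;
}
```

You'd load the set once at indexer start-up (for instance from a std::ifstream opened on your stopword file), then run each document's text through the filter before handing the surviving terms to whatever indexing calls you already use.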
Cheers,
Olly