Displaying 1 result from an estimated 1 matches for "textprocessor".
2007 Mar 28
2
Moving indextext.cc into core.
...nt supplied to it.
[Actually, I'm not sure that "text splitter" is the right name for what
the code in indextext.cc does - it doesn't just split text, but also
does stemming, creates "R" terms, and possibly a few other things I've
missed. I'd call it a "TextProcessor" class, but someone else might
have a better name.]
A cleaner separation and code organisation, to my mind, would be to make
a new intermediate library which sits on top of Xapian, and provides
language specific processing features. The stemming algorithm stuff
would also be moved into t...