Hi, I'm currently looking over the FTS pages to enable it in my dovecot. But I'm unsure what the best settings of the lucene plugin are, if you receive german and english mails. Wiki says: textcat_conf=<path> textcat_dir=<path>: If specified, enable guessing the stemming language for emails and search keywords. This is a little bit problematic in practice, since indexing and searching languages may differ and may not find even exact words because they stem differently. On Debian libstemmer is included in the debian-lucene package. So what settings are the best to have not the problem that exact words can't be found?
On 15 Jun 2015, at 21:22, Felix Zielcke <fzielcke at z-51.de> wrote:> > Hi, > > I'm currently looking over the FTS pages to enable it in my dovecot. > But I'm unsure what the best settings of the lucene plugin are, if you > receive german and english mails. > Wiki says: > > textcat_conf=<path> textcat_dir=<path>: If specified, enable guessing > the stemming language for emails and search keywords. This is a little > bit problematic in practice, since indexing and searching languages may > differ and may not find even exact words because they stem differently. > > On Debian libstemmer is included in the debian-lucene package. > > So what settings are the best to have not the problem that exact words > can't be found?The textcat support in fts-lucene works very badly and shouldn't be used. There's new lib-fts code being developed that supports multiple languages better. It's already kind of usable in v2.2.18, but would be better to wait for v2.2.19.
Am Mittwoch, den 17.06.2015, 16:43 +0300 schrieb Timo Sirainen:> > The textcat support in fts-lucene works very badly and shouldn't be > used. There's new lib-fts code being developed that supports multiple > languages better. It's already kind of usable in v2.2.18, but would > be better to wait for v2.2.19.Thanks for your answer. Then as soon as 2.2.19 has been uploaded to unstable I'll do my own backport package. Instead of waiting to be officially avaible