David Gessel
2020-May-19 13:15 UTC
FTS-lucene errors : language not available for stemming
I'm getting some log errors with clucene that I am having no luck tracking down on the interwebs. Errors: May 19 05:05:16 indexer-worker(gessel at blackrosetech.com)<62971><aPAEI3zLw17A/QAA0J78UA:EF25M3zLw1779QAA0J78UA>: Error: lucene index /mail/blackrosetech.com/gessel//lucene-indexes: IndexWriter::addDocument() failed (#4): language not available for stemming May 19 05:05:16 indexer-worker: Error: May 19 05:05:16 indexer-worker(gessel at blackrosetech.com)<62971><aPAEI3zLw17A/QAA0J78UA:EF25M3zLw1779QAA0J78UA>: Error: Mailbox Security: Mail search failed: Internal error occurred. Refer to server log for more information. [2020-05-19 05:05:16] May 19 05:05:16 indexer-worker(gessel at blackrosetech.com)<62971><aPAEI3zLw17A/QAA0J78UA:EF25M3zLw1779QAA0J78UA>: Error: Mailbox Security: Transaction commit failed: FTS transaction commit failed: transaction context (attempted to index 1 messages (UIDs 152736..152736)) Config: FreeBSD 11.3-RELEASE-p8 #0 r360490 dovecot-2.3.10_3 clucene-2.3.3.4_19 py37-pystemmer-2.0.0.1 py37-snowballstemmer-1.2.1 icu-67.1,1 plugin { ? #setting_name = value ??????? expire = Trash ??????? mail_log_events = delete undelete expunge copy mailbox_delete mailbox_rename ??????? mail_log_fields = uid box msgid size ??????? fts_autoindex=yes ??????? #zlib_save_level = 6 # 1..9 ??????? #zlib_save = gz # or bz2 } plugin { ? fts = lucene ? # Lucene-specific settings, good ones are: ? fts_lucene = whitespace_chars=@. mime_parts } I am considering switch to xapian (solr and java... pls noe) as the port is quite tempting from an ease of integration perspective, but the easiest solution would be to resolve these odd indexing errors.? Anyone have a clue? -David
Jan Bramkamp
2020-May-19 13:40 UTC
FTS-lucene errors : language not available for stemming
On 19.05.20 15:15, David Gessel wrote:> I'm getting some log errors with clucene that I am having no luck > tracking down on the interwebs. > > > Errors: > > May 19 05:05:16 > indexer-worker(gessel at blackrosetech.com)<62971><aPAEI3zLw17A/QAA0J78UA:EF25M3zLw1779QAA0J78UA>: > Error: lucene index /mail/blackrosetech.com/gessel//lucene-indexes: > IndexWriter::addDocument() failed (#4): language not available for > stemming > May 19 05:05:16 indexer-worker: Error: > May 19 05:05:16 > indexer-worker(gessel at blackrosetech.com)<62971><aPAEI3zLw17A/QAA0J78UA:EF25M3zLw1779QAA0J78UA>: > Error: Mailbox Security: Mail search failed: Internal error occurred. > Refer to server log for more information. [2020-05-19 05:05:16] > May 19 05:05:16 > indexer-worker(gessel at blackrosetech.com)<62971><aPAEI3zLw17A/QAA0J78UA:EF25M3zLw1779QAA0J78UA>: > Error: Mailbox Security: Transaction commit failed: FTS transaction > commit failed: transaction context (attempted to index 1 messages > (UIDs 152736..152736)) > > > Config: > > FreeBSD 11.3-RELEASE-p8 #0 r360490 > > dovecot-2.3.10_3 > > clucene-2.3.3.4_19 > > py37-pystemmer-2.0.0.1 > > py37-snowballstemmer-1.2.1 > > icu-67.1,1 > > plugin { > ? #setting_name = value > ??????? expire = Trash > ??????? mail_log_events = delete undelete expunge copy mailbox_delete > mailbox_rename > ??????? mail_log_fields = uid box msgid size > ??????? fts_autoindex=yes > ??????? #zlib_save_level = 6 # 1..9 > ??????? #zlib_save = gz # or bz2 > } > > plugin { > ? fts = lucene > ? # Lucene-specific settings, good ones are: > ? fts_lucene = whitespace_chars=@. mime_parts > } > > I am considering switch to xapian (solr and java... pls noe) as the > port is quite tempting from an ease of integration perspective, but > the easiest solution would be to resolve these odd indexing errors.? > Anyone have a clue? >I ran into the same problem a few weeks back. The workaround I found was to add no_snowball to fts_lucene. It disables the snowball algorithm.
Stuart Henderson
2020-May-19 13:48 UTC
FTS-lucene errors : language not available for stemming
On 2020-05-19, David Gessel <gessel at blackrosetech.com> wrote:> I'm getting some log errors with clucene that I am having no luck tracking down on the interwebs.This looks relevant: https://www.mail-archive.com/dovecot at dovecot.org/msg66366.html> I am considering switch to xapian (solr and java... pls noe) as the > port is quite tempting from an ease of integration perspective, but the > easiest solution would be to resolve these odd indexing errors.? Anyone > have a clue?dovecot-fts-xapian is easy to configure, but has a big downside compared to solr in that the indexer runs as root.
<!doctype html> <html> <head> <meta charset="UTF-8"> </head> <body> <div> <br> </div> <blockquote type="cite"> <div> On 19/05/2020 16:48 Stuart Henderson <<a href="mailto:stu@spacehopper.org">stu@spacehopper.org</a>> wrote: </div> <div> <br> </div> <div> <br> </div> <div> On 2020-05-19, David Gessel <<a href="mailto:gessel@blackrosetech.com">gessel@blackrosetech.com</a>> wrote: </div> <blockquote type="cite"> <div> I'm getting some log errors with clucene that I am having no luck tracking down on the interwebs. </div> </blockquote> <div> This looks relevant: </div> <div> <br> </div> <div> <a href="https://www.mail-archive.com/dovecot@dovecot.org/msg66366.html" rel="noopener" target="_blank">https://www.mail-archive.com/dovecot@dovecot.org/msg66366.html</a> </div> <div> <br> </div> <blockquote type="cite"> <div> I am considering switch to xapian (solr and java... pls noe) as the </div> <div> port is quite tempting from an ease of integration perspective, but the </div> <div> easiest solution would be to resolve these odd indexing errors. Anyone </div> <div> have a clue? </div> </blockquote> <div> dovecot-fts-xapian is easy to configure, but has a big downside compared </div> <div> to solr in that the indexer runs as root. </div> </blockquote> <div> Dovecot indexer does not run as root. </div> <div class="io-ox-signature"> <pre>--- Aki Tuomi</pre> </div> </body> </html>
David Gessel
2020-May-20 12:06 UTC
FTS-lucene errors : language not available for stemming
On 2020-05-19 16:48, Stuart Henderson wrote:> On 2020-05-19, David Gessel <gessel at blackrosetech.com> wrote: >> I'm getting some log errors with clucene that I am having no luck tracking down on the interwebs. > This looks relevant: > > https://www.mail-archive.com/dovecot at dovecot.org/msg66366.htmlThanks Stuart & Jan - no_snowball seems to have cleared up the errors. relevant? config now reads: plugin { ? fts = lucene ? # Lucene-specific settings, good ones are: ? fts_lucene = whitespace_chars=@. mime_parts no_snowball } May 20 04:40:50 indexer-worker(gessel at blackrosetech.com)<26130><rw0HDUIXxV6KBwEA0J78UA:4CgkD0IXxV4SZgAA0J78UA>: Error: lucene index /mail/blackrosetech.com/gessel//lucene-indexes: IndexWriter::addDocument() failed (#4): language not available for stemming May 20 04:40:50 indexer-worker: Error: May 20 04:40:50 indexer-worker(gessel at blackrosetech.com)<26130><rw0HDUIXxV6KBwEA0J78UA:4CgkD0IXxV4SZgAA0J78UA>: Error: Mailbox Lists.Spamassassin: Mail search failed: Internal error occurred. Refer to server log for more information. [2020-05-20 04:40:50] May 20 04:40:50 indexer-worker(gessel at blackrosetech.com)<26130><rw0HDUIXxV6KBwEA0J78UA:4CgkD0IXxV4SZgAA0J78UA>: Error: Mailbox Lists.Spamassassin: Transaction commit failed: FTS transaction commit failed: transaction context (attempted to index 2 messages (UIDs 7..8)) May 20 04:45:05 master: Warning: Killed with signal 15 (by pid=81740 uid=0 code=kill) May 20 04:46:39 indexer-worker(gessel at blackrosetech.com)<87087><5jtvLp8YxV4tVAEA0J78UA:NexHM58YxV4vVAEA0J78UA>: Warning: fts-lucene: Settings have changed, rebuilding index for mailbox (no further errors, various mailboxes being indexed.)>> I am considering switch to xapian (solr and java... pls noe) as the >> port is quite tempting from an ease of integration perspective, but the >> easiest solution would be to resolve these odd indexing errors.? Anyone >> have a clue? > dovecot-fts-xapian is easy to configure, but has a big downside compared > to solr in that the indexer runs as root.-David
Hello Indexer does not run as root It runs as "mail_uid = xxx" (based on your config)> dovecot-fts-xapian is easy to configure, but has a big downside compared > to solr in that the indexer runs as root.-------------- next part -------------- An HTML attachment was scrubbed... URL: <https://dovecot.org/pipermail/dovecot/attachments/20200521/824cc1bd/attachment.html>
Apparently Analagous Threads
- FTS-lucene errors : language not available for stemming
- FTS-lucene errors : language not available for stemming
- Dovecot 2.2.16: disappearing messages, mismatched summaries, duplicated messages, excessive full re-downloads
- Dovecot 2.2.16: disappearing messages, mismatched summaries, duplicated messages, excessive full re-downloads
- Sieve permissions issue following update