Displaying 20 results from an estimated 42 matches for "fts_tika".
2020 Nov 15
2
[patch] enhancement for tika server protected by user/password basic auth
...cted by ip restrictions via a proxy in front of it.
I've configured a tika server behind an apache proxy which enforces
basic auth, but sending basic auth credentials for a tika server is not
currently supported by Dovecot.
The following patch allows to have user and password specified in the
fts_tika url in much the same way you can for fts_solr.
fts_tika = https://user:password at tika_host/tika
John
--- dovecot-2.3.11.3-orig/src/plugins/fts/fts-parser-tika.c????
2020-08-12 14:20:41.000000000 +0200
+++ dovecot-2.3.11.3/src/plugins/fts/fts-parser-tika.c? 2020-11-15
15:18:24.351281064 +0100
@...
2020 Nov 15
2
[patch] enhancement for tika server protected by user/password basic auth
...ed up.
> (ya-request for a proper @dovecot public bug/issue queue!)
>
> have you found any other 'magic required' to get solr & tika indexing
> text/attachments, respectively, in Dovecot context?
> is it as straightforward as spec'ing the 'fts_solr' & 'fts_tika' urls,
> and Dovecot does the passing-around correctly?
I've just started using tika myself, but from my tests, it's as simple
as adding fts_tika to a working solr integration.
John
2016 Jun 27
2
fts_solr crashs
Hi,
I?ve set up in dovecot 2.2.24-1~auto+49 (from dovecot repo) fts_solr and
fts_tika - jetti8 (from Debian Jessie) and latest tika-server running on
a seperate machine. But if I want to rescan all messages for reindexing
for instance all attachments with "doveadm -v index -u user at domain.tld
INBOX" with 3137 mail in the INBOX it counts and then by 2900 mails the
do...
2014 Nov 04
0
error using fts/tika
Hi,
I played around a bit and tried to get tika to run with dovecot. In the end I was at least a bit successful.
However, when I tried to index my inbox with "doveadm index -A '*'"
I get:
doveadm(infoomatic): Error: fts_tika: PUT http://localhost:8081/tika failed: 500 Server Error
doveadm(infoomatic): Warning: I/O leak: 0x7f4f697bb170 (line 127, fd 24)
doveadm(infoomatic): Panic: file ioloop-iolist.c: line 22: unreached
doveadm(infoomatic): Error: Raw backtrace: /usr/lib/dovecot/libdovecot.so.0(+0x67f30) [0x7f4f697e8f3...
2015 Jun 16
3
bug in indexer/indexer-worker
...I have already mentioned this in http://www.dovecot.org/pipermail/dovecot/2014-November/098592.html
I could reproduce the errors above in a self-compiled v2.2.18 and the prebuilt packages from xi.rename-it.nl (in addition to version 2.2.15 mentioned in the link)
The problem occurs when enabling fts_tika in the plugins (tried tika 1.6, 1.7 and 1.8). I tried to move a folder of my mailbox with about 2000 mails to my server (no users, modern hardware).
At some point I get an error and from this time on dovecot keeps repeating the last lines with every mail that comes in ... see [1]
I then get kernel...
2020 Nov 15
0
[patch] enhancement for tika server protected by user/password basic auth
...request for a proper @dovecot public bug/issue queue!)
>>
>> have you found any other 'magic required' to get solr & tika indexing
>> text/attachments, respectively, in Dovecot context?
>> is it as straightforward as spec'ing the 'fts_solr' & 'fts_tika' urls,
>> and Dovecot does the passing-around correctly?
> I've just started using tika myself, but from my tests, it's as simple
> as adding fts_tika to a working solr integration.
>
> John
>
>
Just a couple of updates about Tika and Solr together.
1. On mass r...
2020 Mar 06
1
Problem with tika
...ds of documents, sometimes after a few.
Usually after a few hundred.
It appears there are less errors using http than https.
Relevant config:
OS: CentOS6, fully updated
plugin {
? fts = solr
? batch_size = 1
? fts_solr =
url=https://username:password at solr-01.vevida.net:443/solr/dovecot/
? #fts_tika = https://solr-01.vevida.net:443/tika/
? batch_size = 1000
? fts_autoindex=yes
? soft_commit=no
}
# dovecot --version
2.3.9.3 (9f41b88fa)
# Configure options:
??? --docdir=%{_docdir}/dovecot? \
??? --disable-static???????????? \
??? --with-nss?????????????????? \
??? --with-shadow???????????????...
2016 Nov 03
1
Forcibly terminated after 10 milliseconds
...ent mailbox date index ihave duplicate mime foreverypart extracttext vnd.dovecot.pipe vnd.dovecot.execute
mbox_write_locks = fcntl
passdb {
args = failure_show_msg=yes dovecot
driver = pam
}
plugin {
fts = solr
fts_autoindex = yes
fts_solr = url=http://localhost:4949/solr/dovecot/
fts_tika = http://localhost:9998/tika
sieve = ~/.dovecot.sieve
sieve_dir = ~/sieve
sieve_execute_bin_dir = /usr/local/lib/dovecot/sieve-execute
sieve_extensions = +vnd.dovecot.pipe +vnd.dovecot.execute
sieve_pipe_bin_dir = /usr/local/lib/dovecot/sieve-pipe
sieve_plugins = sieve_extprograms...
2015 Mar 12
1
indexer-worker panics with latest mercurial
...r:
service(indexe
r-worker): child 24003 killed with signal 6 (core dumps disabled)
Mar 12 20:49:01 dsync-local(laeeth at laeeth.com): Error: Couldn't lock
/home/mail/
laeeth_laeeth_com/.dovecot-sync.lock: Timed out after 30 seconds
Mar 12 20:49:16 indexer-worker(laeeth at laeeth.com): Error: fts_tika: PUT
http://l
ocalhost:9997/tika/ failed: 500 Server Error
Mar 12 20:49:17 indexer-worker(rosie at kaleidicassociates.com): Warning:
I/O leak:
0x7fc47d60fcf0 (line 127, fd 25)
Mar 12 20:49:17 indexer-worker(rosie at kaleidicassociates.com): Panic:
file ioloop
.c: line 39 (io_add_file): asserti...
2017 Jul 13
5
passwd-file, getting invalid uid 0
...b {
args = user=%Ln noauthenticate
driver = static
skip = authenticated
}
passdb {
args = failure_show_msg=yes session=yes max_requests=20
driver = pam
skip = authenticated
}
plugin {
fts = solr
fts_autoindex = yes
fts_solr = url=http://thebighonker.lerctr.org:8983/solr/dovecot/
fts_tika = http://localhost:9998/tika/
imapsieve_mailbox1_before = file:/usr/local/share/dovecot-pigeonhole/sieve/report-spam.sieve
imapsieve_mailbox1_causes = COPY
imapsieve_mailbox1_name = SPAM
imapsieve_mailbox2_before = file:/usr/local/share/dovecot-pigeonhole/sieve/report-ham.sieve
imapsieve_...
2020 Oct 27
2
imapsieve: setting imapsieve_url disables admin scripts
...h
? }
? prefix =
? separator = /
}
passdb {
? args = /etc/dovecot/master-users
? driver = passwd-file
? master = yes
? pass = yes
}
passdb {
? args = /etc/dovecot/dovecot-sql.conf.ext
? driver = sql
}
plugin {
? fts = solr
? fts_autoindex = yes
? fts_solr = url=http://127.0.0.1:8983/solr/dovecot/
? fts_tika = http://127.0.0.1:9998/tika/
? imapsieve_mailbox1_before = file:/usr/local/lib/imapsieve/report-spam.sieve
? imapsieve_mailbox1_causes = COPY
? imapsieve_mailbox1_name = Junk
? imapsieve_mailbox2_before = file:/usr/local/lib/imapsieve/report-ham.sieve
? imapsieve_mailbox2_causes = COPY
? imapsieve...
2017 Dec 25
2
Sieve 0.5.0/Dovecot 2.3.0
...b {
args = user=%Ln noauthenticate
driver = static
skip = authenticated
}
passdb {
args = failure_show_msg=yes session=yes max_requests=20
driver = pam
skip = authenticated
}
plugin {
fts = solr
fts_autoindex = yes
fts_solr = url=http://thebighonker.lerctr.org:8983/solr/dovecot/
fts_tika = http://localhost:9998/tika/
imapsieve_mailbox1_before = file:/usr/local/share/dovecot-pigeonhole/sieve/report-spam.sieve
imapsieve_mailbox1_causes = COPY
imapsieve_mailbox1_name = SPAM
imapsieve_mailbox2_before = file:/usr/local/share/dovecot-pigeonhole/sieve/report-ham.sieve
imapsieve_...
2017 Dec 25
3
Sieve 0.5.0/Dovecot 2.3.0
...gt; > args = failure_show_msg=yes session=yes max_requests=20
> > driver = pam
> > skip = authenticated
> > }
> > plugin {
> > fts = solr
> > fts_autoindex = yes
> > fts_solr = url=http://thebighonker.lerctr.org:8983/solr/dovecot/
> > fts_tika = http://localhost:9998/tika/
> > imapsieve_mailbox1_before = file:/usr/local/share/dovecot-pigeonhole/sieve/report-spam.sieve
> > imapsieve_mailbox1_causes = COPY
> > imapsieve_mailbox1_name = SPAM
> > imapsieve_mailbox2_before = file:/usr/local/share/dovecot-pigeon...
2014 May 11
5
v2.2.13 released
...e connections hanging
arond for a long time. (Affected Dovecot v1.1+)
+ mdbox: Added mdbox_purge_preserve_alt setting to keep the file
within alt storage during purge. (Should become enforced in v2.3.0?)
+ fts: Added support for parsing attachments via Apache Tika. Enable
with: plugin { fts_tika = http://tikahost:9998/tika/ }
+ virtual plugin: Delay opening backend mailboxes until it's necessary.
This requires mailbox_list_index=yes to work. (Currently IMAP IDLE
command still causes all backend mailboxes to be opened.)
+ mail_never_cache_fields=* means now to disable all cachin...
2014 May 11
5
v2.2.13 released
...e connections hanging
arond for a long time. (Affected Dovecot v1.1+)
+ mdbox: Added mdbox_purge_preserve_alt setting to keep the file
within alt storage during purge. (Should become enforced in v2.3.0?)
+ fts: Added support for parsing attachments via Apache Tika. Enable
with: plugin { fts_tika = http://tikahost:9998/tika/ }
+ virtual plugin: Delay opening backend mailboxes until it's necessary.
This requires mailbox_list_index=yes to work. (Currently IMAP IDLE
command still causes all backend mailboxes to be opened.)
+ mail_never_cache_fields=* means now to disable all cachin...
2019 Sep 08
1
Subscribe to a fileinto :create mailbox?
...driver = static
skip = authenticated
}
passdb {
args = failure_show_msg=yes session=yes max_requests=20
driver = pam
override_fields = domain=lerctr.org
skip = authenticated
}
plugin {
fts = solr
fts_autoindex = yes
fts_solr = url=http://thebighonker.lerctr.org:8983/solr/dovecot/
fts_tika = http://localhost:9998/tika/
imapsieve_mailbox1_before =
file:/usr/local/share/dovecot-pigeonhole/sieve/report-spam.sieve
imapsieve_mailbox1_causes = COPY
imapsieve_mailbox1_name = SPAM
imapsieve_mailbox2_before =
file:/usr/local/share/dovecot-pigeonhole/sieve/report-ham.sieve
imapsieve_...
2017 Oct 11
0
FTS Dealing badly with Tika failures
...er piping
the attachment through a shellscript as done with fts_decoder.
Having this feature enabled though I sometimes do see the situation that
the tika service does not reply in time and it looks like the whole FTS
transaction gets aborted:
dovecot: indexer-worker(eggs at localhost): Error: fts_tika: PUT
http://localhost:9998/tika/ failed: Request timed out (Request queued
60.145 secs ago, 1 attempts in 60.134 secs, 60.034 in http ioloop, 0.000
in other ioloops, connected 1739.235 secs ago)
dovecot: indexer-worker(eggs at localhost): Error: Mailbox INBOX.junk: Mail
search failed: Internal...
2020 Nov 15
0
[patch] enhancement for tika server protected by user/password basic auth
...mise/patch will get picked up.
(ya-request for a proper @dovecot public bug/issue queue!)
have you found any other 'magic required' to get solr & tika indexing text/attachments, respectively, in Dovecot context?
is it as straightforward as spec'ing the 'fts_solr' & 'fts_tika' urls, and Dovecot does the passing-around correctly?
2014 May 20
0
Solr/Tika
...ist/solr-cell-*|
solr-4.7.2/contrib/extraction/lib/*
to
||/var/lib/tomcat6/webapps/solr/WEB-INF/lib/
how do I get dovecot to index attachments?
Do I just need to add| fts-solr =/index-attachments url=http://localhost:8080/solr-4.7.2/
/to the 90-plugin.conf?
Or do I need to so something like
fts_tika =/http://localhost:8080/solr-4.7.2//
IE the same uri I have for the solr?
Thanks
Alex
2017 Jul 18
1
passwd-file, getting invalid uid 0
...gt; > args = failure_show_msg=yes session=yes max_requests=20
> > driver = pam
> > skip = authenticated
> > }
> > plugin {
> > fts = solr
> > fts_autoindex = yes
> > fts_solr = url=http://thebighonker.lerctr.org:8983/solr/dovecot/
> > fts_tika = http://localhost:9998/tika/
> > imapsieve_mailbox1_before = file:/usr/local/share/dovecot-
> > pigeonhole/sieve/report-spam.sieve
> > imapsieve_mailbox1_causes = COPY
> > imapsieve_mailbox1_name = SPAM
> > imapsieve_mailbox2_before = file:/usr/local/share/dov...