PGNet Dev
2022-May-23 23:27 UTC
enable/control fts-tika debug logging in Dovecot 2.3.18 + Tika Server 2.4.0?
i run dovecot-2.3.18-1.fc36.x86_64 i've installed Apache Tika, v 2.4.0 ls -al tika-server-standard-2.4.0.jar -rw-r--r-- 1 root root 59M May 2 09:53 tika-server-standard-2.4.0.jar tika's listening telnet 127.0.0.1 9998 Trying 127.0.0.1... Connected to 127.0.0.1. Escape character is '^]'. telnet> and responds to a test curl \ -T /tmp/test.pdf \ http://127.0.0.1:9998/meta pdf:unmappedUnicodeCharsPerPage,0,0,0,0,0,0,0,0,0,0,0,0,0,0 pdf:PDFVersion,1.4 xmp:CreatorTool,Adobe InDesign 15.1 (Macintosh) pdf:hasXFA,false access_permission:modify_annotations,true access_permission:can_print_degraded,true X-TIKA:Parsed-By-Full-Set,org.apache.tika.parser.DefaultParser,org.apache.tika.parser.pdf.PDFParser dcterms:created,2020-08-13T14:55:46Z language,en dcterms:modified,2020-09-24T23:38:28Z dc:format,application/pdf; version=1.4 xmpMM:DocumentID,xmp.id:8a612346-9d03-4caf-8ebf-da6f3716ed0a pdf:docinfo:creator_tool,Adobe InDesign 15.1 (Macintosh) access_permission:fill_in_form,true pdf:docinfo:modified,2020-09-24T23:38:28Z pdf:hasCollection,false pdf:encrypted,false pdf:hasMarkedContent,true Content-Type,application/pdf dc:language,en-US pdf:producer,Adobe PDF Library 15.0 access_permission:extract_for_accessibility,true access_permission:assemble_document,true xmpTPg:NPages,14 pdf:hasXMP,true pdf:charsPerPage,84,676,1653,1914,814,1022,645,1221,1087,732,887,1295,1263,149 access_permission:extract_content,true xmpMM:DerivedFrom:DocumentID,xmp.did:b98726d4-04c4-48f5-88be-0a48a0074356 access_permission:can_print,true pdf:docinfo:trapped,false X-TIKA:Parsed-By,org.apache.tika.parser.DefaultParser,org.apache.tika.parser.pdf.PDFParser xmpMM:DerivedFrom:InstanceID,xmp.iid:3dd6a91f-a114-4d63-804e-e2b749c15075 pdf:annotationTypes,null access_permission:can_modify,true pdf:docinfo:producer,Adobe PDF Library 15.0 pdf:docinfo:created,2020-08-13T14:55:46Z pdf:annotationSubtypes,Link in dovecot config, i've added plugin { fts_tika = http://127.0.0.1:9998/tika/ } and log_debug = (category=fts-flatcurve OR category=fts-tika) on message receipt, I see verbose logs for fts-flatcurve, as expected, but not a trace of output from fts-tika, in dovecot logs how to correctly turn on debug/verbose logging for fts-tika use in/by dovecot?
Michael Slusarz
2022-May-24 00:16 UTC
enable/control fts-tika debug logging in Dovecot 2.3.18 + Tika Server 2.4.0?
> On 05/23/2022 5:27 PM PGNet Dev <pgnet.dev at gmail.com> wrote: > > how to correctly turn on debug/verbose logging for fts-tika use in/by dovecot?mail_debug = yes This turns on HTTP debugging for the outgoing Tika requests. Unfortunately, Tika has not yet been converted to events/categories with the ability to more granularly enable debugging just for this component. It's probably easier to just look at tika's debugging logs. The default log level (at least in Tika 2.3) will output an INFO line for every attachment indexed: INFO [qtp235162442-22] 16:15:19,905 org.apache.tika.server.core.resource.TikaResource /tika (text/calendar) michael