search for: extractingrequesthandl

Displaying 7 results from an estimated 7 matches for "extractingrequesthandl".

2011 May 23
3
[PATCH] Indexing mail attachments with Dovecot + Solr
...does not index attachments; With this patch, you can index mails and their attachments (pdf, docs, openoffice docs...) . You can get others goodies with this patch and the Solr Config provided, like Synonyms and Stemming (Spanish by default). Attachment indexing is provided by Solr Cell and Tika (ExtractingRequestHandler) * http://wiki.apache.org/solr/ExtractingRequestHandler Synonyms and Stemming are provided by SnowballPorterFilterFactory from Solr Language Analysis: * http://wiki.apache.org/solr/LanguageAnalysis We have tested Solr with Tomcat and Jetty. Tomcat is better to handle UTF-8 and bigger POSTS....
2020 Nov 15
0
[patch] enhancement for tika server protected by user/password basic auth
...a useful 1st read > > ? > https://lucene.apache.org/solr/guide/8_7/uploading-data-with-solr-cell-using-apache-tika.html It's an approach that could be worthwhile looking into, though not using solr cell, given the following statements at that link: "If any exceptions cause the |ExtractingRequestHandler| and/or Tika to crash, Solr as a whole will also crash because the request handler is running in the same JVM that Solr uses for other operations. Indexing can also consume all available Solr resources, particularly with large PDFs, presentations, or other files that have a lot of rich media emb...
2019 Apr 23
2
SolrCore 'dovecot' is not available due to init failure: fieldType 'text_general' not found in the schema
Hello, it seems an mismatch of schema file provided by dovecot-2.3.5.2 [root at mail conf]# pwd /var/solr/data/dovecot/conf # Below solrconfig.xml is from Solr-8.0.0. [root at mail conf]# grep text_general solrconfig.xml <str name="queryAnalyzerFieldType">text_general</str> field types. Text content will be indexed as "text_general" as <str
2019 Apr 23
3
SolrCore 'dovecot' is not available due to init failure: fieldType 'text_general' not found in the schema
...,/spell,/browse"> > <lst name="defaults"> > <str name="df">_text_</str> > </lst> > </initParams> > <requestHandler name="/update/extract" startup="lazy" > class="solr.extraction.ExtractingRequestHandler"> > <lst name="defaults"> > <str name="lowernames">true</str> > <str name="fmap.content">_text_</str> > </lst> > </requestHandler> > <searchComponent name="spellcheck&...
2019 Apr 23
0
SolrCore 'dovecot' is not available due to init failure: fieldType 'text_general' not found in the schema
...,/query,/select,/tvrh,/elevate,/spell,/browse"> <lst name="defaults"> <str name="df">_text_</str> </lst> </initParams> <requestHandler name="/update/extract" startup="lazy" class="solr.extraction.ExtractingRequestHandler"> <lst name="defaults"> <str name="lowernames">true</str> <str name="fmap.content">_text_</str> </lst> </requestHandler> <searchComponent name="spellcheck" class="solr.SpellCh...
2020 Nov 15
2
[patch] enhancement for tika server protected by user/password basic auth
On 11/15/20 12:21 PM, John Fawcett wrote: > I'm using tika-server.jar installed as a service yup. same here. atm, listening on localhost, with Dovecot -> Tika direct, no proxy. similarly fragile under load. throwing ~10 messages with .5-5MB attachments at it at once causes all sorts of complaints. one at a time seems OK ... > Dovecot currently implements separate integrations,
2013 Feb 05
1
"Header is huge" in fts-solr
...: solr.JsonUpdateRequestHandler 01.02.2013 18:03:54 org.apache.solr.core.RequestHandlers initHandlersFromConfig INFO: created /update/json: solr.JsonUpdateRequestHandler 01.02.2013 18:03:54 org.apache.solr.core.RequestHandlers initHandlersFromConfig INFO: adding lazy requestHandler: solr.extraction.ExtractingRequestHandler 01.02.2013 18:03:54 org.apache.solr.core.RequestHandlers initHandlersFromConfig INFO: created /update/extract: solr.extraction.ExtractingRequestHandler 01.02.2013 18:03:54 org.apache.solr.core.RequestHandlers initHandlersFromConfig INFO: adding lazy requestHandler: solr.XsltUpdateRequestHandler 0...