The first result shows "no results" in Dovecot for any search by header. (I typed an email address in the RoundCube search box; RoundCube uses Dovecot as its back end, and Dovecot uses Solr as its own back end.)

So much effort for crappy results.

Can't we really revive Squat? It is 2 lines of config, and not a single problem.

On January 2, 2019 08:16:33 Joan Moreau via dovecot <dovecot at dovecot.org> wrote:
> and the first line of the diff is:
>
> < this file, see http://wiki.apache.org/solr/SolrConfigXml.
> ---
> > this file, see http://wiki.apache.org/solr/SolrConfigXml.
> 38c38
> < <luceneMatchVersion>6.4.1</luceneMatchVersion>
> ---
> > <luceneMatchVersion>7.5.0</luceneMatchVersion>
>
> So, are you running 6.4.1 or 7.5.0?
>
> On 2019-01-02 08:12, Joan Moreau wrote:
>> The real main difference seems to come from "diffconfig.xml".
>>
>> When I use yours, Solr deletes (!) schema.xml, creates a "managed-schema",
>> and starts complaining about useless types (tdates, booleans, etc.) that
>> are not needed for mail fields.
>>
>> When I use mine (from the standard Arch distribution), it keeps things as
>> they are (yeah!), does not complain about those useless types, and starts
>> up properly.
>>
>> I attach my diffconfig.
>>
>> But these are settings that everyone should adjust for their own use.
>>
>> The main problem is: after some time of indexing from Dovecot, Dovecot
>> returns errors (invalid SID, etc.) and Solr returns "out of range indexes"
>> errors.
>>
>> On 2019-01-02 07:49, Joan Moreau wrote:
>>
>> Hi
>>
>> Solr is a standard package in ArchLinux ("pacman -S solr"). The systemd
>> installation script is included (and it is launching /opt/solr/bin/solr.in.sh).
>>
>> Instance: sudo -u solr /opt/solr/bin/solr create -c dovecot -> this
>> creates a separate folder with default solrconfig.xml, schema.xml, etc.
>>
>> I symlinked the data folder to a second, much bigger drive (ext4).
>>
>> On 2018-12-31 14:09, Daniel Miller wrote:
>>
>> On 12/29/2018 4:49 PM, Joan Moreau wrote:
>>
>> Also:
>>
>> - Java is 10.0.2
>>
>> Same as me.
>>
>> - If I delete schema.xml and create only managed-schema, Solr refuses
>> to start with a Java error "schema.xml missing"
>>
>> Ok...so we need to do some more digging.
>>
>> How did you install Solr? (I downloaded a "binary" installation and
>> unpacked it)
>>
>> How did you create the dovecot instance? (I've provided explicit
>> instructions for how I did it - did you follow those exactly or something
>> different)?
>>
>> How are you starting Solr? (I use the provided "solr/bin/solr start"
>> command, wrapped inside a systemd service).
>>
>> --
>> Daniel
Another use case: I type "must" in the search field. I get some results, but far from all of them; for instance, the "solarmust" mail is not in the results.

If I type "solarmust", then I do get the solarmust mail.

Honestly, this is highly unstable. I am not sure whether the bugs come from Solr or Dovecot.

Below is my adjusted schema.xml (with corrections to Daniel's version, which is definitely not working):

<?xml version="1.0" encoding="UTF-8"?>
<schema name="dovecot" version="2.0">
  <uniqueKey>id</uniqueKey>

  <fieldType name="booleans" class="solr.BoolField" sortMissingLast="true" multiValued="true"/>
  <fieldType name="gjlong" class="solr.LongPointField" positionIncrementGap="0"/>
  <fieldType name="gjtext" class="solr.TextField" autoGeneratePhraseQueries="true" positionIncrementGap="100">
    <analyzer type="index">
      <tokenizer class="solr.EdgeNGramTokenizerFactory" maxGramSize="15" minGramSize="3" />
      <filter class="solr.WordDelimiterGraphFilterFactory" catenateNumbers="1" generateNumberParts="1" splitOnCaseChange="1" generateWordParts="1" splitOnNumerics="1" catenateAll="1" catenateWords="1" preserveOriginal="1"/>
      <filter class="solr.FlattenGraphFilterFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
      <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
    </analyzer>
    <analyzer type="query">
      <tokenizer class="solr.ClassicTokenizerFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
      <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
    </analyzer>
  </fieldType>
  <fieldType name="string" class="solr.StrField"/>

  <field name="_version_" type="string" indexed="true" stored="true"/>
  <field name="bcc" type="string" indexed="false" stored="false"/>
  <field name="body" type="gjtext" indexed="true" stored="false"/>
  <field name="box" type="string" indexed="true" required="true" stored="true"/>
  <field name="cc" type="gjtext" indexed="true" stored="false"/>
  <field name="from" type="gjtext" indexed="true" stored="false"/>
  <field name="hdr" type="gjtext" indexed="false" stored="false"/>
  <field name="id" type="string" indexed="true" required="true" stored="true"/>
  <field name="subject" type="gjtext" indexed="true" stored="false"/>
  <field name="to" type="gjtext" indexed="true" stored="false"/>
  <field name="uid" type="string" indexed="true" required="true" stored="true"/>
  <field name="user" type="string" indexed="true" required="true" stored="true"/>
</schema>

On 2019-01-02 10:04, Joan Moreau wrote:
> The first result shows "no results" in Dovecot for any search by header. (I typed an email address in the RoundCube search box; RoundCube uses Dovecot as its back end, and Dovecot uses Solr as its own back end.)
>
> So much effort for crappy results.
>
> Can't we really revive Squat? It is 2 lines of config, and not a single problem.
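One possible reading of the "must" versus "solarmust" difference, offered as a guess and not a diagnosis: the index analyzer of gjtext in the schema above uses solr.EdgeNGramTokenizerFactory with minGramSize="3" and maxGramSize="15", and edge n-grams are prefixes only. The Python sketch below is a toy model of that idea, not Solr code. The function name edge_ngrams is made up for illustration, and the filters that follow the tokenizer, as well as whatever query Dovecot actually sends, are not modelled.

```python
def edge_ngrams(text, min_gram=3, max_gram=15):
    """Return the prefixes of `text` with a length between min_gram and max_gram.

    Toy model only: it mimics the idea of front edge n-grams and nothing else.
    """
    longest = min(max_gram, len(text))
    return [text[:n] for n in range(min_gram, longest + 1)]


grams = edge_ngrams("solarmust")
print(grams)                  # every gram starts with "sol"
print("must" in grams)        # "must" is a suffix, so it is never produced
print("solarmust" in grams)   # the full word is its own longest prefix
```

If this model is close to what happens in the index, a search for a word ending such as "must" would only hit mails where "must" starts a token, which would fit the partial results described above.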
Here is a refinement of the schema.xml (below). This however does not solve the "no results" and "Out of range" errors in Dovecot and Solr.

<?xml version="1.0" encoding="UTF-8"?>
<schema name="dovecot" version="2.0">
  <uniqueKey>id</uniqueKey>

  <fieldType name="booleans" class="solr.BoolField" sortMissingLast="true" multiValued="true"/>
  <fieldType name="gjlong" class="solr.LongPointField" positionIncrementGap="0"/>
  <fieldType name="gjtext" class="solr.TextField" autoGeneratePhraseQueries="true" positionIncrementGap="100">
    <analyzer type="index">
      <tokenizer class="solr.StandardTokenizerFactory"/>
      <filter class="solr.WordDelimiterGraphFilterFactory" catenateNumbers="1" generateNumberParts="1" splitOnCaseChange="1" generateWordParts="1" splitOnNumerics="1" catenateAll="1" catenateWords="1" preserveOriginal="1"/>
      <filter class="solr.FlattenGraphFilterFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
      <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
    </analyzer>
    <analyzer type="query">
      <tokenizer class="solr.KeywordTokenizerFactory"/>
    </analyzer>
  </fieldType>
  <fieldType name="gjfield" class="solr.TextField" autoGeneratePhraseQueries="true">
    <analyzer type="index">
      <tokenizer class="solr.NGramTokenizerFactory" maxGramSize="25" minGramSize="3" />
      <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
    </analyzer>
    <analyzer type="query">
      <tokenizer class="solr.KeywordTokenizerFactory"/>
    </analyzer>
  </fieldType>
  <fieldType name="string" class="solr.StrField"/>

  <field name="_version_" type="string" indexed="true" stored="true"/>
  <field name="bcc" type="string" indexed="false" stored="false"/>
  <field name="body" type="gjtext" indexed="true" stored="false"/>
  <field name="box" type="string" indexed="true" required="true" stored="true"/>
  <field name="cc" type="gjfield" indexed="true" stored="false"/>
  <field name="from" type="gjfield" indexed="true" stored="false"/>
  <field name="hdr" type="string" indexed="false" stored="false"/>
  <field name="id" type="string" indexed="true" required="true" stored="true"/>
  <field name="subject" type="gjtext" indexed="true" stored="false"/>
  <field name="to" type="gjfield" indexed="true" stored="false"/>
  <field name="uid" type="string" indexed="true" required="true" stored="true"/>
  <field name="user" type="string" indexed="true" required="true" stored="true"/>
</schema>
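To make the intent of the gjfield type above easier to reason about, here is a toy Python model, under clearly stated assumptions and not taken from Solr itself. It assumes the index side stores every substring of 3 to 25 characters (as NGramTokenizerFactory with minGramSize="3" and maxGramSize="25" suggests) and that the query side keeps the whole search string as one token (as KeywordTokenizerFactory does). The names ngrams and matches and the address solarmust@example.org are invented for illustration; what Dovecot really sends to Solr is not modelled.

```python
def ngrams(text, min_gram=3, max_gram=25):
    """Return the set of all substrings of `text` with min_gram..max_gram characters."""
    grams = set()
    for start in range(len(text)):
        for size in range(min_gram, max_gram + 1):
            if start + size > len(text):
                break  # longer grams from this start would run past the end
            grams.add(text[start:start + size])
    return grams


def matches(query, field_value):
    """Toy match rule: the whole query must equal one indexed gram."""
    return query in ngrams(field_value)


address = "solarmust@example.org"   # hypothetical header value
print(matches("must", address))     # substring of 3..25 chars
print(matches("Must", address))     # gjfield has no lowercase filter
print(matches("mu", address))       # shorter than minGramSize
```

In this model a substring search such as "must" succeeds, but matching is case-sensitive, and queries shorter than 3 or longer than 25 characters can never match. Whether any of this is related to the "no results" reported above is not established.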