Displaying 20 results from an estimated 2000 matches similar to: "Lucene 3.6.2 backend for xapian (#25)"
2013 Sep 02
2
Backend for Lucene format indexes-How to get doclength
On Mon, Sep 02, 2013 at 09:21:48AM +0800, jiangwen jiang wrote:
> TfIdfWeight and BM25 (b=0) also need wdf_upper_bound, which does not exist in
> the Lucene backend.
If you don't provide an implementation of wdf_upper_bound(), the default
is to use the collection frequency of the term, so provided that
information is available in the lucene files, the lack of
wdf_upper_bound information
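The fallback described in this reply can be sketched as follows. This is a minimal Python illustration, not Xapian code: the function name mirrors the wdf_upper_bound() method mentioned above, but the `collection_freq` and `stored_bound` parameters are hypothetical. It relies only on the fact that no single document can contain a term more often than the term occurs in the whole collection.

```python
from typing import Optional


def wdf_upper_bound(collection_freq: int, stored_bound: Optional[int] = None) -> int:
    """Return an upper bound on the within-document frequency (wdf) of a term.

    stored_bound is a hypothetical per-term bound kept by the backend.
    When the backend has none, fall back to the collection frequency,
    which is always >= the wdf in any single document.
    """
    if stored_bound is not None:
        return stored_bound
    return collection_freq


# Toy data (an assumption for illustration): wdf of one term in each
# document that contains it.
wdfs = [3, 1, 7]
collection_freq = sum(wdfs)
bound = wdf_upper_bound(collection_freq)
```

The fallback bound is valid but looser than a stored one, so it would be expected to make weight-based pruning less effective rather than incorrect.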
2013 Aug 26
2
Backend for Lucene format indexes-How to get doclength
On Mon, Aug 26, 2013 at 09:41:07AM +0800, jiangwen jiang wrote:
> > For now, using weighting schemes which don't use document length is
> > probably the simplest answer.
>
> There's a tf-idf weighting scheme on svn master; is it suitable for the
> Lucene backend?
Yes - TfIdfWeight doesn't ever use the document length (at least with
the normalisations currently
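To illustrate why a tf-idf scheme can work without document lengths, here is a minimal Python sketch. It assumes the plainest variant (raw wdf multiplied by a logarithmic idf); the normalisations in the actual TfIdfWeight class are configurable and are not reproduced here, and the function and parameter names are hypothetical.

```python
import math


def tfidf_weight(wdf: int, termfreq: int, doc_count: int) -> float:
    """Plain tf-idf weight for one term in one document.

    wdf:       occurrences of the term in this document
    termfreq:  number of documents containing the term
    doc_count: number of documents in the database
    Note that no document length appears anywhere in the formula.
    """
    idf = math.log(doc_count / termfreq)
    return wdf * idf


rare = tfidf_weight(wdf=2, termfreq=10, doc_count=1000)
common = tfidf_weight(wdf=2, termfreq=500, doc_count=1000)
```

All three inputs are per-term or per-database statistics, which is why a backend lacking a doclength list could still support a scheme of this shape.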
2013 Jun 17
2
Backend for Lucene format indexes-How to get doclength
*Or do you mean that it's one number per document whereas the other stats
are per database, so it's harder to store it?*
Yes, that is what I mean. It is a large amount of data. If I add a new
doclength list (holding every document length in one list, as chert does),
I am concerned that:
1. This doclength list may become the bottleneck in this backend, see
http://trac.xapian.org/ticket/326
2. It changes too much
2013 Aug 25
2
Backend for Lucene format indexes-How to get doclength
On Tue, Aug 20, 2013 at 07:28:42PM +0800, jiangwen jiang wrote:
> I think norm(t, d) in Lucene can be used to calculate a number which is
> similar to the doc length (see norm(t,d) in
> http://lucene.apache.org/core/3_5_0/api/all/org/apache/lucene/search/Similarity.html#formula_norm).
It sounds similar (especially if document and field boosts aren't in use),
though some places may rely on
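The idea in the quoted message can be sketched in Python. This assumes the default Lucene 3.x length normalisation of 1/sqrt(number of terms) with no document or field boosts, and it ignores the fact that Lucene stores the norm quantised into a single byte, so a real estimate would be much coarser. The function names are hypothetical.

```python
import math


def length_norm(num_terms: int) -> float:
    """Length normalisation factor, assumed to be 1/sqrt(num_terms)."""
    return 1.0 / math.sqrt(num_terms)


def estimate_doc_length(norm: float) -> int:
    """Invert the length norm to recover an approximate term count.

    Only meaningful when no boosts were folded into the norm.
    """
    if norm <= 0:
        raise ValueError("norm must be positive")
    return round(1.0 / (norm * norm))


estimates = [estimate_doc_length(length_norm(n)) for n in (1, 4, 10, 250)]
```

With unquantised norms the round trip is exact; with boosts in use the recovered number would no longer correspond to a document length.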
2013 Jun 16
3
Backend for Lucene format indexes-How to get doclength
Hi, all:
I have written a demo patch for the Lucene-format index backend; the Lucene
version is 3.6.2.
http://lucene.apache.org/core/3_6_2/fileformats.html
For now, this demo patch supports only the basic features of Lucene. Compound
files (.cfs/.cfe), term vectors (.tvx/.tvd/.tvf) and
deleted documents (.del) are not supported, and the skip list in .fdx is not
supported either.
example/quest.cc is used to test this demo.
2006 May 15
16
Ferret not able to read a Lucene Index?
Hi all,
Having problems trying to get Ferret to read an index generated by
Lucene.
Am I right in thinking Ferret should be able to read a Lucene generated
index no problem?
Using the code snippets detailed in
http://www.ruby-forum.com/topic/64099#new
Any advice gratefully received.
Many Thanks,
Steven
--
Posted via http://www.ruby-forum.com/.
2006 Jan 05
0
Java Lucene compatibility?
I haven't dug into this yet, but wanted to report it. I've built an
index with Java Lucene (1.9, from svn trunk) and then trying to
search it with Ferret and getting this:
/usr/lib/ruby/site_ruby/1.8/ferret/index/term_buffer.rb:31:in `read':
undefined method `name' for nil:NilClass (NoMethodError)
from /usr/lib/ruby/site_ruby/1.8/ferret/index/
2005 Feb 25
2
Bug in TermIterator::skip_to() ?
Hi all,
I've been toying with xapian (mostly using the Python bindings) and I
think I've hit a bug in the TermIterator::skip_to() method (or maybe
in QuartzAllTermsList::skip_to()).
I've attached a c++ source file that demonstrates the issue. In short,
if you have a WritableDatabase, ask for the all-terms TermIterator
with db.allterms_begin(), and then skip_to() a word that is itself
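For reference, the behaviour usually expected from skip_to() on a sorted term list can be sketched in Python: position on the target term if it is present, otherwise on the first term that sorts after it. This is an illustration of the expected semantics only, not Xapian's implementation; the function name and the toy term list are hypothetical.

```python
from bisect import bisect_left
from typing import List


def skip_to(sorted_terms: List[str], target: str) -> int:
    """Return the index of the first term >= target.

    Returns len(sorted_terms) when every term sorts before target,
    which corresponds to reaching the end of the iteration.
    """
    return bisect_left(sorted_terms, target)


terms = ["apple", "banana", "cherry"]
exact = skip_to(terms, "banana")
between = skip_to(terms, "blueberry")
past_end = skip_to(terms, "zebra")
```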
2012 May 21
1
Failed build with lucene plugin
Hi,
I've tried to build dovecot with the lucene plugin but I get this output:
/bin/sh ../../../libtool --tag=CXX --mode=compile c++
-DHAVE_CONFIG_H -I. -I../../.. -I../../../src/lib
-I../../../src/lib-mail -I../../../src/lib-index
-I../../../src/lib-storage -I../../../src/plugins/fts
-I../../../src/doveadm -I/usr/local/include -I/usr/local/lib -O2
-pipe -fno-strict-aliasing
2013 Jun 14
1
dovecot 2.2 with lucene compile error
Hi, I get a compile error
security -Werror=format-security -MT lucene-wrapper.lo -MD -MP -MF
.deps/lucene-wrapper.Tpo -c lucene-wrapper.cc -fPIC -DPIC -o
.libs/lucene-wrapper.o
lucene-wrapper.cc: In function 'lucene_index* lucene_index_init(const
char*, mailbox_list*, const fts_lucene_settings*)':
lucene-wrapper.cc:128:2: error: 'else' without a previous 'if'
make[5]:
2008 Jan 25
5
Ferret+Lucene Index
We use Nutch and Lucene for our heavy duty text analysis jobs but I'm trying
to use ferret to do some experiments. I understood that Ferret used the
same index format as lucene but I cannot look into a lucene index with
ferret and cannot read a ferret index with luke (the lucene index browser).
Am I doing something wrong or have the formats diverged?
2007 Sep 01
2
Can lucene use index generated by ferret?
I found it quite fun to generate an index with ferret. However, I have to
use lucene now. Can I use the index that I've already built with ferret?
ps. Lucene 2.2.0 vs Ferret 0.11.4
Thx!
2016 Apr 12
2
fts-lucene with virtual folders
Hi,
It looks like I've found something strange. It looks like dovecot
updates the fts-lucene index every time I open a virtual folder that
contains an FTS query:
indexer-worker(dion): Warning: fts-lucene: Settings have changed, rebuilding index for mailbox
dovecot-virtual is pretty simple:
archive/INBOX
BODY "test"
first of all, I'm trying to perform FTS in archive/INBOX
2013 Apr 25
2
Lucene and Zlib with 2.2.1
Hello,
I have an "archive" folder in my inbox, where I manually stick old
mails into a compressed mbox format. Since upgrading to Dovecot 2.2.1,
I've started seeing messages like the following in my log files:
imap(...): Error: lucene: Failed to sync mailbox INBOX: Mailbox isn't selectable
indexer-worker(...): Error: lucene: Failed to sync mailbox INBOX: Mailbox
2011 Feb 14
1
Idea: Backend for Lucene format indexes
Hi,
I'm interested in implementing the idea of using search indexes built by
Lucene. I have some ideas about how to do that. I'm currently studying the index
formats of both. I have written a search application using Lucene and now
I'll do the same using Xapian. Then I'll check the details of those index
formats and work out how to convert a Lucene index to a Xapian one. Is there
somebody
2015 Jun 15
2
FTS lucene and english + german mails
Hi,
I'm currently looking over the FTS pages to enable it in my dovecot.
But I'm unsure what the best settings of the lucene plugin are, if you
receive german and english mails.
Wiki says:
textcat_conf=<path> textcat_dir=<path>: If specified, enable guessing
the stemming language for emails and search keywords. This is a little
bit problematic in practice, since indexing and
2004 Oct 28
1
Lucene ranking
Kevin Burton has posted about poor ranking in Lucene preferring
shorter documents over longer ones[1]. A similar search in Xapian
returns documents in the expected order:
Performing query `Xapian::Query(foo)'
3 results found
ID 3 99% [foo foo foo]
ID 2 94% [foo foo]
ID 1 80% [foo]
Anyone know what Lucene is doing here? Their FAQ doesn't mention what
weighting scheme they use, and I
2020 May 19
5
FTS-lucene errors : language not available for stemming
I'm getting some log errors with clucene that I am having no luck tracking down on the interwebs.
Errors:
May 19 05:05:16 indexer-worker(gessel at blackrosetech.com)<62971><aPAEI3zLw17A/QAA0J78UA:EF25M3zLw1779QAA0J78UA>: Error: lucene index /mail/blackrosetech.com/gessel//lucene-indexes: IndexWriter::addDocument() failed (#4): language not available for stemming
May 19 05:05:16
2014 Aug 01
1
Mail Archiving/Lucene Indexes/mbox
I have a script (available on demand) that moves all my mail for a given
month to a separate NAMESPACE (#ARCHIVE).
When I do that, the lucene-indexes folder in the PRIMARY namespace seems to
get cleaned out, and NOTHING gets put in the lucene-indexes folder
(although it gets created) in the #ARCHIVE namespace.
Is this to spec?
the archiving is done via doveadm move commands.
Thanks!
2012 Feb 06
2
Possible broken indexer(lucene/solr)?
Hello,
I am trying the new 2.1rc and don't get any data in the search index.
I tried lucene first, and then switched back to solr.
If I do a search in a mail folder, I get this in the dovecot log:
2012-02-06 22:17:11 | dovecot: |
indexer-worker(xxxxxx): Indexed 0 messages in INBOX/dovecot
Log from solr.
Feb 6, 2012 10:17:11 PM org.apache.solr.core.SolrCore execute
INFO: []