Displaying 1 result from an estimated 1 matches for "doclengh".
Did you mean:
doclength
2013 Jun 16
3
Backend for Lucene format indexes-How to get doclength
...found some data needed for BM25 in Xapian are not existed in
Lucene:
1. doclength_lower_bound?doclength_upper_bound
2. wdf_lower_bound?wdf_uppper_bound
3. total_length
4. doclength(for each document)
1-3 are statistics data, can be caculated when doing copydatabase, and
store them in somewhere. But doclengh is
hard to do this way.
1. some other data instead of doclength?
2. Xapian support other rank algorithm which does not need doclength?
Is there some suggestions to solve this problem?
And the demo patch is here:
https://github.com/white127/xapian-patch/blob/master/xapian_lucene_demo.patch
Regard...