search for: doc_length

Displaying 2 results from an estimated 2 matches for "doc_length".

2013 Aug 25
2
Backend for Lucene format indexes-How to get doclength
...sed to caculate the number which is > similar to doc length(see norm(t,d) in > http://lucene.apache.org/core/3_5_0/api/all/org/apache/lucene/search/Similarity.html#formula_norm). It sounds similar (especially if document and field boosts aren't in use), though some places may rely on the doc_length = sum(wdf) definition - in particular, some other measure of length may violate assumptions like wdf <= doc_length. For now, using weighting schemes which don't use document length is probably the simplest answer. > And this feature is applied into this pull request( > https://github...
2013 Jun 16
3
Backend for Lucene format indexes-How to get doclength
Hi, all: I have wrote a demo patch for Backend for Lucene format indexes, Lucene version is 3.6.2. http://lucene.apache.org/core/3_6_2/fileformats.html Now, this demo patch just support the basic features in Lucene. Compound File(.cfs/.cfe)?term vector(.tvx/.tvd/.tvf) delete document(.del) are not supported, skip list in .fdx is not supported too example/quest.cc is used to test this demo.