Jiuding Duan
2012-Apr-20 03:33 UTC
[Xapian-devel] Implementing the tf-idf weighting scheme
Hi all,

This is a basic implementation of the tf-idf scheme (the basic scheme used in SMART) that can be used in Xapian. It may still need some further revision, but I believe it works. :)

I modified weight.h to declare a subclass Tf_idfWeight and added a new file, tf_idf.cc, in ../weight in the repo to implement it. Here is the git diff patch: https://gist.github.com/2422049

I think the next step is to register this scheme with Xapian and write some tests to check whether it works. I grepped for the current BM25Weight, TradWeight and BoolWeight and found clues about Enquire::set_weighting_scheme(), but I need to do more to understand it.

Best,
Jiuding
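For readers without access to the patch, here is a minimal sketch of the kind of per-term score a basic tf-idf scheme computes. It is not taken from the patch and does not use Xapian's Weight API. The function name tfidf_weight, its parameter names, and the choice of raw term frequency times a natural-log idf with no normalisation are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper, not part of Xapian or of the patch discussed above.
// Computes a basic tf-idf score for one term in one document:
//   wdf             - within-document frequency of the term (the "tf" part)
//   termfreq        - number of documents in the collection containing the term
//   collection_size - total number of documents in the collection
// Assumed variant: raw tf multiplied by log(N / n), with no normalisation.
double tfidf_weight(unsigned wdf, unsigned termfreq, unsigned collection_size)
{
    // A term that is scored must occur in at least one document,
    // and cannot occur in more documents than the collection holds.
    assert(termfreq > 0 && termfreq <= collection_size);

    double idf = std::log(static_cast<double>(collection_size) / termfreq);
    return wdf * idf;
}
```

With this definition a term that appears in every document scores zero, and rarer terms score higher for the same within-document frequency. In Xapian itself the scheme would be a Weight subclass selected through Enquire::set_weighting_scheme(), as the message notes. That wiring depends on the patch and is not shown here.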
Jiuding Duan
2012-Apr-20 03:47 UTC
[Xapian-devel] Implementing the tf-idf weighting scheme
The patch file is attached.

Attachment: report3.patch (application/octet-stream, 4022 bytes)
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20120420/4ba8eac8/attachment-0002.obj>