search for: jiangwen

Displaying 13 results from an estimated 13 matches for "jiangwen".

2013 Feb 05
0
make error in xapian-application/omega (jiangwen jiang)
Hey Hi jiangwen,hope you are doing fine :) You need some libraries and tools installed on your system before you build Xapian and omega.The complete list can be found in the "Building from svn or git" section of this document.:- http://svn.xapian.org/trunk/xapian-core/HACKING?view=co Make sure you have...
2013 Sep 24
2
Is the project learning to rank need to be improved?
HI, Parth: Thanks for your reply. There's xapian-letor folder in current master trunk, is this code merged in 2011? the letor code of 2012 is not merged to the trunk? Regards 2013/9/23 Parth Gupta <pargup8 at gmail.com> > Dear Jiangwen Jiang, > > The current state of the project can be reached from the repo: > http://trac.xapian.org/wiki/GSoC2012/LTR > > I would suggest to fork the branch and start playing with it. You might > have some compilation issues. Current improvements are needed to properly > put tog...
2013 Sep 23
2
Is the project learning to rank need to be improved?
Hi, guys: I found this project idea http://trac.xapian.org/wiki/GSoCProjectIdeas#Project:LearningtoRank If it is need to be improved, I will try to handle it thanks Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130923/d0ced0d6/attachment-0002.html>
2013 Sep 02
2
Backend for Lucene format indexes-How to get doclength
On Mon, Sep 02, 2013 at 09:21:48AM +0800, jiangwen jiang wrote: > TfIdfWeight and BM25(b=0) also need wdf_upper_bound, it is not exists in > Lucene backends. If you don't provide an implementation of wdf_upper_bound(), the default is to use the collection frequency of the term, so provided that information is available in the lucene file...
2013 Sep 25
2
Is the project learning to rank need to be improved?
...there are some compilation errors in the branch. The code in branch is better refactored. The Ranker and FeatureManager classes are well defined and implemented. Parth. On Wed, Sep 25, 2013 at 9:02 AM, Olly Betts <olly at survex.com> wrote: > On Tue, Sep 24, 2013 at 08:34:10PM +0800, jiangwen jiang wrote: > > There's xapian-letor folder in current master trunk, is this code merged > in > > 2011? > > Yes. > > > the letor code of 2012 is not merged to the trunk? > > It hasn't. Sadly the 2012 project just tailed off. I'm not sure what >...
2013 Aug 26
2
Backend for Lucene format indexes-How to get doclength
On Mon, Aug 26, 2013 at 09:41:07AM +0800, jiangwen jiang wrote: > > For now, using weighting schemes which don't use document length is > > probably the simplest answer. > > There's tf-idf weighting scheme on svn master, is it suitable for lucene > backend? Yes - TfIdfWeight doesn't ever use the document length (a...
2013 Oct 30
2
Lucene 3.6.2 backend for xapian (#25)
[Replying to xapian-devel, as I think a wider audience would be useful] On Mon, Oct 21, 2013 at 11:24:51PM +0800, jiangwen jiang wrote: > yes, it's less efficient. Lucene database has multiple segments, each > segment can treat as a independent database. The same term may exists in >= > 1 segments. Sorry for taking a while to respond - I've been both busy and mulling this over. I think that perhap...
2013 Aug 25
2
Backend for Lucene format indexes-How to get doclength
On Tue, Aug 20, 2013 at 07:28:42PM +0800, jiangwen jiang wrote: > I think norm(t, d) in Lucene can used to caculate the number which is > similar to doc length(see norm(t,d) in > http://lucene.apache.org/core/3_5_0/api/all/org/apache/lucene/search/Similarity.html#formula_norm). It sounds similar (especially if document and field boosts ar...
2013 Jun 17
2
Backend for Lucene format indexes-How to get doclength
*Or do you mean that it's one number per document whereas the other stats are per database, so it's harder to store it?* yes, I mean this. It's a huge data. If a new doclength list(contains all the doclength in a list, like chert) is added by myself, I am concern about: 1. This doclength list may be the bottlenect in this backend, http://trac.xapian.org/ticket/326 2. Change too much
2013 Jun 16
3
Backend for Lucene format indexes-How to get doclength
Hi, all: I have wrote a demo patch for Backend for Lucene format indexes, Lucene version is 3.6.2. http://lucene.apache.org/core/3_6_2/fileformats.html Now, this demo patch just support the basic features in Lucene. Compound File(.cfs/.cfe)?term vector(.tvx/.tvd/.tvf) delete document(.del) are not supported, skip list in .fdx is not supported too example/quest.cc is used to test this demo.
2013 Mar 13
1
patch-Add standard ExpandDecider subclass to restrict to terms with a particular prefix
Hi, guys, I wrote a patch for ticket #467<http://trac.xapian.org/ticket/467>(Add standard ExpandDecider subclass to restrict to terms with a particular prefix). A new ExpandDecider class is added in expanddecider.cc/h, which delete all unprefix terms I am a newbiee to open source contribution, please let me know if this patch is work. Patch is here(generate by svn diff):
2013 Feb 05
1
make error in xapian-application/omega
Hi, all: I check out the trunk code from svn yesterday *svn co svn://svn.xapian.org/xapian/trunk xapian* compile process: *./bootstrap* *mkdir build* *cd build* *../configure --disable-documentation* *make* then the error occur: cd xapian-applications/omega && make all make[1]: Entering directory `/home/white127/xapian-svn/build/xapian-applications/omega' /usr/bin/perl
2013 Aug 27
2
What does collection_freq means?
Hi, all: I am confused with the concept of colletion_freq There's no informations about it on http://xapian.org/docs/glossary.html What does it means? Thanks Regards! -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130827/370cc6a3/attachment.html>