Displaying 20 results from an estimated 1100 matches similar to: "GSOC 2014"
2014 Feb 26
2
GSOC 2014
Just to add on top of what Olly has already mentioned.
> > Now, I'm reading the resources provided on ideas' page. Do you have
> > any other suggestions of knowing more about the letor?
> > And I'd like to test the function of letor. But I can't find code
> > example. Can u give me some suggestions?
>
> Hopefully Parth can help here.
>
In order
2014 Mar 09
2
[GSOC 2014] Some questions about Letor module
Thanks for your reply! For the third question: In https://inex.mmci.uni-saarland.de/data/documentcollection.jsp, I can find inex2010-article.qrels in 2010 assessment, but can?t find query files. Could you send me the link? I have registered on INEX website. And I also need to download ``INEX 2009 collection without annotation tags: (unofficial)`` on
2014 Mar 09
2
[GSOC 2014] Some questions about Letor module
Hi,
I've read the code of letor module. And I have some questions about it:
1. In https://github.com/rishabhmehrotra/xapian/blob/master/xapian-letor/letor_internal.cc#L299, there is a write_to_file method, which save RankList into ?train.txt?. But the format for ?train.txt? is different from the one mentioned in http://trac.xapian.org/wiki/GSoC2011/LTR/Notes#QueryLevelNorm. And in
2014 Mar 11
2
[GSOC 2014] Indexing INEX dataset
Hi Parth,
I?ve implemented SVMRanker class and also sorted out most of current Letor APIs.
Now I?m trying to use INEX dataset to verify my implement. But I stuck in the indexing part. You said in the documentation that we have to add prefix when indexing. Also I notice that you set some metadata in omindex.cc of your version. But the omindex.cc has changed since 2011. I think that?s why my result
2014 May 21
2
Some questions about Letor project
Hi all,
Thank you for giving me the opportunity to work with Xapian :) I am Jiarong
Wei, a third year undergraduate student in Zhejiang University, China. In
GSoC 2014, I will work on Letor module with Hanxiao Sun.
Here are some questions I encountered these days,
1. In letor.cc, we have two parts of functions: the training part and
the ranking part. I?ll use SVMRanker as an example. The
2014 Mar 11
2
[GSOC 2013] Question about indexing INEX dataset
Hi,
I?m trying to use Omega to index INEX dataset for Letor. But omindex told me these xml files are unknown. Olly told me I could tell omindex to handle them as HTML. (Thanks Olly :) ) Is it appropriate? Parth, could you give me some suggestions?
Thank you!
Jiarong Wei
2014 Mar 11
2
[GSOC 2014] Indexing INEX dataset
On Tue, Mar 11, 2014 at 03:20:31PM +0100, Parth Gupta wrote:
> >
> > On current trunk, we index the title with prefix "S" by default in
> > omindex, though with a wdf inc of 5 rather than 1:
> >
> > indexer.index_text(title, 5, "S");
> >
> > So I don't think you need that change to omindex now.
>
> Yes, but please
2014 Mar 17
2
[GSOC 2014] Indexing INEX dataset
Hi Olly,
Wouldn't setting the weight of terms in title back to normal (e.g. 5 to 1)
by below line, automatically adjust the wdfs and field lengths?
indexer.index_text(title, 5, "S"); -> indexer.index_text(title, 1, "S");
if it does not then we should include that part in the patch too. I like to
create a patch for xapian-letor for resolving common code of xapian.
2016 Mar 20
2
GSoC 2016 Letor Stabilisation
Hello,
I'm Ayush from New Delhi, India. I am interested in Letor Stabilisation
project for GSoC. I have a good background in machine learning. Sorry for
getting in so late, university exams were holding me back. I'll try to
cover as much as I can in the coming week.
I am following the plan of attack suggested on the project page. Following
are the things that I have completed:
1.
2016 May 14
2
GSoC 2016 Letor dataset discussion
Hello,
I wanted to decide the dataset that should be used for Letor stabilisation
project.
I think 2009 INEX Wikipedia Collection
<http://www.mpi-inf.mpg.de/departments/databases-and-information-systems/software/inex/>
should work fine. It's a collection of 2,666,190 XML articles, 115 topics
<http://inex.mmci.uni-saarland.de/protected/adhoc/2009-topics.zip>, 50,275
qrel
2014 Mar 05
2
Question regarding LETOR
Hi Parth,
I have a few questions-
1. Could you please provide me with the link for query-file, qrel-file for
the dataset available at
http://www.mpi-inf.mpg.de/departments/d5/software/inex/ .
2. I wanted to know how automated testing would be implemented. Will there
be test cases like a test query must match this particular N results and
this particular ranking. Or will it be in terms of
2017 Mar 23
2
GSoC 2017: Letor Click Data Mining
> You could do that by identifying the search session instead of the user,
> which makes it closer to what we need than to something that might trip you
> into privacy concerns.
Okay, that would be much better. :)
> Third records some information about what sort of query it is — add,
> morelike or a plain query. Last provides the estimated match size and then
> the HTTP
2012 Jul 27
1
A Little Help
Hi Rishabh,
I think its better not to expose RankiList to Letor.h and make it better
user friendly. So my suggestion is to convert RankList to the following
statement in this method.
std::map<Xapian::docid, double> letor_score(const Xapian::MSet & mset);
So just convert the RankList in std::map<Xapian::docid, double> format in
the methods where you need to return.
Parth.
On
2013 Sep 25
2
Is the project learning to rank need to be improved?
As Olly has already pointed out the 2012 branch is not merged.
I think there are some compilation errors in the branch.
The code in branch is better refactored. The Ranker and FeatureManager
classes are well defined and implemented.
Parth.
On Wed, Sep 25, 2013 at 9:02 AM, Olly Betts <olly at survex.com> wrote:
> On Tue, Sep 24, 2013 at 08:34:10PM +0800, jiangwen jiang wrote:
>
2014 Apr 24
2
Need someone to help me adjust my gsoc proposal
Dear all,
My name is Hanxiao Sun and I am a master student in the University of
Chinese Academy of Sciences. I am so glad to be selected by Xapian as the
GSoC student. This is my first time to work with open source community. I
hope I can do a good job with you in this summer !
In my proposal, I followed the scope of the LTR project on the ideas page.
But, after the interview, it seems to need
2013 Sep 24
2
Is the project learning to rank need to be improved?
HI, Parth:
Thanks for your reply.
There's xapian-letor folder in current master trunk, is this code merged in
2011? the letor code of 2012 is not merged to the trunk?
Regards
2013/9/23 Parth Gupta <pargup8 at gmail.com>
> Dear Jiangwen Jiang,
>
> The current state of the project can be reached from the repo:
> http://trac.xapian.org/wiki/GSoC2012/LTR
>
> I would
2014 Mar 10
2
A few more question about LETOR
1.Could you explain why are these libraries included in all the
xapian-letor headers?
#include<xapian/intrusive_ptr.h>
#include<xapian/types.h>
#include<xapian/visibility.h>
Or just provide me with the documentation of these header. I looked into
these header files but couldn't anything substantial.
2.
2014 Dec 01
2
Contributing to Xapian
I'd suggest that a good thing to look at would be functional tests of the metrics and algorithms in Hanxiao Sun's work from this summer. You'll generally need to go either to the original paper, or find an alternative implementation, to build up a series of tests that demonstrate that the implementation is doing what it is supposed to.
Xapian-core contains a test framework which it
2014 Dec 04
3
Contributing to Xapian
Hi
Maybe I can start off with Hanxiao's branch but I can see there are two
repos.
https://github.com/v-hasu/xapian/tree/gsoc2014-evaluation
https://github.com/v-hasu/xapian/tree/gsoc2014
which one should I pick?
I have already successfully built xapian from the git repository (
https://github.com/xapian/xapian)
Since I am new to ML (well I dont know anything about it?), should I read
the
2012 Apr 24
3
Letor re-factored code
Hello Rishabh,
The attached diff file should help you to refactor the existing letor code
and plug in your code easily. I have defined the header files with the
necessary flow. I have also specified the ranker class, which should be
implemented by a new LTR models, ListNet and ListMLE in your case. The
evaluation file should be used for validating the performance of the
algorithm in the training