Hi all, I am Prajwal, a sophomore Undergraduate student from IIT Bombay, India. I want to contribute to Xapian through GSoC 2012. Specifically I am interested in the Weighting Schemes project. I have not yet taken any IR related course, but IR and NLP have always been my academic interests. I want to learn more about these topics, and I think that this is the best opportunity to do so. I thus spent the last one week going through some articles related to IR and weighing schemes (the ones given in the XAPIAN page and from other internet sources). I am very proficient in coding in C++,Java and Python. I am currently going through the documentation of the Xapian library. I wanted to know how I should go about reading and knowing more about the project (to make my theoretical background strong), and what I can possibly do to show that I am deserving for this project, before the application period begins. Again as I said before, I want to make the maximum use of this opportunity to learn more about IR. I request you to browse through my website - www.cse.iitb.ac.in/~prajwal to have a look at my interests, and skills. Thank you, -- Prajwal.A.N. www.cse.iitb.ac.in/~prajwal Sophomore Undergraduate Student, Computer Science and Engineering, IIT Bombay. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20120325/62edff56/attachment.html>
Hi Prajwal, I wanted to know how I should go about reading and knowing more about the> project (to make my theoretical background strong), and what I can possibly > do to show that I am deserving for this project, before the application > period begins. >The best way to go about for the weighting scheme project would be to zero down to one of the potential weighting scheme. That can be Divergence From Randomness (DFR), Language Modelling (LM), Learning to Rank etc. Then you should try to understand the recipe of the weighting scheme from some of the standard sources like an IR book or Scientific papers from the original authors or so. Unfortunately DFR is recent enough not to be involved in the IR book by Prabhakar Raghvan. So the papers would be the best option. Another thing should be to follow the page (below) for getting familiar with Xapian code base and try to figure it out where your scheme fits in. http://trac.xapian.org/wiki/GSoC2012 Of course you can discuss the intermediate or even initial details or questions you come across with the mentors through the mailing list or the IRC. Thanks for your interest. Parth.> Again as I said before, I want to make the maximum use of this > opportunity to learn more about IR. I request you to browse through my > website - www.cse.iitb.ac.in/~prajwal<http://www.cse.iitb.ac.in/%7Eprajwal>to have a look at my interests, and skills. > > Thank you, > -- > Prajwal.A.N. > www.cse.iitb.ac.in/~prajwal <http://www.cse.iitb.ac.in/%7Eprajwal> > Sophomore Undergraduate Student, > Computer Science and Engineering, > IIT Bombay. > > > _______________________________________________ > Xapian-devel mailing list > Xapian-devel at lists.xapian.org > http://lists.xapian.org/mailman/listinfo/xapian-devel > >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20120325/53ac4617/attachment-0001.html>