search for: aarshkshah1992

Displaying 20 results from an estimated 39 matches for "aarshkshah1992".

2013 Jan 27
1
Added a python example to the community page
Hey guys, I have added a Python indexer example to the SampleCode page of our wiki. Please do have a look. The code can also be found here: https://github.com/aarshkshah1992/xapian/blob/efcf443527b74326119bbc0935fc41a002ce60db/xapian-bindings/python/docs/examples/simpleindexgrep.py/ Thanks :) -Regards -Aarsh
2013 Apr 11
1
Added support for TfIdf to Omega
Hello guys, I have added code for tfidf to the weight.cc file in omega/. Here is the patch: https://github.com/aarshkshah1992/xapian/commit/5ff41a15f574e6780cc61e67e7f3da3d97ff4ec8 It compiles well and I think it'll work well. Here's the link to the documentation file omegascript.rst where I've added tfidf: https://github.com/aarshkshah1992/xapian/commit/9434ad15ad8b69691ad45f2d340450b3070f524e Please let...
2014 May 14
2
Starting work on Perf Test Module
Hello, I am beginning work on the perf test module. The initial steps that I aim to accomplish are: -> Download the Wikipedia dumps for multiple languages. -> Write Python scripts to tokenize the dump (will probably use something like NLTK, which has powerful built-in tokenizers). -> Discuss and finalize the design of the search and query expansion perf tests as I want to complete them
2013 Mar 26
1
Merging of the TfIdf patch
Hello guys. I have updated the code, tests, documentation, makefile entries and the registry entry of the TfIdf patch as per the feedback. Please do let me know if any additional changes are required before the patch can be merged. -Regards -Aarsh On Sun, Mar 3, 2013 at 2:50 PM, aarsh shah <aarshkshah1992 at gmail.com> wrote: > Hello guys. I have sent a pull request for the code and tests of the Tf-Idf > weighting scheme. > Please do let me know if any changes are required. Meanwhile, I'll begin > working on implementing normalizations which require additional statistics > and on the...
2014 Mar 04
2
Test Dataset for performance and accuracy analysis
Hi Parth, I implemented DFR algorithms in Xapian as a part of GSoC last year under the mentorship of Olly. This year, I want to work on analyzing and optimizing the performance of the DFR algorithms and comparing them with BM25. I also want to work on profiling the query expansion schemes and test the relevance (precision and recall) / speed (time taken) of the
2012 Dec 08
2
Want to contribute code to the Xapian project
Hey guys, I am a 3rd year Computer Science undergrad student. I am extremely interested in contributing code to the Xapian project. The work you people do sounds extremely fascinating and interesting. Can someone just give me a brief overview of how to proceed? I can code in C, C++ and Python and have experience in Natural Language Processing. Am also quite comfortable with NLTK and using WordNet. Am
2013 May 15
0
Better parsing of BM25 parameters in Omega
...ys, as discussed on IRC, I have written some code for better parsing of BM25 parameters in Omega. If no parameters are specified, it uses the defaults for all of them. However, if some are specified and some are not, or if invalid values are given for any of them, it throws an error. https://github.com/aarshkshah1992/xapian/commit/ac0a11f5d8ff975fad1e96e63764eab9b04dfcfb -Regards -Aarsh
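The all-or-nothing rule described in this message can be sketched in a few lines of Python. This is only an illustration of the stated behaviour, not the code from the linked commit (which is C++ inside Omega). The function name `parse_bm25_params`, the space-separated input format, and the parameter order are assumptions made for the example; the default values are assumed to match the documented defaults of `Xapian::BM25Weight`.

```python
import math

# Assumed defaults, in the assumed order k1, k2, k3, b, min_normlen.
BM25_DEFAULTS = (1.0, 0.0, 1.0, 0.5, 0.5)


def parse_bm25_params(spec):
    """Parse a space-separated BM25 parameter string (hypothetical format).

    An empty string yields the defaults; anything else must supply all
    five parameters as finite numbers, otherwise ValueError is raised.
    """
    fields = spec.split()
    if not fields:
        # No parameters given: default all of them.
        return BM25_DEFAULTS
    if len(fields) != len(BM25_DEFAULTS):
        # Some given, some missing: reject rather than guess.
        raise ValueError(
            "expected %d BM25 parameters, got %d"
            % (len(BM25_DEFAULTS), len(fields))
        )
    try:
        values = tuple(float(field) for field in fields)
    except ValueError:
        raise ValueError("BM25 parameters must be numbers: %r" % spec) from None
    if not all(math.isfinite(value) for value in values):
        raise ValueError("BM25 parameters must be finite: %r" % spec)
    return values
```

Rejecting a partial list avoids silently mixing user-supplied values with defaults, which seems to be the intent of the behaviour described above.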
2013 Jul 17
1
Base class for query expansion
Hello Dan and Olly, this is the code for the base class for query expansion that I have written. The code will not compile as I have written only the base class until now. Have yet to use it. Please do tell me what you think of the base class and what changes you suggest I should make before I move forward with the project. https://github.com/xapian/xapian/pull/23 -Regards -Aarsh
2013 Mar 03
0
Added code and tests for the tf-idf weighting scheme.
...." > > > Today's Topics: > > 1. Sent a pull request for the Tf-Idf Weighting scheme (aarsh shah) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Tue, 26 Feb 2013 02:21:52 +0530 > From: aarsh shah <aarshkshah1992 at gmail.com> > Subject: [Xapian-devel] Sent a pull request for the Tf-Idf Weighting > scheme > To: Xapian Development <xapian-devel at lists.xapian.org> > Message-ID: > < > CABz8NmSN0E-P54Sp9Rr4VWbSbYw+ryv5mOpwkKaxHBtf1hJdjg at mail.gmail.com> >...
2013 Mar 27
1
Need help as Pl2 tests not performing as expected
...possible statistic of the Mset is less than max attained and so I'll have to have a look at the code again. This may take some time, as PL2 has a very complex formula and it's a bit hard to understand what's happening where. -Regards -Aarsh On Wed, Mar 27, 2013 at 6:23 PM, aarsh shah <aarshkshah1992 at gmail.com> wrote: > Hello guys. I just realized that I've not set the weighting scheme to PL2 > in the tests for PL2 and so a default weighting scheme of BM25 is used. I > am extremely sorry for this and am updating the tests by setting the > weighting scheme to PL2. > > -Reg...
2013 Mar 04
2
Need Beginner Guide for Matcher Optimisations Project
Hi, While searching for a project which matches my interest and skill level, I found this project named Matcher Optimization. This project is really challenging and exciting from my viewpoint and I would like to be a part of it. The optimization techniques mentioned in the reference links provided will take some time for me to understand well. But I am trying to get my
2013 Mar 03
0
Sent a pull request for testing TradWeight using an Rset.
...----- > An HTML attachment was scrubbed... > URL: < > http://lists.xapian.org/pipermail/xapian-devel/attachments/20130228/fb776ded/attachment.html > > > > ------------------------------ > > Message: 2 > Date: Thu, 28 Feb 2013 23:43:46 +0530 > From: aarsh shah <aarshkshah1992 at gmail.com> > Subject: [Xapian-devel] Implemented test coverage for using TradWeight > with an Rset > To: Xapian Development <xapian-devel at lists.xapian.org> > Message-ID: > < > CABz8NmRc66c+-qvQec7QpXMn0VQ0Fu0FSizj4xQwwdV5aVCUWg at mail.gmail.co...
2013 Feb 19
2
Implementing tf-idf weighting scheme in Xapian
Hello guys. I just read up about tf-idf schemes and want to implement it in Xapian (with some frequently used normalizations) as it will also give me a good hang of implementing a weighting scheme before I start working on implementing DFR schemes. I read the following as references and I think I've understood it well and can write the hack :- 1.)
2013 Jan 31
0
Modify Eset to allow specification of weighing scheme
Hey guys, hi :) I am now beginning work on modification of Eset so that the user can specify one of Xapian's three built-in weighting schemes to build the Eset or code his own (similar to what we already allow for Mset). Will read the code for Mset and Eset and see where it differs so as to know what code to write for Eset. Will mail the community once I am done with it. :) -Regards
2013 Feb 14
0
Thanks for the detailed feedback on the stemmer
Hey, hi Olly :) Thank you so much for your detailed help and feedback on the stemmer patch. I've begun working on all the points that you mentioned. Will send a new pull request in a couple of days. :) Sorry for the shabby patch, I'm just new to this stuff, I'll do better next time. :) -Regards -Aarsh
2013 Feb 25
0
Sent a pull request for the Tf-Idf Weighting scheme
Hello guys :) I have sent a pull request for the Tf-Idf Weighting scheme incorporating as many normalizations as I could with the help of statistics currently available from Xapian::Weight. Please let me know what you all think about it. I used the weighting scheme in a simple searcher and it did a fine job with it. I have no experience with writing tests for features like this. Please give me
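For readers unfamiliar with tf-idf, the textbook form of the weight can be sketched as follows. This is a generic illustration, not the pull request's implementation: the function `tfidf_weight` is hypothetical, the argument names merely borrow Xapian's terminology (wdf for within-document frequency, termfreq for the number of documents containing the term), and the choice of raw tf with a natural-log idf and no length normalization is an assumption. The patch itself may use different normalizations.

```python
import math


def tfidf_weight(wdf, termfreq, collection_size):
    """Return a raw-tf times log-idf weight for one term in one document.

    wdf: occurrences of the term in the document.
    termfreq: number of documents that contain the term.
    collection_size: total number of documents in the collection.
    """
    if wdf <= 0 or termfreq <= 0:
        # A term that is absent contributes nothing.
        return 0.0
    idf = math.log(collection_size / termfreq)
    return wdf * idf
```

A term occurring in every document gets an idf of zero under this variant, which is why rarer terms dominate the ranking.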
2013 Feb 27
0
Sent a pull request for an example weighting scheme that requires statistics
Hello guys, as discussed on IRC, I've added an example of a weighting scheme that requires statistics to the howto manual of our Getting Started guide. Please do let me know if any modifications are required. The pull request is here: https://github.com/jaylett/xapian-docsprint/pull/3 -Regards -Aarsh
2013 Feb 28
0
Implemented test coverage for using TradWeight with an Rset
Hello guys. Olly had pointed out on IRC that there is no test for using the TradWeight scheme with an Rset and had directed me to the test coverage of the TradWeight scheme: http://lcov.xapian.org/latest/weight/tradweight.cc.gcov.html I have added a test to api_anydb.cc which tests the TradWeight scheme using an Rset. I simply modified the rset2 test in api_anydb.cc so that the Mset is made
2013 Mar 04
0
Corrected errors in TradWeight test as per feedback .
Hey guys, hi. :) I've corrected the mistakes in the TradWeight test commits that Olly pointed out and have added them to the pull request. Thank you for the feedback. -Regards -Aarsh
2013 Mar 05
0
Please take a look at the TfIdf patch
Hello guys, :) Please do take a look at the pull request for the TfIdf patch I've sent because I want to start working on writing DFR schemes for us and want to incorporate the feedback into making a good hack for the DFR schemes. The patch incorporates all normalizations possible with our current statistics and passed all the tests I wrote for it. Have also attached the tests with the pull request.