Displaying 20 results from an estimated 1000 matches similar to: "GSoC 2016 Introduction"
2016 Jun 06
2
Letor stabilisation - project progress
Hello everyone,
I have completed introducing some code from v-hasu's branch into mine,
mainly for Features, FeatureVector and FeatureManager classes. I have
pushed the changes to https://github.com/ayshtmr/xapian/tree/letor-update. I
am now proceeding to write unit tests for feature modules.
There are a few things that I wanted to clarify:
1. I have introduced a lot of code in a single
2016 Jun 27
2
xapian-letor: FeatureVector discussion
Hello James, Parth,
Following our discussion on IRC and on code review, the way FeatureVector
class works needs some discussion.
Presently, the FeatureVector class is defined as follows, with a fixed
number of feature count (19):
class FeatureVector::Internal : public Xapian::Internal::intrusive_base{
friend class FeatureVector;
double label;
double score;
2016 Jun 29
2
xapian-letor: FeatureVector discussion
>
>
>
> The approach I was thinking would look something like this:
>
> * instead of Features, which is really a namespace implemented as a
> class, we separate out the calculation of the different features
> into distinct subclasses of Feature, whose only job is to calculate
> a single feature. Currently the FeatureManager calls these (via
>
2016 Mar 20
2
GSoC 2016 Letor Stabilisation
Hello,
I'm Ayush from New Delhi, India. I am interested in Letor Stabilisation
project for GSoC. I have a good background in machine learning. Sorry for
getting in so late, university exams were holding me back. I'll try to
cover as much as I can in the coming week.
I am following the plan of attack suggested on the project page. Following
are the things that I have completed:
1.
2015 Mar 02
1
GSoC 2015 - Weighting Schemes
Hello everyone!
I'm Ayush Tomar, junior undergrad in Computer Science from New Delhi,
India. I love C++ coding and working on machine learning and information
retrieval project. I was exploring the GSoC ideas for Xapian and the
project on "Adding Weighting Schemes" looked really interesting to me. I
wanted to work on text mining/IR this summer and this idea seems perfect!
I have
2016 Aug 08
2
Letor: Feature sub-classes question
Hello James,
I am working on breaking down Features into sub-classes. Should each of the
features get their own sub-class, or should the grouping be done according
to type? i.e. query-document pair dependent, query-dependent and document
dependent sub-classes.
Using this approach makes more sense if we plan to add support for user to
include query-dependent and document-dependent features in
2016 Apr 02
2
xapian-letor refactoring and adding tests
Hello,
I applied to letor stabilisation project for gsoc. I'd like to use coming
weeks to improve the workability of xapian-letor. For that, I'm planning to
refactor code in current master and begin writing some tests for it.
Before adding tests, I think it would be better if xapian-letor could be
made consistent with how xapian-core is written. For that, I'd first like
to
2016 Mar 08
2
Error while building from git - xapian-letor
Hi all,
While building from git with xapian-letor not ignored in bootstrap, I am
getting the following make error:
In function `main':
/home/ayush/Desktop/xapian/xapian-letor/bin/xapian-letor-update.cc:98:
undefined reference to `Xapian::Internal::str(unsigned int)'
/home/ayush/Desktop/xapian/xapian-letor/bin/xapian-letor-update.cc:99:
undefined reference to
2016 May 14
2
GSoC 2016 Letor dataset discussion
Hello,
I wanted to decide the dataset that should be used for Letor stabilisation
project.
I think 2009 INEX Wikipedia Collection
<http://www.mpi-inf.mpg.de/departments/databases-and-information-systems/software/inex/>
should work fine. It's a collection of 2,666,190 XML articles, 115 topics
<http://inex.mmci.uni-saarland.de/protected/adhoc/2009-topics.zip>, 50,275
qrel
2016 Mar 13
2
Unable to generate lcov test coverage reports (Out of memory error)
Hi all,
I was trying to generate lcov test coverage reports for xapian-core but got
an out of memory error:
$ lcov --capture --directory . --output-file xapian-core.info
Capturing coverage data from .
Found gcov version: 4.7.3
Scanning . for .gcda files ...
Found 270 data files in .
Processing bin/xapian-progsrv.gcda
Out of memory!
These are the steps I followed in xapian-core directory
2016 Aug 18
3
KMeans - Evaluation Results
>
>
>
> Actually, you're doing something slightly unusual there: making the
> internal member public. Protected would be better, and private is I think
> most usual; library clients aren't going to have access to the Internal
> class declaration, so they can't call things on it. This means it's
> actually difficult right now to subclass Feature.
>
> I
2016 Jul 30
2
Letor: returning MSet after re-ranking
>
>
> I'd prefer to avoid adding things to the public API that don't get
> used by end users. However because LTR is outside the Xapian build
> tree, we can't easily give it privileged access to Xapian internals.
>
Sorry for a delayed response.
The way I was thinking of performing reranking with updated weights was to
add a class MSetRanker (basically containing a
2016 Aug 17
2
KMeans - Evaluation Results
I've gone through the link that you sent me and I currently understand how
this helps and works to some extent, but I am not too sure of how I should
start with converting the current interface to PIMPL design. I'm not used
to this design pattern so its taking some time to sink in :)
Say I start with the Clusterer class, I create a ClustererImpl class which
is the internal class that
2017 Mar 26
2
Draft Proposal
Hi,
I have submitted a draft proposal on the GSoC website but I also
wanted to share my draft proposal through Dropbox to get your feedback
quickly through it. Please review it and let me know your feedbacks as soon
as possible. I haven't written much about automated testing as I haven't
written any test yet ( I will learn writing automated tests before April
end ). Link to my
2010 Apr 18
2
Table or file as STDIN to the R Script
Hi all,
I am trying to call the R script/ R Program via my Shell Script. I need to
pass a file to the R script as an argument. When I am trying to do that I am
getting error and only first line of the file is being read. Can you tell me
how should I do that so that it reads the entire file in a R object so that
later I am able to process the matrix operation on that file. It is a tab
delimited
2016 Feb 14
4
Implementation of substring search in omegascript
Hi,
I'm Ayush an undergraduate Computer Science student from Thapar university,
India. I was fiddling with xapian since the morning and trying to
understand the code and internals of Xapian.
I tried implementing the bite sized project idea posted here:
https://trac.xapian.org/wiki/ProjectIdeas#AddnewOmegaScriptcommandtodoasubstringsearch
but could not understand what needs to be returned when
2010 Apr 01
1
Regarding the De-bugger in R
Hi,
I would like to know if there is some debugger in R where I can check that I
am not using or not doing calculation on my previously stored objects. I
can't use rm (list = ls()) to remove all the objects since I am using a for
loop for reading 500 files and making making common calculation for each
file, therefore I need to keep the track of my iterator. Is it possible to
remove everything
2004 Apr 16
1
Problem on choosing the OCFS version for RH 2.1
Hi All,
The Linux version of the machine on which I want to install OCFS is:
$uname -r
2.4.9-e.25kmap1enterprise
Can any body suggest me which version and the module of OCFS should I
install.
The machine on which I want to install has 2-CPUs.
Thanks,
Ayush
2010 Apr 18
1
Calling R using Shell Script and give input file to the R script
Hi,
I am trying to call the R script using shell script and pass the input file
as an argument to the R script. Can somebody give me some insights on this
problem.
Thanks.
--
Regards,
Ayush Raman
[[alternative HTML version deleted]]
2014 May 21
2
Some questions about Letor project
Hi all,
Thank you for giving me the opportunity to work with Xapian :) I am Jiarong
Wei, a third year undergraduate student in Zhejiang University, China. In
GSoC 2014, I will work on Letor module with Hanxiao Sun.
Here are some questions I encountered these days,
1. In letor.cc, we have two parts of functions: the training part and
the ranking part. I?ll use SVMRanker as an example. The