similar to: Error while building from git - xapian-letor

Displaying 20 results from an estimated 700 matches similar to: "Error while building from git - xapian-letor"

2016 Mar 20
2
GSoC 2016 Letor Stabilisation
Hello, I'm Ayush from New Delhi, India. I am interested in Letor Stabilisation project for GSoC. I have a good background in machine learning. Sorry for getting in so late, university exams were holding me back. I'll try to cover as much as I can in the coming week. I am following the plan of attack suggested on the project page. Following are the things that I have completed: 1.
2016 Apr 02
2
xapian-letor refactoring and adding tests
Hello, I applied to letor stabilisation project for gsoc. I'd like to use coming weeks to improve the workability of xapian-letor. For that, I'm planning to refactor code in current master and begin writing some tests for it. Before adding tests, I think it would be better if xapian-letor could be made consistent with how xapian-core is written. For that, I'd first like to
2016 May 14
2
GSoC 2016 Letor dataset discussion
Hello, I wanted to decide the dataset that should be used for Letor stabilisation project. I think 2009 INEX Wikipedia Collection <http://www.mpi-inf.mpg.de/departments/databases-and-information-systems/software/inex/> should work fine. It's a collection of 2,666,190 XML articles, 115 topics <http://inex.mmci.uni-saarland.de/protected/adhoc/2009-topics.zip>, 50,275 qrel
2016 Jun 29
2
xapian-letor: FeatureVector discussion
> > > > The approach I was thinking would look something like this: > > * instead of Features, which is really a namespace implemented as a > class, we separate out the calculation of the different features > into distinct subclasses of Feature, whose only job is to calculate > a single feature. Currently the FeatureManager calls these (via >
2016 Jul 30
2
Letor: returning MSet after re-ranking
> > > I'd prefer to avoid adding things to the public API that don't get > used by end users. However because LTR is outside the Xapian build > tree, we can't easily give it privileged access to Xapian internals. > Sorry for a delayed response. The way I was thinking of performing reranking with updated weights was to add a class MSetRanker (basically containing a
2016 Jun 27
2
xapian-letor: FeatureVector discussion
Hello James, Parth, Following our discussion on IRC and on code review, the way FeatureVector class works needs some discussion. Presently, the FeatureVector class is defined as follows, with a fixed number of feature count (19): class FeatureVector::Internal : public Xapian::Internal::intrusive_base{ friend class FeatureVector; double label; double score;
2016 Jun 06
2
Letor stabilisation - project progress
Hello everyone, I have completed introducing some code from v-hasu's branch into mine, mainly for Features, FeatureVector and FeatureManager classes. I have pushed the changes to https://github.com/ayshtmr/xapian/tree/letor-update. I am now proceeding to write unit tests for feature modules. There are a few things that I wanted to clarify: 1. I have introduced a lot of code in a single
2017 Mar 07
1
Normalization in Letor
Hi, Wanted to know if other normalization techniques like normalization by standard-deviation have been tried to normalize the Feature-list in Letor. Regards, Ayush Pandey.
2016 Aug 08
2
Letor: Feature sub-classes question
Hello James, I am working on breaking down Features into sub-classes. Should each of the features get their own sub-class, or should the grouping be done according to type? i.e. query-document pair dependent, query-dependent and document dependent sub-classes. Using this approach makes more sense if we plan to add support for user to include query-dependent and document-dependent features in
2014 Mar 10
2
A few more question about LETOR
1.Could you explain why are these libraries included in all the xapian-letor headers? #include<xapian/intrusive_ptr.h> #include<xapian/types.h> #include<xapian/visibility.h> Or just provide me with the documentation of these header. I looked into these header files but couldn't anything substantial. 2.
2016 Aug 17
2
KMeans - Evaluation Results
I've gone through the link that you sent me and I currently understand how this helps and works to some extent, but I am not too sure of how I should start with converting the current interface to PIMPL design. I'm not used to this design pattern so its taking some time to sink in :) Say I start with the Clusterer class, I create a ClustererImpl class which is the internal class that
2017 Mar 23
2
GSoC 2017: Letor Click Data Mining
> You could do that by identifying the search session instead of the user, > which makes it closer to what we need than to something that might trip you > into privacy concerns. Okay, that would be much better. :) > Third records some information about what sort of query it is — add, > morelike or a plain query. Last provides the estimated match size and then > the HTTP
2014 Mar 18
2
Considering Parallel computing for Letor
Hi everyone, My name is Shreedhar Pawar. I have already introduced myself on Xapian-discuss... I feel that the Xapian Search/Letor Algorithm, can speed up using Parallel computing. Techniques like Map-reduce, 'compact n split', radix sort, scan, parallel hashing n much more can be used to speed up the learning algorithms as well as the search... support vector machines in the Letor
2012 Jul 27
1
A Little Help
Hi Rishabh, I think its better not to expose RankiList to Letor.h and make it better user friendly. So my suggestion is to convert RankList to the following statement in this method. std::map<Xapian::docid, double> letor_score(const Xapian::MSet & mset); So just convert the RankList in std::map<Xapian::docid, double> format in the methods where you need to return. Parth. On
2016 Aug 18
3
KMeans - Evaluation Results
> > > > Actually, you're doing something slightly unusual there: making the > internal member public. Protected would be better, and private is I think > most usual; library clients aren't going to have access to the Internal > class declaration, so they can't call things on it. This means it's > actually difficult right now to subclass Feature. > > I
2016 May 04
2
GSoC 2016 Introduction
Hello everyone, My name is Ayush Tomar. I'll be working on Learning to Rank stabilisation project over the summers. Here are a few things that I plan to do in coming few days: 1. Revise the timeline. There are some portions that I had kept for the first and second week of coding which have already been done (except writing tests). So, I'd like to adjust the timeline according to it. 2.
2016 Mar 13
2
Unable to generate lcov test coverage reports (Out of memory error)
Hi all, I was trying to generate lcov test coverage reports for xapian-core but got an out of memory error: $ lcov --capture --directory . --output-file xapian-core.info Capturing coverage data from . Found gcov version: 4.7.3 Scanning . for .gcda files ... Found 270 data files in . Processing bin/xapian-progsrv.gcda Out of memory! These are the steps I followed in xapian-core directory
2014 Mar 09
2
[GSOC 2014] Some questions about Letor module
Hi, I've read the code of letor module. And I have some questions about it: 1. In https://github.com/rishabhmehrotra/xapian/blob/master/xapian-letor/letor_internal.cc#L299, there is a write_to_file method, which save RankList into ?train.txt?. But the format for ?train.txt? is different from the one mentioned in http://trac.xapian.org/wiki/GSoC2011/LTR/Notes#QueryLevelNorm. And in
2015 Mar 02
1
GSoC 2015 - Weighting Schemes
Hello everyone! I'm Ayush Tomar, junior undergrad in Computer Science from New Delhi, India. I love C++ coding and working on machine learning and information retrieval project. I was exploring the GSoC ideas for Xapian and the project on "Adding Weighting Schemes" looked really interesting to me. I wanted to work on text mining/IR this summer and this idea seems perfect! I have
2014 Mar 04
4
Questions on letor module
Hi, I have several questions regarding the letor module,I looked at the framework of learning to rank in xapian http://rishabhmehrotra.com/gsoc/17.png, I am a little confused. Why using deep learning to find unsupervised features in test data? Since in my understanding, learning to rank model usually learn features from the training data then apply the model to the test data? Why test set and