thr3ads.net - similar to: "Letor: Feature sub-classes question"

Displaying 20 results from an estimated 3000 matches similar to: "Letor: Feature sub-classes question"

2016 Jun 27

xapian-letor: FeatureVector discussion

Hello James, Parth, Following our discussion on IRC and on code review, the way FeatureVector class works needs some discussion. Presently, the FeatureVector class is defined as follows, with a fixed number of feature count (19): class FeatureVector::Internal : public Xapian::Internal::intrusive_base{ friend class FeatureVector; double label; double score;

Letor stabilisation - project progress

2016 Jun 06

Letor stabilisation - project progress

Hello everyone, I have completed introducing some code from v-hasu's branch into mine, mainly for Features, FeatureVector and FeatureManager classes. I have pushed the changes to https://github.com/ayshtmr/xapian/tree/letor-update. I am now proceeding to write unit tests for feature modules. There are a few things that I wanted to clarify: 1. I have introduced a lot of code in a single

GSoC 2016 Letor dataset discussion

2016 May 14

GSoC 2016 Letor dataset discussion

Hello, I wanted to decide the dataset that should be used for Letor stabilisation project. I think 2009 INEX Wikipedia Collection <http://www.mpi-inf.mpg.de/departments/databases-and-information-systems/software/inex/> should work fine. It's a collection of 2,666,190 XML articles, 115 topics <http://inex.mmci.uni-saarland.de/protected/adhoc/2009-topics.zip>, 50,275 qrel

GSoC 2016 Letor Stabilisation

2016 Mar 20

GSoC 2016 Letor Stabilisation

Hello, I'm Ayush from New Delhi, India. I am interested in Letor Stabilisation project for GSoC. I have a good background in machine learning. Sorry for getting in so late, university exams were holding me back. I'll try to cover as much as I can in the coming week. I am following the plan of attack suggested on the project page. Following are the things that I have completed: 1.

xapian-letor: FeatureVector discussion

2016 Jun 29

xapian-letor: FeatureVector discussion

> > > > The approach I was thinking would look something like this: > > * instead of Features, which is really a namespace implemented as a > class, we separate out the calculation of the different features > into distinct subclasses of Feature, whose only job is to calculate > a single feature. Currently the FeatureManager calls these (via >

Error while building from git - xapian-letor

2016 Mar 08

Error while building from git - xapian-letor

Hi all, While building from git with xapian-letor not ignored in bootstrap, I am getting the following make error: In function `main': /home/ayush/Desktop/xapian/xapian-letor/bin/xapian-letor-update.cc:98: undefined reference to `Xapian::Internal::str(unsigned int)' /home/ayush/Desktop/xapian/xapian-letor/bin/xapian-letor-update.cc:99: undefined reference to

GSoC 2016 Introduction

2016 May 04

GSoC 2016 Introduction

Hello everyone, My name is Ayush Tomar. I'll be working on Learning to Rank stabilisation project over the summers. Here are a few things that I plan to do in coming few days: 1. Revise the timeline. There are some portions that I had kept for the first and second week of coding which have already been done (except writing tests). So, I'd like to adjust the timeline according to it. 2.

Letor: returning MSet after re-ranking

2016 Jul 30

Letor: returning MSet after re-ranking

> > > I'd prefer to avoid adding things to the public API that don't get > used by end users. However because LTR is outside the Xapian build > tree, we can't easily give it privileged access to Xapian internals. > Sorry for a delayed response. The way I was thinking of performing reranking with updated weights was to add a class MSetRanker (basically containing a

xapian-letor refactoring and adding tests

2016 Apr 02

xapian-letor refactoring and adding tests

Hello, I applied to letor stabilisation project for gsoc. I'd like to use coming weeks to improve the workability of xapian-letor. For that, I'm planning to refactor code in current master and begin writing some tests for it. Before adding tests, I think it would be better if xapian-letor could be made consistent with how xapian-core is written. For that, I'd first like to

GSoC 2015 - Weighting Schemes

2015 Mar 02

GSoC 2015 - Weighting Schemes

Hello everyone! I'm Ayush Tomar, junior undergrad in Computer Science from New Delhi, India. I love C++ coding and working on machine learning and information retrieval project. I was exploring the GSoC ideas for Xapian and the project on "Adding Weighting Schemes" looked really interesting to me. I wanted to work on text mining/IR this summer and this idea seems perfect! I have

KMeans - Evaluation Results

2016 Aug 17

KMeans - Evaluation Results

I've gone through the link that you sent me and I currently understand how this helps and works to some extent, but I am not too sure of how I should start with converting the current interface to PIMPL design. I'm not used to this design pattern so its taking some time to sink in :) Say I start with the Clusterer class, I create a ClustererImpl class which is the internal class that

KMeans - Evaluation Results

2016 Aug 18

KMeans - Evaluation Results

> > > > Actually, you're doing something slightly unusual there: making the > internal member public. Protected would be better, and private is I think > most usual; library clients aren't going to have access to the Internal > class declaration, so they can't call things on it. This means it's > actually difficult right now to subclass Feature. > > I

Unable to generate lcov test coverage reports (Out of memory error)

2016 Mar 13

Unable to generate lcov test coverage reports (Out of memory error)

Hi all, I was trying to generate lcov test coverage reports for xapian-core but got an out of memory error: $ lcov --capture --directory . --output-file xapian-core.info Capturing coverage data from . Found gcov version: 4.7.3 Scanning . for .gcda files ... Found 270 data files in . Processing bin/xapian-progsrv.gcda Out of memory! These are the steps I followed in xapian-core directory

Normalization in Letor

2017 Mar 07

Normalization in Letor

Hi, Wanted to know if other normalization techniques like normalization by standard-deviation have been tried to normalize the Feature-list in Letor. Regards, Ayush Pandey.

GSoC 2017: Letor Click Data Mining

2017 Mar 21

GSoC 2017: Letor Click Data Mining

Hi Olly. Thanks for your reply to the previous email. To have an appropriate subject I've started this new thread for further discussions. > There's a $log{} command available in Omega templates. We can't log from > the result page template, as the clicks happen after that is used, but we > could make result links redirect via a second Omega template which does > the

Implementation of substring search in omegascript

2016 Feb 14

Implementation of substring search in omegascript

Hi, I'm Ayush an undergraduate Computer Science student from Thapar university, India. I was fiddling with xapian since the morning and trying to understand the code and internals of Xapian. I tried implementing the bite sized project idea posted here: https://trac.xapian.org/wiki/ProjectIdeas#AddnewOmegaScriptcommandtodoasubstringsearch but could not understand what needs to be returned when

GSoC 2017: Letor Click Data Mining

2017 Mar 23

GSoC 2017: Letor Click Data Mining

> You could do that by identifying the search session instead of the user, > which makes it closer to what we need than to something that might trip you > into privacy concerns. Okay, that would be much better. :) > Third records some information about what sort of query it is — add, > morelike or a plain query. Last provides the estimated match size and then > the HTTP

Letor: returning MSet after re-ranking

2016 Jul 31

Letor: returning MSet after re-ranking

On Sun, Jul 31, 2016 at 12:44:16AM +0100, Olly Betts wrote: > Would a method which swapped two elements of an MSet provide what you > need? That would provide a more generic way to adjust the ranking of > an MSet which for example could be used to implement a diversification > feature or something like SQL "GROUP BY". Isn't the most common use going to be that the

Integration of xapian in a framework

2016 Mar 10

Integration of xapian in a framework

Hello devs! Could you please expand on the project idea of integration of xapian in a framework with an example. I did not fully understand the requirements of this project. Also I want to discuss an idea of my own. Xapian doesn't have an auto complete feature. It is quite common for an search engine to have an auto complete feature. What I propose is a API that is totally separate from

GSoC 2017: Letor Click Data Mining

2017 Mar 22

GSoC 2017: Letor Click Data Mining

Hi James, > Isn't this from the query template, ie from the main web page of search > results? (It might make sense from opensearch as well, though.) Yes, you are right; it is the query template. The reason I said opensearch template is that I haven't quite read all sections of the Omega docs and I'm still in the process. Thanks for pointing that out. I'm aiming to cover

similar to: Letor: Feature sub-classes question