Displaying 20 results from an estimated 5000 matches similar to: "GSOC 2017 Project: Learning to Rank Click Data Mining"
2017 Mar 23
2
GSoC 2017: Letor Click Data Mining
> You could do that by identifying the search session instead of the user,
> which makes it closer to what we need than to something that might trip you
> into privacy concerns.
Okay, that would be much better. :)
> Third records some information about what sort of query it is — add,
> morelike or a plain query. Last provides the estimated match size and then
> the HTTP
2017 Mar 22
2
GSoC 2017: Letor Click Data Mining
Hi James,
> Isn't this from the query template, ie from the main web page of search
> results? (It might make sense from opensearch as well, though.)
Yes, you are right; it is the query template. The reason I said opensearch
template is that I haven't quite read all sections of the Omega docs and I'm
still in the process. Thanks for pointing that out.
I'm aiming to cover
2017 Mar 21
2
GSoC 2017: Letor Click Data Mining
Hi Olly. Thanks for your reply to the previous email.
To have an appropriate subject I've started this new thread for further
discussions.
> There's a $log{} command available in Omega templates. We can't log from
> the result page template, as the clicks happen after that is used, but we
> could make result links redirect via a second Omega template which does
> the
2017 Apr 23
2
Question about the ticket #743 omindex: delay libmagic checks
>
> I'd suggest to start with you just look at moving the libmagic check after
> the filesize checks, so you don't need to get into whether libmagic or
> the database check is cheaper on average.
hi, Olly, I have moved the libmagic check after the filesize check directly,
https://github.com/caiyulun/xapian/commit/3a97d9ee5397fa900a473aa9b3d8eeb720177a4e
can you provide
2013 Sep 25
2
Is the project learning to rank need to be improved?
As Olly has already pointed out the 2012 branch is not merged.
I think there are some compilation errors in the branch.
The code in branch is better refactored. The Ranker and FeatureManager
classes are well defined and implemented.
Parth.
On Wed, Sep 25, 2013 at 9:02 AM, Olly Betts <olly at survex.com> wrote:
> On Tue, Sep 24, 2013 at 08:34:10PM +0800, jiangwen jiang wrote:
>
2013 Sep 24
2
Is the project learning to rank need to be improved?
HI, Parth:
Thanks for your reply.
There's xapian-letor folder in current master trunk, is this code merged in
2011? the letor code of 2012 is not merged to the trunk?
Regards
2013/9/23 Parth Gupta <pargup8 at gmail.com>
> Dear Jiangwen Jiang,
>
> The current state of the project can be reached from the repo:
> http://trac.xapian.org/wiki/GSoC2012/LTR
>
> I would
2011 Jun 07
0
Introduction and Discussion for Learning to Rank Framework
Hello All,
This time we are working on a weighting scheme "Learning to Rank" which
involves machine learning and its a supervised ranking scheme unlike
unsupervised schemes like BM25 under GSoC project.
This mail intends to discuss the framework of the Learning to Rank in Xapian
as a whole. I have thought of the following framework, pour in your insights
or issues for the same. This is
2013 Mar 21
1
GSOC - 2013 - Introduction (Learning to Rank)
Hello Everyone,
I am looking forward to contribute to Xapian and also apply as a Google
Summer of Code student. I would like to start by introducing myself. I am a
final year M.Sc.(H) Chemistry and B.E. (H) Electronics and Instrumentation
student at BITS - Pilani, Goa. I am interested in Machine Learning and
presently pursuing my thesis on the same. I have been selected for Google
Summer of Code
2017 Jun 03
2
Logging the click data
Hi,
I spent this week mostly understanding how the existing templates work and
setting up Omega CGI on my system to have a better grasp over how things
actually look like when using different templates.
This helped me look at things associated with logging the click data from
a better perspective. As already documented on the project's wiki page,
we need the following fields in separater
2012 Apr 01
1
[GSoC2012] Learning to Rank: few thoughts/issues
Hello,
I would like to work with Orange as part of GSoC 2012(and continue
henceforth). Apologies for joining in a bit late- i was waiting to get a
proper grasp of things before discussing it here. Currently I am a Masters
students in Mathematics with my bachelors in Computer Science[integrated
dual degree]. Over the last year and a half, I have worked on a few ML
projects and have a couple of
2012 Mar 24
3
Learning to rank
Dear Sir,
I am Pankaj Singhal from Jaipur, India. I am very much
interested and strongly looking forward in getting involved in this project
Learning-to-Rank.
My previous experience in this field is good. Last semester I did a similar
job of ranking the URLs of the given huge dataset based on their attribute
values. The dataset consisted hundreds of thousands of URLs and each url
2014 Mar 11
2
[GSOC 2014] Indexing INEX dataset
Hi Parth,
I?ve implemented SVMRanker class and also sorted out most of current Letor APIs.
Now I?m trying to use INEX dataset to verify my implement. But I stuck in the indexing part. You said in the documentation that we have to add prefix when indexing. Also I notice that you set some metadata in omindex.cc of your version. But the omindex.cc has changed since 2011. I think that?s why my result
2013 Sep 23
2
Is the project learning to rank need to be improved?
Hi, guys:
I found this project idea
http://trac.xapian.org/wiki/GSoCProjectIdeas#Project:LearningtoRank
If it is need to be improved, I will try to handle it
thanks
Regards
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130923/d0ced0d6/attachment-0002.html>
2017 Jun 09
6
Logging the click data
> The log command will be executed wherever it is in the template.
> By "current system user", what do you mean? There'll need to be
> write permission for the CGI process, which is probably the same
> identity as the web server process.
I basically changed the ownership of the /var/log/omega directory to
the current user using:
sudo chown `whoami` /var/log/omega
But I
2017 Apr 20
2
Question about the ticket #743 omindex: delay libmagic checks
Hi,
I'm working on the ticket #743 omindex: delay libmagic checks
<https://trac.xapian.org/ticket/743>. As the ticket's
Description mention, the call to libmagic is expensive than call the stat,
so we can check the size by call the stat to get size before call
libmagic to get a mime type.
But how about the timestamps check? since timestamps check need to iterate
the DB to check if
2014 Mar 04
2
Test Dataset for performance and accuracy analysis
Hi Parth,
I implemented DFR algorithms in Xapian as
a part of GSOC last year under the mentorship of Olly. This year, I want to
work on analyzing and optimizing the performance of the DFR algorithms and
comparing them with BM25.I also want to work on profiling the query
expansion schemes and test the relevance(precision and recall) / speed(time
taken) of the
2016 Mar 08
2
Error while building from git - xapian-letor
Hi all,
While building from git with xapian-letor not ignored in bootstrap, I am
getting the following make error:
In function `main':
/home/ayush/Desktop/xapian/xapian-letor/bin/xapian-letor-update.cc:98:
undefined reference to `Xapian::Internal::str(unsigned int)'
/home/ayush/Desktop/xapian/xapian-letor/bin/xapian-letor-update.cc:99:
undefined reference to
2014 Oct 12
5
Help with xapian
Hi,
I am unable to build the letor module. I am generating the configure file
using autoconf. The configure file generated is throwing the error
./configure: line 2057: syntax error near unexpected token `1.10.1'
./configure: line 2057: `AM_INIT_AUTOMAKE(1.10.1 -Wportability tar-ustar)`
I am not too sure what to do with this. Need help with this.
Thank You
Regards
Karthik
On Mon, Sep 29,
2014 Mar 11
2
[GSOC 2013] Question about indexing INEX dataset
Hi,
I?m trying to use Omega to index INEX dataset for Letor. But omindex told me these xml files are unknown. Olly told me I could tell omindex to handle them as HTML. (Thanks Olly :) ) Is it appropriate? Parth, could you give me some suggestions?
Thank you!
Jiarong Wei
2014 Mar 09
2
[GSOC 2014] Some questions about Letor module
Hi,
I've read the code of letor module. And I have some questions about it:
1. In https://github.com/rishabhmehrotra/xapian/blob/master/xapian-letor/letor_internal.cc#L299, there is a write_to_file method, which save RankList into ?train.txt?. But the format for ?train.txt? is different from the one mentioned in http://trac.xapian.org/wiki/GSoC2011/LTR/Notes#QueryLevelNorm. And in