similar to: Weighting recent results

Displaying 20 results from an estimated 7000 matches similar to: "Weighting recent results"

2016 May 03
2
Weighting recent results
On 5/2/2016 9:03 PM, Olly Betts wrote: > On Fri, Apr 22, 2016 at 12:23:15PM -0400, Alex Aminoff wrote: >> I did some digging and found a thread from 2011 talking about how to >> subclass Xapian::PostingSource in order to incorporate the date or >> recency of a document in its weighting: >> >> http://thread.gmane.org/gmane.comp.search.xapian.general/8849/focus=8856
2016 May 16
2
Weighting recent results
I was thinking about this some more: Is there a reason I can't just weight by some function of recency at indexing time? $weight = get_weight_based_on_recency(...); $tg->index_text($txt,$weight); If I wanted to allow the user the option of searching either in recency-weighted mode or not, I could index each document into 2 different databases, one with and one without. This avoids
2016 May 03
0
Weighting recent results
On Fri, Apr 22, 2016 at 12:23:15PM -0400, Alex Aminoff wrote: > I did some digging and found a thread from 2011 talking about how to > subclass Xapian::PostingSource in order to incorporate the date or > recency of a document in its weighting: > > http://thread.gmane.org/gmane.comp.search.xapian.general/8849/focus=8856 > > As in that thread, I want to be clear that I
2013 Mar 02
3
How to add an custom weight to the relevancy value and sort it.
Hello guys, I have an weight value which is calculated by some factor and i need to add the weight with the relevancy value of a result and sort it with that value is that possible in xapian. Thanks, VishnuKumar -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130302/9831e287/attachment-0001.html>
2017 Sep 12
2
perl bindings to Xapian::Query
QueryParser is great, but I would like to make a query myself, so I can filter results by a specified value (in this case restricting by epoch time after a certain value) My code looks like this, and compiles, and appears like it should work according to the perl source:     my $query = $qp->parse_query($querystr);     if ($datefilter) {         my $filterepoch = time() - ($datefilter
2016 May 18
0
Weighting recent results
On Mon, May 16, 2016 at 12:35:53PM -0400, Alex Aminoff wrote: > I was thinking about this some more: Is there a reason I can't just > weight by some function of recency at indexing time? > > $weight = get_weight_based_on_recency(...); > $tg->index_text($txt,$weight); The second parameter there is a WDF multiplier, which isn't really "weight". It depends on
2017 Sep 28
1
Weighting the author of a doc when that term can also appear as a frequent term in other docs
We have a corpus of academic papers. Sometimes it happens that there is an academic controversy and one paper is a response or rebuttal to another paper. The name of the author of the first paper may appear many times in the second paper. So in light of this, how should we set our weight on the author field? Here is an example: http://www.nber.org/papers/w11215  in which the term
2016 May 03
0
Weighting recent results
On Tue, May 03, 2016 at 07:56:19AM -0400, Alex Aminoff wrote: > Perhaps I am not understanding the basic concept, but I was figuring > we would just write a subclass of PostingSource in C++ that does > what we want, and not bother with the perl bindings. Is that not > possible? I realize that ideally we would develop the general > solution and share our code out to the community,
2018 Jan 22
2
How to get the serialise score returned in Xapian::KeyMaker->operator().
>A possible workaround (and perhaps a better approach) would be to >set BoolWeight as the weighting scheme, then feed in your score as >a weight using a PostingSource. Then it's available via get_weight() >on the MSetIterator object: > >https://getting-started-with-xapian.readthedocs.io/en/latest/advanced/postingsource.html > >You may find that's faster because
2011 May 23
1
More relevance for recent documents
Good afternoon I would like to ask if is possible somehow give more relevance to the recent documents in search results. I dont want to sort results according to the date, I still prefer relevance, but I would like to see recent documents with better scoring. I was trying to add search query using AND_MAYBE, which should use relevance from both subqueries, but it didnt add any benefit to the
2017 Jun 06
1
Test for the end of PostingIterator in perl?
Hi all. I want to iterate over all the documents in my database. my $pi = $db->postlist_begin(""); while ("$pi" =~ qr/END/) { my $oldid = $pi->get_docid; $pi++; #... } That used to work with Search::Xapian in perl version 1.2, but now with xapian-bindings-1.4.4 it does not seem to. How are you supposed to tell when you have reached the
2017 Dec 15
5
How to get the serialise score returned in Xapian::KeyMaker->operator().
HI, all, I am a user of Xapian, and now I have a problem in using it. After using boolean terms to get some candidates of documents (still too much), we want sorted them by self-defined function which is used in Xapian::KeyMaker->operator(). But how can I get the serialise score in Xapian::MSetIterator object. c++ code likes this: class SortKeyMaker : public Xapian::KeyMaker { std::string
2016 May 09
1
Given a document, how do you get its ID? (perl bindings)
I am writing an indexer that will crawl our web site. Following the recommendation here: https://trac.xapian.org/wiki/FAQ/UniqueIds I'm using the URL as the unique ID for each document. I see how to get a document from the xapian database if I know its URL, but what I need is also to be able to find out the URL from the document. Does this mean I need to store the URL in a value in
2010 Aug 27
1
Using relevance when sorting by generated key
Hi all, I am trying to implement a scheme where documents in an MSet will be sorted based on relevance as well as geographical distance from a given (non-fixed) point. I understand that this can be accomplished by using PostingSource in order to implement a custom weighting scheme that would combine BM25 weighting and distance. The problem is that I am using perl and PostingSource is not
2012 Mar 05
1
Interested in IR, Getting started with Xapian
Hi everyone, I'm Akshay, an Information Science undergrad from Bangalore. I'm interested in Information Retrieval and I'd like to contribute to Xapian as a part of GSoC and later to feed my interests. I liked the idea of adding more weighting schemes (Project #2). I did a project last semester on Document Retrieval on Hadoop using TF-IDF and Cosine Similarity (the query had to be a
2010 Jun 08
1
PHP MatchDecider in 1.2.0
Good afternoon I would like to ask if is possible in new version of Xapian some kind of subclasing MatchDecider in PHP. I am developing application where is very complex permission management. so it cannot be stored in fultext database. MatchDeciders are perfect for this purpose. But I dont know if is possible to use them in PHP even if SWIG now supports directors. Now Iam using
2011 Aug 09
3
what is the fastest way to fetch results which are sorted by timestamp ?
what is the fastest way to fetch results which are sorted by timestamp ? i want to use xapian as my search engine , use add_boolean_term(something) and add_value(0,sortable_serialise(get_timestamp())) to a doc. search through enquire.set_weighting_scheme(xapian.BoolWeight()) and enquire.set_sort_by_value(0,True) to ensure that the results are sorted by the timestamp. This method is ok , but
2008 Sep 10
1
mu-0.2, maildir indexer/searcher with xapian support
Hi all, [ Hopefully announcements like this are appropriate here... ] I've just released version 0.2 of my maildir scanner/search called 'mu': http://www.djcbsoftware.nl/code/mu/ It it's written in C and a bit of C++, and released under the GPL. Thanks for the help I got here, it was quite easy to integrate Xapian, and it works really nice -- a high quality product. Great job!
2011 Feb 11
2
PHP Binding and SWIG Director for XapianMatchDecider
Hi, I try to use XapianMatchDecider to filtering results, but the code : $Decider = new MatchDecider_Search($XapianDatabase->_cPtr); $mset = $XapEnquire->get_mset(0, 3000, null, $Decider); Always return : PHP Fatal error: No matching function for overloaded 'Enquire_get_mset' in xapian.php on line 1082 I saw that SWIG was not supporting Directors for PHP, but now, it seems to
2008 Oct 09
3
Sorting results by a "sort expression"
Olly, We currently use Sphinx for our website search function, but we're planning on using Xapian instead for a few of the extra features it has. Our website is written in Ruby on Rails, so of course we're using Xapian with Ruby bindings. I don't know if you're familiar with Sphinx but Sphinx allows you to pass a sort expression when you execute the search that will be evaluated