search for: postingsource

Displaying 20 results from an estimated 34 matches for "postingsource".

2018 Jan 22
2
How to get the serialise score returned in Xapian::KeyMaker->operator().
>A possible workaround (and perhaps a better approach) would be to >set BoolWeight as the weighting scheme, then feed in your score as >a weight using a PostingSource. Then it's available via get_weight() >on the MSetIterator object: > >https://getting-started-with-xapian.readthedocs.io/en/latest/advanced/postingsource.html > >You may find that's faster because it'll mean sorting by doubles instead >of strings. We realise our sco...
2020 Feb 07
2
prioritizing aggregated DBs
Hey all, I've been using ->add_database for a few years to tie sharded DBs together and it works great. Now, I want to be able to search across several DBs which aren't sharded, say: linux-DB, glibc-DB, freebsd-DB. I want to search for something across all of them, but prioritize results to favor one or some of those DBs over others. Is there a way to do that without reindexing? Or
2017 Dec 15
5
How to get the serialise score returned in Xapian::KeyMaker->operator().
HI, all, I am a user of Xapian, and now I have a problem in using it. After using boolean terms to get some candidates of documents (still too much), we want sorted them by self-defined function which is used in Xapian::KeyMaker->operator(). But how can I get the serialise score in Xapian::MSetIterator object. c++ code likes this: class SortKeyMaker : public Xapian::KeyMaker { std::string
2018 Jan 24
0
How to get the serialise score returned in Xapian::KeyMaker->operator().
On Tue, Jan 23, 2018 at 12:55:31AM +0800, 张少华 wrote: > We realise our score function using PostingSource instead of using > KeyMaker, we reference your python example and source code of xapian, > the simple demo is here. > https://github.com/xiangqianzsh/xapian_leaning/blob/master/postingsource/ExternalWeightPostingSource.h I'd just put the get_weight() and get_maxweight() implementati...
2016 Apr 22
2
Weighting recent results
I did some digging and found a thread from 2011 talking about how to subclass Xapian::PostingSource in order to incorporate the date or recency of a document in its weighting: http://thread.gmane.org/gmane.comp.search.xapian.general/8849/focus=8856 As in that thread, I want to be clear that I don't want to sort by date, but rather incorporate date information into the score by which I sor...
2016 May 03
2
Weighting recent results
On 5/2/2016 9:03 PM, Olly Betts wrote: > On Fri, Apr 22, 2016 at 12:23:15PM -0400, Alex Aminoff wrote: >> I did some digging and found a thread from 2011 talking about how to >> subclass Xapian::PostingSource in order to incorporate the date or >> recency of a document in its weighting: >> >> http://thread.gmane.org/gmane.comp.search.xapian.general/8849/focus=8856 >> >> As in that thread, I want to be clear that I don't want to sort by >> date, but rather incorpor...
2020 Feb 07
0
prioritizing aggregated DBs
...t; which aren't sharded, say: linux-DB, glibc-DB, freebsd-DB. > > I want to search for something across all of them, but > prioritize results to favor one or some of those DBs over > others. Is there a way to do that without reindexing? With git master you can achieve this with a PostingSource subclass as there's a new PostingSource::reset() method which gets passed the shard it is being called for, so you can set an extra weight contribution based on that. This is a replacement for PostingSource::init() in 1.4, which doesn't know which shard it is being called for. You can the...
2020 Feb 08
2
prioritizing aggregated DBs
...ay: linux-DB, glibc-DB, freebsd-DB. > > > > I want to search for something across all of them, but > > prioritize results to favor one or some of those DBs over > > others. Is there a way to do that without reindexing? > > With git master you can achieve this with a PostingSource subclass as > there's a new PostingSource::reset() method which gets passed the > shard it is being called for, so you can set an extra weight > contribution based on that. This is a replacement for > PostingSource::init() in 1.4, which doesn't know which shard it is being >...
2013 Apr 26
1
Compiling Xapian within a Cocoa project
...\ ^ In file included from /Users/hat/Projects/OCXapian/OCXapian/OXAppDelegate.mm:9: In file included from /Users/hat/Projects/OCXapian/OCXapian/OXAppDelegate.h:10: In file included from /opt/local/include/xapian.h:50: /opt/local/include/xapian/postingsource.h:223:43: error: too many arguments provided to function-like macro invocation virtual bool check(Xapian::docid did, Xapian::weight min_wt); ^ /opt/local/include/xapian/postingsource.h:223:5: error: 'virtual' can only appear on non-static memb...
2010 Aug 27
1
Using relevance when sorting by generated key
Hi all, I am trying to implement a scheme where documents in an MSet will be sorted based on relevance as well as geographical distance from a given (non-fixed) point. I understand that this can be accomplished by using PostingSource in order to implement a custom weighting scheme that would combine BM25 weighting and distance. The problem is that I am using perl and PostingSource is not available for perl. Besides, I find this solution unnecessarily complicated. A more simple solution would be if the sorter in Xapian::E...
2012 Apr 02
0
GSoC, Xapian Project Weighting Schemes
...the ability to rank search result by the mathematical > formulas like tf*idf andBM25. Actually, you can already rank results by incoming hyperlink counts, or any query-independent factor(s) you want to keep track of, and you can combine that with term-based weights.? This is done by creating a PostingSource subclass and using it to the query: http://xapian.org/docs/postingsource.html > weight S= S1(Weight calculated by BM25) * S2(weight of document > calculated based on You can't multiply the factors like this with a PostingSource, only add them - is there any theoretical or experimental...
2016 May 03
0
Weighting recent results
On Fri, Apr 22, 2016 at 12:23:15PM -0400, Alex Aminoff wrote: > I did some digging and found a thread from 2011 talking about how to > subclass Xapian::PostingSource in order to incorporate the date or > recency of a document in its weighting: > > http://thread.gmane.org/gmane.comp.search.xapian.general/8849/focus=8856 > > As in that thread, I want to be clear that I don't want to sort by > date, but rather incorporate date information i...
2011 May 23
1
More relevance for recent documents
Good afternoon I would like to ask if is possible somehow give more relevance to the recent documents in search results. I dont want to sort results according to the date, I still prefer relevance, but I would like to see recent documents with better scoring. I was trying to add search query using AND_MAYBE, which should use relevance from both subqueries, but it didnt add any benefit to the
2011 Aug 09
3
what is the fastest way to fetch results which are sorted by timestamp ?
what is the fastest way to fetch results which are sorted by timestamp ? i want to use xapian as my search engine , use add_boolean_term(something) and add_value(0,sortable_serialise(get_timestamp())) to a doc. search through enquire.set_weighting_scheme(xapian.BoolWeight()) and enquire.set_sort_by_value(0,True) to ensure that the results are sorted by the timestamp. This method is ok , but
2013 Mar 02
3
How to add an custom weight to the relevancy value and sort it.
Hello guys, I have an weight value which is calculated by some factor and i need to add the weight with the relevancy value of a result and sort it with that value is that possible in xapian. Thanks, VishnuKumar -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130302/9831e287/attachment-0001.html>
2008 Sep 10
1
mu-0.2, maildir indexer/searcher with xapian support
Hi all, [ Hopefully announcements like this are appropriate here... ] I've just released version 0.2 of my maildir scanner/search called 'mu': http://www.djcbsoftware.nl/code/mu/ It it's written in C and a bit of C++, and released under the GPL. Thanks for the help I got here, it was quite easy to integrate Xapian, and it works really nice -- a high quality product. Great job!
2008 Dec 17
1
using ValueWeightPostingSource
Hi, I'm currently using PostingSource to add some weight over the result using a value. I didn't find any documentation on how to use it with the query so i link a query constructed using the posting source and a query made using the query parser with an AND operator : Xapian.Query queryText = parser.ParseQuery("test:"...
2017 Dec 16
0
How to get the serialise score returned in Xapian::KeyMaker->operator().
...and it seems like it ought to be accessible but there's no accessor method for it - I can add one but that won't help for existing releases. A possible workaround (and perhaps a better approach) would be to set BoolWeight as the weighting scheme, then feed in your score as a weight using a PostingSource. Then it's available via get_weight() on the MSetIterator object: https://getting-started-with-xapian.readthedocs.io/en/latest/advanced/postingsource.html You may find that's faster because it'll mean sorting by doubles instead of strings. Also you'll get told the lowest weight...
2010 Aug 09
2
File descriptor leak (?) in Python
Hi all, Recently I have upgraded a Python application from Xapian 1.0.7 to 1.2.2 in order to use the PostingSource class. It is a long-running process, and I am seeing the number of open file descriptors to the Xapian database steadily increase. I suspect what I am seeing is some kind of resource leak. I have no idea if it is a problem in our code or in the Xapian Python bindings. How do I debug this problem?...
2016 May 16
2
Weighting recent results
...ght = get_weight_based_on_recency(...); $tg->index_text($txt,$weight); If I wanted to allow the user the option of searching either in recency-weighted mode or not, I could index each document into 2 different databases, one with and one without. This avoids having to mess with subclassing PostingSource and C++ and all that. - Alex Aminoff NBER On 05/03/2016 08:15 AM, James Aylett wrote: > On Tue, May 03, 2016 at 07:56:19AM -0400, Alex Aminoff wrote: > >> Perhaps I am not understanding the basic concept, but I was figuring >> we would just write a subclass of PostingSou...