Displaying 20 results from an estimated 34 matches for "postingsource".
2018 Jan 22
2
How to get the serialise score returned in Xapian::KeyMaker->operator().
>A possible workaround (and perhaps a better approach) would be to
>set BoolWeight as the weighting scheme, then feed in your score as
>a weight using a PostingSource. Then it's available via get_weight()
>on the MSetIterator object:
>
>https://getting-started-with-xapian.readthedocs.io/en/latest/advanced/postingsource.html
>
>You may find that's faster because it'll mean sorting by doubles instead
>of strings.
We realise our sco...
2020 Feb 07
2
prioritizing aggregated DBs
Hey all, I've been using ->add_database for a few years
to tie sharded DBs together and it works great.
Now, I want to be able to search across several DBs
which aren't sharded, say: linux-DB, glibc-DB, freebsd-DB.
I want to search for something across all of them, but
prioritize results to favor one or some of those DBs over
others. Is there a way to do that without reindexing?
Or
2017 Dec 15
5
How to get the serialise score returned in Xapian::KeyMaker->operator().
HI, all,
I am a user of Xapian, and now I have a problem in using it.
After using boolean terms to get some candidates of documents (still too much), we want sorted them by self-defined function which is used in Xapian::KeyMaker->operator(). But how can I get the serialise score in Xapian::MSetIterator object.
c++ code likes this:
class SortKeyMaker : public Xapian::KeyMaker {
std::string
2018 Jan 24
0
How to get the serialise score returned in Xapian::KeyMaker->operator().
On Tue, Jan 23, 2018 at 12:55:31AM +0800, 张少华 wrote:
> We realise our score function using PostingSource instead of using
> KeyMaker, we reference your python example and source code of xapian,
> the simple demo is here.
> https://github.com/xiangqianzsh/xapian_leaning/blob/master/postingsource/ExternalWeightPostingSource.h
I'd just put the get_weight() and get_maxweight() implementati...
2016 Apr 22
2
Weighting recent results
I did some digging and found a thread from 2011 talking about how to
subclass Xapian::PostingSource in order to incorporate the date or
recency of a document in its weighting:
http://thread.gmane.org/gmane.comp.search.xapian.general/8849/focus=8856
As in that thread, I want to be clear that I don't want to sort by date,
but rather incorporate date information into the score by which I sor...
2016 May 03
2
Weighting recent results
On 5/2/2016 9:03 PM, Olly Betts wrote:
> On Fri, Apr 22, 2016 at 12:23:15PM -0400, Alex Aminoff wrote:
>> I did some digging and found a thread from 2011 talking about how to
>> subclass Xapian::PostingSource in order to incorporate the date or
>> recency of a document in its weighting:
>>
>> http://thread.gmane.org/gmane.comp.search.xapian.general/8849/focus=8856
>>
>> As in that thread, I want to be clear that I don't want to sort by
>> date, but rather incorpor...
2020 Feb 07
0
prioritizing aggregated DBs
...t; which aren't sharded, say: linux-DB, glibc-DB, freebsd-DB.
>
> I want to search for something across all of them, but
> prioritize results to favor one or some of those DBs over
> others. Is there a way to do that without reindexing?
With git master you can achieve this with a PostingSource subclass as
there's a new PostingSource::reset() method which gets passed the
shard it is being called for, so you can set an extra weight
contribution based on that. This is a replacement for
PostingSource::init() in 1.4, which doesn't know which shard it is being
called for.
You can the...
2020 Feb 08
2
prioritizing aggregated DBs
...ay: linux-DB, glibc-DB, freebsd-DB.
> >
> > I want to search for something across all of them, but
> > prioritize results to favor one or some of those DBs over
> > others. Is there a way to do that without reindexing?
>
> With git master you can achieve this with a PostingSource subclass as
> there's a new PostingSource::reset() method which gets passed the
> shard it is being called for, so you can set an extra weight
> contribution based on that. This is a replacement for
> PostingSource::init() in 1.4, which doesn't know which shard it is being
>...
2013 Apr 26
1
Compiling Xapian within a Cocoa project
...\
^
In file included from /Users/hat/Projects/OCXapian/OCXapian/OXAppDelegate.mm:9:
In file included from /Users/hat/Projects/OCXapian/OCXapian/OXAppDelegate.h:10:
In file included from /opt/local/include/xapian.h:50:
/opt/local/include/xapian/postingsource.h:223:43: error: too many arguments provided to function-like macro invocation
virtual bool check(Xapian::docid did, Xapian::weight min_wt);
^
/opt/local/include/xapian/postingsource.h:223:5: error: 'virtual' can only appear on non-static memb...
2010 Aug 27
1
Using relevance when sorting by generated key
Hi all,
I am trying to implement a scheme where documents in an MSet will be
sorted based on relevance as well as geographical distance from a given
(non-fixed) point.
I understand that this can be accomplished by using PostingSource in
order to implement a custom weighting scheme that would combine BM25
weighting and distance. The problem is that I am using perl and
PostingSource is not available for perl. Besides, I find this solution
unnecessarily complicated.
A more simple solution would be if the sorter in
Xapian::E...
2012 Apr 02
0
GSoC, Xapian Project Weighting Schemes
...the ability to rank search result by the mathematical
> formulas like tf*idf andBM25.
Actually, you can already rank results by incoming hyperlink counts, or
any query-independent factor(s) you want to keep track of, and you can
combine that with term-based weights.? This is done by creating a
PostingSource subclass and using it to the query:
http://xapian.org/docs/postingsource.html
> weight S= S1(Weight calculated by BM25) * S2(weight of document
> calculated based on
You can't multiply the factors like this with a PostingSource, only add
them - is there any theoretical or experimental...
2016 May 03
0
Weighting recent results
On Fri, Apr 22, 2016 at 12:23:15PM -0400, Alex Aminoff wrote:
> I did some digging and found a thread from 2011 talking about how to
> subclass Xapian::PostingSource in order to incorporate the date or
> recency of a document in its weighting:
>
> http://thread.gmane.org/gmane.comp.search.xapian.general/8849/focus=8856
>
> As in that thread, I want to be clear that I don't want to sort by
> date, but rather incorporate date information i...
2011 May 23
1
More relevance for recent documents
Good afternoon
I would like to ask if is possible somehow give more relevance to the
recent documents in search results.
I dont want to sort results according to the date, I still prefer
relevance, but I would like to see recent documents with better scoring.
I was trying to add search query using AND_MAYBE, which should use
relevance from both subqueries, but it didnt add any benefit to the
2011 Aug 09
3
what is the fastest way to fetch results which are sorted by timestamp ?
what is the fastest way to fetch results which are sorted by timestamp ?
i want to use xapian as my search engine , use add_boolean_term(something) and add_value(0,sortable_serialise(get_timestamp())) to a doc.
search through enquire.set_weighting_scheme(xapian.BoolWeight()) and enquire.set_sort_by_value(0,True) to ensure that the results are sorted by the timestamp.
This method is ok , but
2013 Mar 02
3
How to add an custom weight to the relevancy value and sort it.
Hello guys,
I have an weight value which is calculated by some factor and i need to add
the weight with the relevancy value of a result and sort it with that value
is that possible in xapian.
Thanks,
VishnuKumar
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130302/9831e287/attachment-0001.html>
2008 Sep 10
1
mu-0.2, maildir indexer/searcher with xapian support
Hi all,
[ Hopefully announcements like this are appropriate here... ]
I've just released version 0.2 of my maildir scanner/search called 'mu':
http://www.djcbsoftware.nl/code/mu/
It it's written in C and a bit of C++, and released under the GPL.
Thanks for the help I got here, it was quite easy to integrate Xapian,
and it works really nice -- a high quality product. Great job!
2008 Dec 17
1
using ValueWeightPostingSource
Hi,
I'm currently using PostingSource to add some weight over the result
using a value.
I didn't find any documentation on how to use it with the query so i
link a query constructed using the posting source and a query made
using the query parser with an AND operator :
Xapian.Query queryText = parser.ParseQuery("test:"...
2017 Dec 16
0
How to get the serialise score returned in Xapian::KeyMaker->operator().
...and it seems like it ought to be accessible
but there's no accessor method for it - I can add one but that won't
help for existing releases.
A possible workaround (and perhaps a better approach) would be to
set BoolWeight as the weighting scheme, then feed in your score as
a weight using a PostingSource. Then it's available via get_weight()
on the MSetIterator object:
https://getting-started-with-xapian.readthedocs.io/en/latest/advanced/postingsource.html
You may find that's faster because it'll mean sorting by doubles instead
of strings.
Also you'll get told the lowest weight...
2010 Aug 09
2
File descriptor leak (?) in Python
Hi all,
Recently I have upgraded a Python application from Xapian 1.0.7 to
1.2.2 in order to use the PostingSource class. It is a long-running
process, and I am seeing the number of open file descriptors to the
Xapian database steadily increase. I suspect what I am seeing is some
kind of resource leak.
I have no idea if it is a problem in our code or in the Xapian Python
bindings. How do I debug this problem?...
2016 May 16
2
Weighting recent results
...ght = get_weight_based_on_recency(...);
$tg->index_text($txt,$weight);
If I wanted to allow the user the option of searching either in
recency-weighted mode or not, I could index each document into 2
different databases, one with and one without.
This avoids having to mess with subclassing PostingSource and C++ and
all that.
- Alex Aminoff
NBER
On 05/03/2016 08:15 AM, James Aylett wrote:
> On Tue, May 03, 2016 at 07:56:19AM -0400, Alex Aminoff wrote:
>
>> Perhaps I am not understanding the basic concept, but I was figuring
>> we would just write a subclass of PostingSou...