Displaying 20 results from an estimated 7000 matches similar to: "Sorting by value - direction"
2018 Mar 30
2
sorting large msets
Hello, is there a way to optimize sorting by certain values
for queries which return a huge amount of results?
For example, I just want a simple query that gives me the 200
most recent emails out of millions. The elapsed time for
get_mset increases as the number of documents ($n * 2000)
increases.
I suppose I could store a pre-sorted set using SQLite or
similar. Thanks in advance for any
2011 Aug 09
3
what is the fastest way to fetch results which are sorted by timestamp ?
what is the fastest way to fetch results which are sorted by timestamp ?
i want to use xapian as my search engine , use add_boolean_term(something) and add_value(0,sortable_serialise(get_timestamp())) to a doc.
search through enquire.set_weighting_scheme(xapian.BoolWeight()) and enquire.set_sort_by_value(0,True) to ensure that the results are sorted by the timestamp.
This method is ok , but
2013 Feb 20
1
Sticky results
Hi there,
I have a xapian index whose results are being sorted by a value, with (PHP bindings):
$enquire->set_sort_by_value($sort_data_value);
This is because I want the results returned in chronological order of publication date. However, I now have a need to have certain results be 'sticky' at the top of the resultset, regardless of their publication date. Obviously there are
2018 Mar 31
2
sorting large msets
Olly Betts <olly at survex.com> wrote:
> On Fri, Mar 30, 2018 at 05:21:43PM +0000, Eric Wong wrote:
> > Hello, is there a way to optimize sorting by certain values
> > for queries which return a huge amount of results?
> [...]
> > $enquire->set_sort_by_value_then_relevance(0, 1);
>
> If you're just wanting the 200 newest, it'll be faster not to
2011 Aug 11
3
Fwd: Re: what is the fastest way to fetch results which are sorted by timestamp ?
(Forwarded off-list message)
-------- Original Message --------
Subject: Re: [Xapian-discuss] what is the fastest way to fetch results
which are sorted by timestamp ?
Date: Thu, 11 Aug 2011 01:06:36 +0800
From: ??? <panjunyong at gmail.com>
To: Tim Brody <tdb2 at ecs.soton.ac.uk>
On Wed, Aug 10, 2011 at 6:39 PM, Tim Brody <tdb2 at ecs.soton.ac.uk> wrote:
> Hi,
>
> In
2005 Jun 29
2
Sort by docid
Hello,
I wonder if there is a way to cause Xapian to order a result set purely by
docid. In other words, once the result set has been determined, I'd like the
results to be returned to me ordered by their docid, as opposed to by their
match relevance.
The problem at hand is that I'm building a search engine for a mailing list
and I would like to return matches sorted by date; ordering by
2006 May 10
2
ruby bindings odds and ends
In the Ruby bindings, set_X and get_X are renamed to remove the get/set,
so MSet::get_matches_estimated() becomes a read-only property (if I have
my terminology right) of the Ruby MSet class called matches_estimated.
This matches how a Ruby programmer would expect a class to be designed.
I was looking through the getters and setters list. I wonder if a few
entries shouldn't be there, (the
2018 Mar 30
0
sorting large msets
On Fri, Mar 30, 2018 at 05:21:43PM +0000, Eric Wong wrote:
> Hello, is there a way to optimize sorting by certain values
> for queries which return a huge amount of results?
[...]
> $enquire->set_sort_by_value_then_relevance(0, 1);
If you're just wanting the 200 newest, it'll be faster not to calculate
weights, so:
$enquire->set_sort_by_value(0, 1);
2018 Apr 03
0
sorting large msets
On Sat, Mar 31, 2018 at 12:58:19AM +0000, Eric Wong wrote:
> Olly Betts <olly at survex.com> wrote:
> > If you're just wanting the 200 newest, it'll be faster not to calculate
> > weights, so:
> >
> > $enquire->set_sort_by_value(0, 1);
> > $enquire->set_weighting_scheme(new Xapian::BoolWeight());
> >
> > For me, this drops the time
2006 Mar 22
2
searching and sorting by date
I hope I'm not beating a dead horse here, but we recently started
evaluating xapian and Xapwrap (python wrapper around it) for our project
and I've been quickly trying to soak up a lot of the docs and concepts,
so I hope I can explain my question clearly.
We have a use case where we must return the first 50 most recent
documents that match our query. We don't want the first 50
2009 Jan 29
1
Xapian Ruby bindings do not implement full multi-value-sorting functionality?
Hello,
this is a question that could be answered by collaborators of the Ruby
bindings.
Today I've played around with the Xapian::MultiValueSorter class. I've set
everything up and then I tried following on an instance of Xapian::Enquire:
:
enquire = Xapian::Enquire.new(database)
enquire.query = options[:query]
:
sorter = Xapian::MultiValueSorter.new
sorter.add(0, true)
sorter.add(1,
2010 Jan 28
3
Problem getting Xapian working with Burmese
On Fri, Aug 21, 2009 at 02:44:44PM +0200, emmanuel at engelhart.org wrote:
>> I want to update my request.
>> Is my question bad formulated? too trivial? ... or maybe pretty
>> complicated/unclear?
>
>I think nobody answered as it was hard to follow your example because
>the Burmese characters seem to have been mangled (at least the message I
>received wasn't
2014 Mar 26
3
about sort_by_value
Hello, I have found that the use of sort_by_value very slow.
16800 result, return to the previous 10, sorting takes about 25ms.
And if you do not sort, returns 10, need only about 0.3ms.
How to make the sort faster?
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
2012 Apr 27
4
GSoC xapian node binding
Posting recent offline discussion...
On Fri, Apr 27, 2012 at 10:55 AM, Marius Tibeica <mtibeica at gmail.com> wrote:
> Hi Liam,
>
> I've added the Enquire class and designed a query spec structured as a JS
> object. Hope you like it :)
> I'll probably be off a few days (there is a national holiday Tuesday which
> means i have a long weekend :D) but maybe I'll
2009 Mar 18
2
Xapian 1.0.11 released
I've uploaded Xapian 1.0.11 (including Search::Xapian 1.0.11.0), which
as usual you can download from:
http://xapian.org/download
Starting from this release, I'm going to include SHA1 checksums of
the released files in the release announcement. For 1.0.11 these are:
480fd99617975c1aaf9127cd7c7166bc05d07eb1 Search-Xapian-1.0.11.0.tar.gz
217c7fb1754d0aeaf82c3ab83d5ec0881099bb61
2010 Aug 27
1
Using relevance when sorting by generated key
Hi all,
I am trying to implement a scheme where documents in an MSet will be
sorted based on relevance as well as geographical distance from a given
(non-fixed) point.
I understand that this can be accomplished by using PostingSource in
order to implement a custom weighting scheme that would combine BM25
weighting and distance. The problem is that I am using perl and
PostingSource is not
2012 Nov 03
1
get the title from the document
Dear all,
I am working on a very simple project, in which I wanna get the title from the document.
For instance, this is what I have done so far.
///////////// code? for building the index file
??????? # Load content
??????? content = open(filePath).read()
??????? # Prepare document
??????? document = xapian.Document()
??????? document.set_data(content)
??????? # Store fileName
???????
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2007 Oct 16
1
Matches estimate varies with sorting method
Hi all,
I found that the figure returned by MSet::get_matches_estimated() varies
depending on how results are to be sorted.
For instance, in my index, value 4 contains date and time in the format
"yyyymmddhhmmss". For the same query, the number of results will be
estimated to 20000+ when results are first sorted by date and time
with set_sort_by_value_then_relevance(4) and to only 100
2013 Sep 02
2
having trouble with prefixes
I've got a small test database setup with one record.
$ delve -r 1 -V /tmp/1/
Values for record #1: 0:DD4F2162FFFF0E43741A4A1C2B8EC0E7 1:./Text_page_scan_2.jpg 2:jpg 3:.jpg
Term List for record #1: E:.jpg P:./Text_page_scan_2.jpg Q:DD4F2162FFFF0E43741A4A1C2B8EC0E7 T:jpg
The terms were added with lines like this:
doc.add_term(string("P:") + path);
Problem is, I can't seem to