Displaying 20 results from an estimated 1000 matches similar to: "searching and sorting by date"
2005 Jun 29
2
Sort by docid
Hello,
I wonder if there is a way to cause Xapian to order a result set purely by
docid. In other words, once the result set has been determined, I'd like the
results to be returned to me ordered by their docid, as opposed to by their
match relevance.
The problem at hand is that I'm building a search engine for a mailing list
and I would like to return matches sorted by date; ordering by
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2013 Aug 26
2
Perl interface isn't working in 1.2.x
On 08/25/2013 05:02 PM, Olly Betts wrote:
> So the simple fix is
> probably just to install the perl-Search-Xapian RPM instead.
Thanks, the Centos 6 repos don't have that rpm and the
http://xapian.org/download page seems to only cover the XS bindings, if
I am reading this correctly:
But I was able to remove the rpm packages and compile and install the
core and swig from source.
2018 Mar 30
2
sorting large msets
Hello, is there a way to optimize sorting by certain values
for queries which return a huge amount of results?
For example, I just want a simple query that gives me the 200
most recent emails out of millions. The elapsed time for
get_mset increases as the number of documents ($n * 2000)
increases.
I suppose I could store a pre-sorted set using SQLite or
similar. Thanks in advance for any
2023 Aug 28
1
DatabaseModifiedError while iterating on mset
Olly Betts <olly at survex.com> wrote:
> On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> > I'm already retrying the ->get_mset operations; but now I'm
> > wondering where I'd hit DatabaseModifiedErrors while inside a
> > Xapian::MSetIterator loop.
> >
> > I assume ->get_document is a place where it gets thrown;
> > but
2018 Mar 31
2
sorting large msets
Olly Betts <olly at survex.com> wrote:
> On Fri, Mar 30, 2018 at 05:21:43PM +0000, Eric Wong wrote:
> > Hello, is there a way to optimize sorting by certain values
> > for queries which return a huge amount of results?
> [...]
> > $enquire->set_sort_by_value_then_relevance(0, 1);
>
> If you're just wanting the 200 newest, it'll be faster not to
2023 Aug 27
1
DatabaseModifiedError while iterating on mset
On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> I'm already retrying the ->get_mset operations; but now I'm
> wondering where I'd hit DatabaseModifiedErrors while inside a
> Xapian::MSetIterator loop.
>
> I assume ->get_document is a place where it gets thrown;
> but once a document is retrieved, can iterating through
> terms in one document
2013 Mar 05
1
Sorting by value - direction
Hi,
I'm trying to sort a list of results alphabetically, but it's coming out in the wrong direction. Indexing is done like this (PHP bindings):
// Tag name as a value (for sorting)
$doc->add_value(2, strtolower($obj->tag));
And searching like this:
$enquire->set_sort_by_value(2);
But for some reason, the results are coming out Z-A, not A-Z. I've tried setting true as
2013 Feb 20
1
Sticky results
Hi there,
I have a xapian index whose results are being sorted by a value, with (PHP bindings):
$enquire->set_sort_by_value($sort_data_value);
This is because I want the results returned in chronological order of publication date. However, I now have a need to have certain results be 'sticky' at the top of the resultset, regardless of their publication date. Obviously there are
2012 Jan 20
3
get_docid???
my $mset = $enq->get_mset($nstart,$nrecords);
for(my $mit=$mset->begin(); $mit != $mset->end();$mit++) {
my $doc = $mit->get_document();
my $dat = $doc->get_data();
my $id = $doc->get_docid();
}
[Fri Jan 20 10:35:06 2012] newmail.cgi: Can't locate
auto/Search/Xapian/Document/get_docid.al in @INC (@INC contains:
/etc/perl
2005 Sep 14
1
Python binding and Xapian fields
Hi all,
I have spent more than 3 hours trying to figure out how Xapian documents
fields works ! It seems that I failed so I ask for help.
I am using Xapian 0.9.2 with Python bindings and Xapwrap (Python wrapper for
xapian) on Fedora Core 3. I have added custom fields like 'uri', 'title',
etc and I wish to view them in my Omega results.
I am also using an Omega XML customized
2007 Apr 03
3
More weirdness with PHP bindings
Hi all,
There's been a odd bug reported to us by Daniel Menard while working on
the PHP bindings:
"I then tried to run the dotest target...All tests passed, except the
one about get_matching_terms (smoketest.php line 94).
I added this line before the exit:
for ($i=0; $i<strlen($terms); $i++) echo $c=ord($terms[$i]), ' ',
($c>31?$terms[$i]:''), "\n";
2006 May 10
2
ruby bindings odds and ends
In the Ruby bindings, set_X and get_X are renamed to remove the get/set,
so MSet::get_matches_estimated() becomes a read-only property (if I have
my terminology right) of the Ruby MSet class called matches_estimated.
This matches how a Ruby programmer would expect a class to be designed.
I was looking through the getters and setters list. I wonder if a few
entries shouldn't be there, (the
2014 Mar 26
3
about sort_by_value
Hello, I have found that the use of sort_by_value very slow.
16800 result, return to the previous 10, sorting takes about 25ms.
And if you do not sort, returns 10, need only about 0.3ms.
How to make the sort faster?
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
2017 Apr 09
3
Omega: Missing support for newer weighting schemes
On Sun, Apr 09, 2017 at 11:34:07PM +0530, Vivek Pal wrote:
> > Each scheme already has a human-readable name, and Xapian::Registry
> > can map that to an "examplar" object of the right type, so we
> > could take a string like "bm25 1 0.8", see the first word is "bm25"
> > and get a BM25Weight object, then call parse_params("1 0.8") on
2010 Oct 13
1
bwplot change whiskers position to percentile 5 and P95
Dear R-community,
Using bwplot, how can I put the whiskers at percentile 5 and percentile 95,
in place of the default position coef=1.5??
Using panel=panel.bwstrip, whiskerpos=0.05, from the package agsemisc gives
satisfaction, but changes the appearance of my boxplot and works with an old
version of R, what I don’t want, and I didn’t find the option in
box.umbrella parameters
Many thanks
2007 Sep 30
1
Perl example of using termitrator?
I'm having trouble translating from C++ to perl objects.
The TermIterator class looks like to get a set of terms in a document
you might have C++ code like:
Enquire::TermIterator termIt =enquire->get_matching_terms_begin(id);
for(;termIt != enquire->get_matching_terms_end(id);termIt++) {
string term = *termIt;
}
Or something similar. However when I attempt to translate that
2017 Apr 12
4
Omega: Missing support for newer weighting schemes
> Each scheme already has a human-readable name, and Xapian::Registry
> can map that to an "examplar" object of the right type, so we
> could take a string like "bm25 1 0.8", see the first word is "bm25"
> and get a BM25Weight object, then call parse_params("1 0.8") on it to
> create the correct Weight object (broadly similar to how
2008 Jan 04
2
Agents and AddQueueMember
Hi,
I have callcenter running with v 1.2 with AgentCallbackLogin and now
trying to move to 1.4 using the example doc,
doc/queues-with-callback-members.txt. From what I understand the basic
idea in the example is to
1. Authenticate a caller with VMAuthenticate
2. Get his SIP Channel number
3. Use