Displaying 20 results from an estimated 1000 matches similar to: "Sort by docid"
2011 Aug 11
3
Fwd: Re: what is the fastest way to fetch results which are sorted by timestamp ?
(Forwarded off-list message)
-------- Original Message --------
Subject: Re: [Xapian-discuss] what is the fastest way to fetch results
which are sorted by timestamp ?
Date: Thu, 11 Aug 2011 01:06:36 +0800
From: ??? <panjunyong at gmail.com>
To: Tim Brody <tdb2 at ecs.soton.ac.uk>
On Wed, Aug 10, 2011 at 6:39 PM, Tim Brody <tdb2 at ecs.soton.ac.uk> wrote:
> Hi,
>
> In
2011 Aug 09
3
what is the fastest way to fetch results which are sorted by timestamp ?
what is the fastest way to fetch results which are sorted by timestamp ?
i want to use xapian as my search engine , use add_boolean_term(something) and add_value(0,sortable_serialise(get_timestamp())) to a doc.
search through enquire.set_weighting_scheme(xapian.BoolWeight()) and enquire.set_sort_by_value(0,True) to ensure that the results are sorted by the timestamp.
This method is ok , but
2018 Mar 30
2
sorting large msets
Hello, is there a way to optimize sorting by certain values
for queries which return a huge amount of results?
For example, I just want a simple query that gives me the 200
most recent emails out of millions. The elapsed time for
get_mset increases as the number of documents ($n * 2000)
increases.
I suppose I could store a pre-sorted set using SQLite or
similar. Thanks in advance for any
2018 Mar 31
2
sorting large msets
Olly Betts <olly at survex.com> wrote:
> On Fri, Mar 30, 2018 at 05:21:43PM +0000, Eric Wong wrote:
> > Hello, is there a way to optimize sorting by certain values
> > for queries which return a huge amount of results?
> [...]
> > $enquire->set_sort_by_value_then_relevance(0, 1);
>
> If you're just wanting the 200 newest, it'll be faster not to
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2010 Jul 26
2
related documents
Hi All,
I would like to take a doc in the xapian DB and find all related
documents by relevance e.g. so when you view one document it says
"Related entries X Y Z".
I'm aware of the "Morelikethis" Lucene plugin that is supposed to do
something like this, by generating a query from a document based on term
frequency.
Has anyone developed a tool to generate a query from a
2005 Jul 20
1
docid type redifine
Hello all.
I need to redefine a docid type (and all dependent types) like this: typedef unsigned long long docid;
I think it would be enough to edit "include/xapian/types.h", but it isn't so.
1) I've added :
string
om_tostring(unsigned long long val)
{
CONVERT_TO_STRING("%llu")
}
in common/utils.{h,cc}
2) In include/enquire.h (line 438) I've found the
2011 Aug 10
0
xapian enquire.set_docid_order(Xapian::Enquire::DESCENDING so slow!
i have 300 millions records and my search file like this , i want the newest 10 results that match my query , so i use boolean search and "enquire.set_docid_order(enquire.DESCENDING)" , but this method seems a little slow . when i remove "enquire.set_docid_order(enquire.DESCENDING)" it run much faster .
how can i fetch the newest 10 results as fast as possible?
search.py
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2006 Mar 22
2
searching and sorting by date
I hope I'm not beating a dead horse here, but we recently started
evaluating xapian and Xapwrap (python wrapper around it) for our project
and I've been quickly trying to soak up a lot of the docs and concepts,
so I hope I can explain my question clearly.
We have a use case where we must return the first 50 most recent
documents that match our query. We don't want the first 50
2007 Apr 11
1
Deprecation Policy
When going through the xapian bindings yesterday, I noticed that several
of the methods were not wrapped for Ruby because they were deprecated at
the time the ruby bindings were created. I filed a bug (#126) saying
that they should be removed entirely, which led to the suggestion from
Olly that it would be good to make a semi-formal policy about
deprecating features. I've written such a
2012 Jan 20
3
get_docid???
my $mset = $enq->get_mset($nstart,$nrecords);
for(my $mit=$mset->begin(); $mit != $mset->end();$mit++) {
my $doc = $mit->get_document();
my $dat = $doc->get_data();
my $id = $doc->get_docid();
}
[Fri Jan 20 10:35:06 2012] newmail.cgi: Can't locate
auto/Search/Xapian/Document/get_docid.al in @INC (@INC contains:
/etc/perl
2018 Jun 20
2
Welcome to the "Xapian-discuss" mailing list
Hi,
I'm new to Xapian and wanted to know if it has a specific feature. I want
to be able to check the relation between two terms on a page based on how
close they are together on the page. I want to use a combination of n-gram
based labeling and the "slop" feature found in Elasticsearch. Does Xapian
have this/a similar feature? I haven't been able to find any programs that
have
2018 Mar 30
0
sorting large msets
On Fri, Mar 30, 2018 at 05:21:43PM +0000, Eric Wong wrote:
> Hello, is there a way to optimize sorting by certain values
> for queries which return a huge amount of results?
[...]
> $enquire->set_sort_by_value_then_relevance(0, 1);
If you're just wanting the 200 newest, it'll be faster not to calculate
weights, so:
$enquire->set_sort_by_value(0, 1);
2018 Apr 03
0
sorting large msets
On Sat, Mar 31, 2018 at 12:58:19AM +0000, Eric Wong wrote:
> Olly Betts <olly at survex.com> wrote:
> > If you're just wanting the 200 newest, it'll be faster not to calculate
> > weights, so:
> >
> > $enquire->set_sort_by_value(0, 1);
> > $enquire->set_weighting_scheme(new Xapian::BoolWeight());
> >
> > For me, this drops the time
2010 Jan 28
3
Problem getting Xapian working with Burmese
On Fri, Aug 21, 2009 at 02:44:44PM +0200, emmanuel at engelhart.org wrote:
>> I want to update my request.
>> Is my question bad formulated? too trivial? ... or maybe pretty
>> complicated/unclear?
>
>I think nobody answered as it was hard to follow your example because
>the Burmese characters seem to have been mangled (at least the message I
>received wasn't
2011 May 30
1
How to check docid
I have a bit of code (Python) to delete a number of documents:
for f in Flist: xapian_store.delete_document(f.pri_key)
in which I am using a unique primary key from an SQL database as the docid for the Xapian database.
The problem I have is that some of the documents may not have been created - so I get an error. Now I could just ignore the error (try-recover), but what would be the
2013 Aug 26
2
Perl interface isn't working in 1.2.x
On 08/25/2013 05:02 PM, Olly Betts wrote:
> So the simple fix is
> probably just to install the perl-Search-Xapian RPM instead.
Thanks, the Centos 6 repos don't have that rpm and the
http://xapian.org/download page seems to only cover the XS bindings, if
I am reading this correctly:
But I was able to remove the rpm packages and compile and install the
core and swig from source.
2013 Feb 21
1
Parsing fields with phrases.
I have the following code:
my $par =
$qp->parse_query($query,Search::Xapian::FLAG_SPELLING_CORRECTION);
print LOG "Query $query, par $par\n";
my $enq = $xDatabase->enquire( $par );
The output from the LOG file is:
Query title:"new dolphin", par Xapian::Query(0 * Snew dolphin)
No results are returned. If I change the search to title:dolphin it
finds a
2007 Sep 30
1
Perl example of using termitrator?
I'm having trouble translating from C++ to perl objects.
The TermIterator class looks like to get a set of terms in a document
you might have C++ code like:
Enquire::TermIterator termIt =enquire->get_matching_terms_begin(id);
for(;termIt != enquire->get_matching_terms_end(id);termIt++) {
string term = *termIt;
}
Or something similar. However when I attempt to translate that