Displaying 20 results from an estimated 1000 matches similar to: "Problem with Perl bindings (enquire)"
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2011 Apr 21
1
How to Retrieve content of the document?
Hi,
I have just started using xapian and I may sound like a noob. I want to know
how i can access the content of the document retrieved while searching. I
have used the code found on this mailing list itself to index my database.
#!/usr/bin/perl -w
use strict;
use Search::Xapian;
use File::Find;
my $DATABASE_DIR = '/home/rohit/Desktop/SET/DB';
my $db =
2018 Jun 21
0
Welcome to the "Xapian-discuss" mailing list
Please keep replies on the mailing list — more people can help (and benefit) that way :)
So OP_NEAR looks for its terms close to each other (hence "near"). The window is how far away they can be. Probably the easiest way to play with this is using the NEAR syntax in the query parser. So if you had a plain text document:
I am walking, always walking.
And index it in a very simple
2013 Jun 19
2
Compact databases and removing stale records at the same time
I'm trying to compact (or at least merge) multiple databases, while stripping search records which are no longer required.
Backstory:
I've inherited the Cyrus IMAPd xapian-based search code from Greg Banks when he left Opera.
One of the unfinished parts was removing expunged emails from the search database.
We moved from having a single search database to supporting multiple
2018 Mar 30
2
sorting large msets
Hello, is there a way to optimize sorting by certain values
for queries which return a huge amount of results?
For example, I just want a simple query that gives me the 200
most recent emails out of millions. The elapsed time for
get_mset increases as the number of documents ($n * 2000)
increases.
I suppose I could store a pre-sorted set using SQLite or
similar. Thanks in advance for any
2010 Jan 28
3
Problem getting Xapian working with Burmese
On Fri, Aug 21, 2009 at 02:44:44PM +0200, emmanuel at engelhart.org wrote:
>> I want to update my request.
>> Is my question bad formulated? too trivial? ... or maybe pretty
>> complicated/unclear?
>
>I think nobody answered as it was hard to follow your example because
>the Burmese characters seem to have been mangled (at least the message I
>received wasn't
2007 Sep 30
1
Perl example of using termitrator?
I'm having trouble translating from C++ to perl objects.
The TermIterator class looks like to get a set of terms in a document
you might have C++ code like:
Enquire::TermIterator termIt =enquire->get_matching_terms_begin(id);
for(;termIt != enquire->get_matching_terms_end(id);termIt++) {
string term = *termIt;
}
Or something similar. However when I attempt to translate that
2018 Nov 30
1
Xapian Benchmark results
Hi,
I am currently trying to benchmark a multithreaded xapian implementation on
a chameleon baremetal instance written in C++. My workload is a 3 Gig
wikipedia xml dump consisting of ~286 file of different sizes. My results
are showing me that indexing on xapian is an order of magnitude faster than
my lucene and lucene plusplus implementations. This is a result that I did
not expect. Just want to
2014 Feb 13
2
A beginner in "Posting list encoding improvements"
I uninstall xapian1.3 and install xapian-1.2.17
but i still failed
hurricanetong at hurricanetong-VirtualBox:~/workspace$ g++ `xapian-config --cxxflags --libs` demo2.cc
/tmp/cc2wsfDJ.o: In function `main':
demo2.cc:(.text+0x4a): undefined reference to `Xapian::WritableDatabase::WritableDatabase(std::basic_string<char, std::char_traits<char>, std::allocator<char> >
2007 Feb 02
1
Working demo of search engine using boolean query.
Lately I was reading many articles about using boolean queries for search
engine but I haven't seen any complete working demo. Therefore I put
together very simple working demo of search engine using boolean query. Feel
free to suggest any performance improvement or error while keeping it as
simple as possible for understanding.
Thanks,
-Kevin Duraj
http://myhealthcare.com
2010 Dec 16
1
Enquire::get_mset() "first" and MatchDecider
Hello,
This is a somewhat idle question about Enquire::get_mset(), possibly based
on wrong assumptions on how Xapian works, but I would be grateful if
someone satisfied my curiosity.
Xapian::MSet Xapian::Enquire::get_mset(Xapian::doccount first,
Xapian::doccount maxitems,
const Xapian::RSet * omrset = 0,
const
2005 Jul 20
1
docid type redifine
Hello all.
I need to redefine a docid type (and all dependent types) like this: typedef unsigned long long docid;
I think it would be enough to edit "include/xapian/types.h", but it isn't so.
1) I've added :
string
om_tostring(unsigned long long val)
{
CONVERT_TO_STRING("%llu")
}
in common/utils.{h,cc}
2) In include/enquire.h (line 438) I've found the
2013 Aug 21
2
Perl interface isn't working in 1.2.x
At least it isn't working the way it used to.
Code:
$db = Search::Xapian::Database->new( $dx );
my $qp = Search::Xapian::QueryParser->new();
my $dbSize=$db->get_doccount();
# $qp->set_stemmer(new Search::Xapian::Stem("english"));
# $qp->set_stemming_strategy(STEM_SOME);
# $qp->set_default_op($defaultop);
my $par =
2011 Aug 10
0
xapian enquire.set_docid_order(Xapian::Enquire::DESCENDING so slow!
i have 300 millions records and my search file like this , i want the newest 10 results that match my query , so i use boolean search and "enquire.set_docid_order(enquire.DESCENDING)" , but this method seems a little slow . when i remove "enquire.set_docid_order(enquire.DESCENDING)" it run much faster .
how can i fetch the newest 10 results as fast as possible?
search.py
2006 Nov 08
2
set_bias() in bindings
I'm trying to call Enquire.set_bias() from the python bindings. This fails
as follows:
>>> db=xapian.Database('/home/richard/foo')
>>> enq=xapian.Enquire(db)
>>> enq.set_bias(1.0, 3600)
Traceback (most recent call last):
File "<stdin>", line 1, in ?
TypeError: in method 'Enquire_set_bias', argument 3 of type 'time_t'
I
2014 Feb 13
2
回复: A beginner in "Posting list encoding improvements"
I think what i did is the same with you except i use make rather than make -sj8, and I did as root.
And I do as you wrote again:
root at hurricanetong-VirtualBox:/home/hurricanetong/xapian-1.2.17/xapian-core-1.2.17# ./configure
[...]
root at hurricanetong-VirtualBox:/home/hurricanetong/xapian-1.2.17/xapian-core-1.2.17# make -sj8
Making all in .
Making all in docs
Making all in tests
root at
2013 Feb 21
1
Parsing fields with phrases.
I have the following code:
my $par =
$qp->parse_query($query,Search::Xapian::FLAG_SPELLING_CORRECTION);
print LOG "Query $query, par $par\n";
my $enq = $xDatabase->enquire( $par );
The output from the LOG file is:
Query title:"new dolphin", par Xapian::Query(0 * Snew dolphin)
No results are returned. If I change the search to title:dolphin it
finds a
2011 Nov 20
2
I'm writing this letter to enquire where can I download the package of "lmtest".
Dear editor:
I'm writing this letter to enquire where can I download the package of "lmtest". Can you send me this package?
THanks a lot.
Best regards,
Shu-Fei Wu
2007 Mar 29
1
"Bad serialised query" error
Hi all,
We've struck a problem when using remote queries and hoping someone can
offer some guidance. I haven't managed to get a C++ reproduction in the
test suite, but I have got a demo that fails using Python:
import xapian
db = xapian.remote_open('localhost', 8100)
qp = xapian.QueryParser()
qp.set_database(db)
query = qp.parse_query('test')
enq =