Displaying 20 results from an estimated 700 matches similar to: "get_docid???"
2013 Aug 21
2
Perl interface isn't working in 1.2.x
At least it isn't working the way it used to.
Code:
$db = Search::Xapian::Database->new( $dx );
my $qp = Search::Xapian::QueryParser->new();
my $dbSize=$db->get_doccount();
# $qp->set_stemmer(new Search::Xapian::Stem("english"));
# $qp->set_stemming_strategy(STEM_SOME);
# $qp->set_default_op($defaultop);
my $par =
2013 Aug 26
2
Perl interface isn't working in 1.2.x
On 08/25/2013 05:02 PM, Olly Betts wrote:
> So the simple fix is
> probably just to install the perl-Search-Xapian RPM instead.
Thanks, the CentOS 6 repos don't have that RPM and the
http://xapian.org/download page seems to only cover the XS bindings, if
I am reading this correctly:
But I was able to remove the rpm packages and compile and install the
core and swig from source.
2007 Sep 30
1
Perl example of using TermIterator?
I'm having trouble translating from C++ to Perl objects.
With the TermIterator class, it looks like to get the set of terms in a
document you might have C++ code like:
Xapian::TermIterator termIt = enquire->get_matching_terms_begin(id);
for (; termIt != enquire->get_matching_terms_end(id); ++termIt) {
    std::string term = *termIt;
}
Or something similar. However when I attempt to translate that
2016 May 09
1
Given a document, how do you get its ID? (perl bindings)
I am writing an indexer that will crawl our web site. Following the
recommendation here:
https://trac.xapian.org/wiki/FAQ/UniqueIds
I'm using the URL as the unique ID for each document. I see how to get a
document from the xapian database if I know its URL, but what I need is
also to be able to find out the URL from the document. Does this mean I
need to store the URL in a value in
2007 Feb 09
1
PHP Binding and dbi2omega questions
Hi All,
I've installed Xapian and the PHP module. I've set up a script for use with
scriptindex and dbi2omega to get data from the db into the index easily.
The script file is as follows:
===============================
id : field=id
title : index
title: field=title
description : index
description : truncate=50 field=content
=============================
However, when querying
2006 Jan 31
1
retrieving attributes of searchresults
I use the Perl interface Search::Xapian to index documents. I now have
metadata that I store with the index, such as title, date and author, and I
wonder how to retrieve it from the index again without pulling it from the
database. I am pretty sure this is a stupid question and that the answer is
obvious, but I don't seem to be able to find it.
Regards, m
2017 Jun 06
1
Test for the end of PostingIterator in perl?
Hi all. I want to iterate over all the documents in my database.
my $pi = $db->postlist_begin("");
while ("$pi" =~ qr/END/) {
    my $oldid = $pi->get_docid;
    $pi++;
    #...
}
That used to work with Search::Xapian in perl version 1.2, but now with
xapian-bindings-1.4.4 it does not seem to. How are you supposed to tell
when you have reached the
2013 Jan 17
1
FASTER Search
I am suffering from slow search performance with Xapian.
I am using Xapian to index about 150,000,000 documents.
It is implemented in C++.
Search performance is not that fast.
E.g. searching for a query that includes about 20 terms takes 2 secs on average.
For searching, I followed such steps:
1. construct a QueryParser for certain string
2. parse the query to get a Xapian::Query
2007 Feb 02
1
Working demo of search engine using boolean query.
Lately I have been reading many articles about using boolean queries for search
engines, but I haven't seen any complete working demo. Therefore I put
together a very simple working demo of a search engine using a boolean query.
Feel free to suggest any performance improvements or point out errors, while
keeping it as simple as possible for understanding.
Thanks,
-Kevin Duraj
http://myhealthcare.com
2010 Apr 16
2
best practices - combining sql database and xapian, size of database?
Newbie-alert: I'm just getting started on a new project involving a
full text search requirement, and my initial investigation points to
xapian being the way to go.
Two questions:
- eventually I'll most likely be indexing towards 50 million
documents - is this reasonable to expect or attempt with xapian?
- each of my documents comes with a set of attributes. These are easily
stored
2011 Apr 21
1
How to Retrieve content of the document?
Hi,
I have just started using Xapian and I may sound like a noob. I want to know
how I can access the content of the document retrieved while searching. I
have used code found on this mailing list to index my database.
#!/usr/bin/perl -w
use strict;
use Search::Xapian;
use File::Find;
my $DATABASE_DIR = '/home/rohit/Desktop/SET/DB';
my $db =
2009 Jan 27
1
Segmentation fault in MSetIterator get_weight
Hi,
I'm using Xapian with C# and Mono and I'm getting a segfault in get_weight.
When I print the index variable, the value is clearly too high.
I think something is writing over it. Do you have any idea how I could
trace the origin of the segmentation fault?
Thanks,
--
Yann
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2004 Oct 18
1
xapian-bindings compile creates 34mb xapian_wrap.o file?
Questions are threaded into the output:
********************
#./configure --without-python
checking for a BSD-compatible install... /usr/bin/install -c
checking whether build environment is sane... yes
checking for gawk... no
checking for mawk... mawk
checking whether make sets $(MAKE)... yes
checking build system type... i686-pc-linux-gnu
checking host system type... i686-pc-linux-gnu
checking
2023 Aug 28
1
DatabaseModifiedError while iterating on mset
Olly Betts <olly at survex.com> wrote:
> On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> > I'm already retrying the ->get_mset operations; but now I'm
> > wondering where I'd hit DatabaseModifiedErrors while inside a
> > Xapian::MSetIterator loop.
> >
> > I assume ->get_document is a place where it gets thrown;
> > but
2004 Dec 21
1
Search::Xapian add_database'd search results are odd?
Sorry if this is the wrong forum to discuss Search::Xapian issues --
this just seems like the best place..
Anyways, I've been testing out using $db->add_database() when searching,
and it seems like the docids I'm getting out of it are incorrect, almost
as though they're "double" what they should be (numerically)...
the docids that exist should be around 950,000 and
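One possible explanation for "doubled" docids, offered only as a sketch: as far as I know, when several databases are searched together via add_database(), Xapian interleaves their document ids, so with two databases a local docid maps to roughly twice its value. The helper names below (combined_docid, split_docid) are hypothetical and not part of Search::Xapian; they only illustrate the arithmetic in plain Python.

```python
# Sketch of the docid interleaving used for multi-database searches.
# Assumption: combined = (local_docid - 1) * n_databases + db_index + 1,
# with db_index counted from 0 in the order the databases were added.
# These helpers are illustrative only and are not Xapian API calls.

def combined_docid(local_docid, db_index, n_databases):
    """Map a docid inside one sub-database to the combined docid."""
    return (local_docid - 1) * n_databases + db_index + 1


def split_docid(combined, n_databases):
    """Recover (db_index, local_docid) from a combined docid."""
    db_index = (combined - 1) % n_databases
    local_docid = (combined - 1) // n_databases + 1
    return db_index, local_docid


if __name__ == "__main__":
    # Local docid 950000 in the second of two databases
    print(combined_docid(950000, 1, 2))   # 1900000
    print(split_docid(1900000, 2))        # (1, 950000)
```

Under that assumption, a document with local docid around 950,000 shows up with a combined docid around 1,900,000, and the inverse mapping recovers which database it came from.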
2023 Aug 27
1
DatabaseModifiedError while iterating on mset
On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> I'm already retrying the ->get_mset operations; but now I'm
> wondering where I'd hit DatabaseModifiedErrors while inside a
> Xapian::MSetIterator loop.
>
> I assume ->get_document is a place where it gets thrown;
> but once a document is retrieved, can iterating through
> terms in one document
2013 Feb 05
0
make error in xapian-application/omega (jiangwen jiang)
Hi jiangwen, hope you are doing fine :) You need some libraries and
tools installed on your system before you build Xapian and Omega. The
complete list can be found in the "Building from svn or git" section of
this document:
http://svn.xapian.org/trunk/xapian-core/HACKING?view=co
Make sure you have all the required tools installed and it will work fine.
Please let me know if you
2007 Apr 03
3
More weirdness with PHP bindings
Hi all,
There's been an odd bug reported to us by Daniel Menard while working on
the PHP bindings:
"I then tried to run the dotest target...All tests passed, except the
one about get_matching_terms (smoketest.php line 94).
I added this line before the exit:
for ($i=0; $i<strlen($terms); $i++) echo $c=ord($terms[$i]), ' ',
($c>31?$terms[$i]:''), "\n";