Displaying 20 results from an estimated 7000 matches similar to: "How to locate the query term in search results?"
2008 Apr 20
1
Exception DatabaseCorruptError under php
Hello,
I'm having a litle issue with Xapian-1.0.6 with php bindings under
php5 when trying to add a new document it throws:
Fatal error: Uncaught exception 'Exception' with message
'DatabaseCorruptError: Failed to unlink
/var/lib/xapian/trade.ar/termlist.baseA: No such file or directory'
in /usr/share/php5/xapian.php:1140
Stack trace:
#0 /usr/share/php5/xapian.php(1140):
2013 Feb 07
0
Ideas for allowing specification of weighing scheme for Eset
Hey guys ,Hi :) I am working on a hack which will allow the user to
specify a weighing scheme (along with the parameters , if he does not not
want to use the default values) to build the Eset (rather than using the
hard coded TradWeight scheme with default k=1 ) as Olly had suggested
that we can probably get better terms (a more relevant Eset) for query
expansion if we use say something
2012 Feb 17
2
DatabaseModifiedError on get_data - best practice?
Hi,
I have previously had a problem with getting this error on a get_mset
call, and solved it by subclassing XapianEnquire with a
backoff-and-retry algorithm (as suggested by this list, many thanks!).
However, I now get it intermittently when calling get_data on a
XapianDocument. The same solution doesn't seem to be quite as easy in
this case, because:
1. The document is not instantiated
2023 Jul 04
1
Internal error: Message without type term
On Mon, Jul 03, 2023 at 02:26:03PM +0200, David Bremner wrote:
> "Peter P." <peterparker at fastmail.com> writes:
>
> > I ran xapian-check on ~/.notmuch/xapian and include its messages
> > below at the end of this mail. Everyone please forgive me for
> > pasting 1121 there. :)
>
> H'mm. It doesn't look familiar to me, but I will check with
2010 Aug 16
1
No position.{DB,baseA,baseB}
I've just noticed that new indexes no longer have
position.{DB,baseA,baseB} files, all previous indexes (I roll indexes
every week using xapian-compact) have the position files. The index
seems to work but it is returning some odd results, for example if I run
a query with the phrase "machine learning" it mostly returns documents
containing "machine learning" but it also
2013 Sep 22
2
How to filter search result with query with has white space.
Hello,
include <iostream>#include <string>#include <xapian.h>struct document{
std::string title;
std::string content;
std::string url;};
void indexData(document d) {
try {
Xapian::WritableDatabase db("/Users/ramesh/Desktop/xapian",
Xapian::DB_CREATE_OR_OPEN);
Xapian::TermGenerator indexer;
Xapian::Stem
2013 Sep 22
2
How to filter search result with query with has white space.
Hello,
include <iostream>#include <string>#include <xapian.h>struct document{
std::string title;
std::string content;
std::string url;};
void indexData(document d) {
try {
Xapian::WritableDatabase db("/Users/ramesh/Desktop/xapian",
Xapian::DB_CREATE_OR_OPEN);
Xapian::TermGenerator indexer;
Xapian::Stem
2011 Feb 18
1
Is it possible to reset the parameters in BM25 each time a new query enters?
Hi guys,
I'm trying to improve the search results of our collection by tuning the parameters in the BM25 weighting schema. Since our collection includes several databases, such as for pictures, websites, etc., I would like to use different values of the same schema to calculate the weights. Yet, rebuilding each time after the change was done to the head file seems not an optimal approach and
2007 May 15
1
Document ID 0 is invalid... but not always...
Note: this is rather long and not very important and I don't want to
prevent the team from releasing version 1.0, so go on reading only if
you have too much free time !!! ;-)
0 is not a valid document ID, never, ever, but I just found a special
case in which xapian will create a record and return 0 for the newly
created record.
In fact, I was "hacking", trying to store metadata
2023 Aug 28
1
DatabaseModifiedError while iterating on mset
Olly Betts <olly at survex.com> wrote:
> On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> > I'm already retrying the ->get_mset operations; but now I'm
> > wondering where I'd hit DatabaseModifiedErrors while inside a
> > Xapian::MSetIterator loop.
> >
> > I assume ->get_document is a place where it gets thrown;
> > but
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2016 Jul 30
2
Letor: returning MSet after re-ranking
>
>
> I'd prefer to avoid adding things to the public API that don't get
> used by end users. However because LTR is outside the Xapian build
> tree, we can't easily give it privileged access to Xapian internals.
>
Sorry for a delayed response.
The way I was thinking of performing reranking with updated weights was to
add a class MSetRanker (basically containing a
2018 Jul 12
1
Error while compacting: Bad position key
Mike Hommey <mh at glandium.org> writes:
> Hi,
>
> When running `notmuch compact` today, it stopped with the following
> output:
>
> Compacting database...
> compacting table postlist
> Reduced by 25% 648656K (2498904K -> 1850248K)
> compacting table docdata
> Reduced by 15% 24K (152K -> 128K)
> compacting table termlist
> Reduced by
2019 Feb 03
0
Amount of writes during index creation
On Thu, Jan 31, 2019 at 08:44:44PM +0100, Jean-Francois Dockes wrote:
> I have run a number of tests, with data mostly from a project gutenberg dvd
> and other books, with relatively modest index sizes, from 1 to 24 GB.
>
> Quite curiously, in this zone, with all Xapian versions I tried, the ratio
> from index size to the amount of writes is roughly proportional to the index
>
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2015 Jul 26
1
Get term from document by position
mple (see attachment).
>
> Attachments get stripped out by the mailing list, so I?ve made a private gist of the two files here: <https://gist.github.com/jaylett/ce8455b37e2b84422346>.
>
> Actually, when I run it I get 0 matches, which would explain why you?re just getting the start of the document. However if I adjust things (match the stemming strategy for TermGenerator to
2006 Aug 06
1
How to use omega to search remote back end?
Folks,
Having trouble getting this to work. OMEGA cgi is not reading my stub file properly because it is trying to read it as a directory instead of a file. Is there an easy fix? Here is a transcript.
Thanks,
OSC
oscar@epsilon:/svr/xapian/beta$ ls -aFl
total 21335200
drwxr-xr-x 2 oscar oscar 4096 Aug 6 10:15 ./
drwxr-xr-x 5 oscar oscar 4096 Aug 6 12:59 ../
lrwxrwxrwx 1 oscar
2010 Jan 09
1
Using HASH() and REALTIME_HASH()
Hi,
I'm playing around with asterisk 1.6.2.0 and the first try was to
replace my now non-functionning
'app-realtime' macro which emulated RealTime with REALTIME_HASH()
There is very few documentation on the subject except for this bug report:
https://issues.asterisk.org/view.php?id=13651#c94998
However when i try this syntax:
2009 Feb 12
1
problem when using xapian's static libs in windows
I have download source ?1.10? from the internet
and build it into lib
Then I create a project as the helpdoc said
I using vc2005(vc8)
The source in my test project is as follow??copy from the helpdoc?
#include <xapian.h>
#include <iostream>
using namespace std;
int main(int argc, char **argv)
{
// Simplest possible options parsing: we just require three or more
2007 Feb 09
1
PHP Binding and dbi2omega questions
Hi All,
I've installed Xapian and the php module. I've set up a script for use with
scriptindex and dbi2omega for getting data from the db into the index
easily, the script file is as follows:
===============================
id : field=id
title : index
title: field=title
description : index
description : truncate=50 field=content
=============================
However, when querying