Displaying 20 results from an estimated 10000 matches similar to: "How to rebuild database from scratch without preventing searches"
2007 Dec 13
2
backing up a database by locking while backup program runs
Hi.
I want to back up a Xapian database. If I copy the database directory
and its contents using (for instance) rsync, and I know that throughout
the duration of the rsync process another process has had the database
open for writing, but has not actually made any modifications, can I
be sure that, on completion of the rsync process, I have a copy of the
database which is consistent and fully
2010 Oct 24
1
Cannot index with dynamic spelling data (Perl/Search::Xapian)
This is my test case, what am I doing wrong? It seems that the API is used
incorrectly, but I cannot find the problem...
--- 8< ---
#!/usr/bin/perl
use Search::Xapian qw(:all);
use strict;
my $xa = new Search::Xapian::WritableDatabase ("/tmp/xapian",
DB_CREATE_OR_OVERWRITE);
my $indexer = Search::Xapian::TermGenerator->new();
2010 Oct 21
2
In-memory databases vs PHP Bindings
I can't quite connect the dots on this, perhaps someone can help. I'm
simply trying to create an in-memory database comprising a single document,
so that I can run a load of queries against it and see if any of them match
the new document (this is to enable users to have 'subscriptions' to saved
searches and be alerted every time a new item is published that matches
their
2008 Nov 21
1
Multiple databases vs Single large database
Hi
I've decided to use xapian because my files table in my mysql database is going
to grow very large, and it seems mysql isn't good at full text searching. I'm
doing this with the php wrapper by the way.
The way my system is set out, each user has their own set of files, and when
doing a search it is going to be for a specific user's file (based on file
name, title,
2015 Jan 26
2
delve not found
Hi all,
Today I just cloned Xapian and Xapian-Docsprint from your GitHub repos and was working through the practical example in Python in your Getting Started with Xapian 1.2 guide. However after the index was generated, I wasn't able to inspect it with delve because the command was not found.
My system set-up is a fresh install of Debian 7.8.0 as a virtual machine under Virtual Box, and I
2008 Jul 22
1
Xapian across multiple servers
Hi,
We're looking to deploy Xapian as a search solution for a website
that's run across a number of load-balanced web servers. However,
I've not seen any clear indication in the documentation as to how it's
recommended to operate Xapian in a load balanced / high availability
configuration, or how to distribute the content-changes into the
indexer. Has anyone got any particular
2019 Jul 09
2
Transitioning notmuch/Xapian from 32-bit to 64-bit system
Hi!
Suppose you have a huge notmuch/Xapian database, built on a 32-bit system
(well, actually on x86_64-pc-linux-gnu, but using a years old 32-bit
notmuch binary; notmuch 0.9, Xapian 1.2.21 -- don't laugh), and suppose
you're finally going to update that years old notmuch installation
(release by release, forward-porting a bunch of patches). Naturally, I'd
now do a native 64-bit
2010 Jan 14
1
Latest revision and backwards compatibility
Greetings,
I've been wondering about the index format and backwards compatibility.
We're using the dev version (for chert) and each svn up means that any
indexes created prior to this revision cannot be read. Is this purely a
cautious move to prevent errors, and, barring any obvious index format
changes, can I safely force the current revision to read existing indexes?
eg, by
2007 May 15
1
Document ID 0 is invalid... but not always...
Note: this is rather long and not very important and I don't want to
prevent the team from releasing version 1.0, so go on reading only if
you have too much free time !!! ;-)
0 is not a valid document ID, never, ever, but I just found a special
case in which xapian will create a record and return 0 for the newly
created record.
In fact, I was "hacking", trying to store metadata
2015 Feb 23
2
Perl bindings and MatchAll / MatchNone
I've just had a quick look at the XS, but I can't see quite how to make
these (or just MatchAll, really) available through the Perl binding ...
where should I start please?
2018 Nov 30
1
Xapian Benchmark results
Hi,
I am currently trying to benchmark a multithreaded xapian implementation on
a chameleon baremetal instance written in C++. My workload is a 3 Gig
wikipedia xml dump consisting of ~286 file of different sizes. My results
are showing me that indexing on xapian is an order of magnitude faster than
my lucene and lucene plusplus implementations. This is a result that I did
not expect. Just want to
2010 Oct 14
1
Robust "online" xapian backup?
Hi.
I'm struggling a bit with getting the Xapian indexes safely stored on tape.
LVM snapshots is not really an option given the performance penalties
of that technology. And the index often encounter writes in the
time it takes for the backup to copy off the files. Thus the
index is potentially unsafe on tape.
I dont know much about Xapian's internal versioning of data, but
I suspect it
2018 Apr 29
1
Database corruption after clean rebuild
Hi notmuch developers,
I also had this database corruption, I waited for the fix to land
in notmuch 0.26.2, build it, moved the xapian directory away, did
a notmuch new and restored the tags from a dump. But the problem
remains:
~$ xapian-check ~/Mail/.notmuch/xapian
docdata:
blocksize=8K items=10841 firstunused=75 revision=82 levels=1 root=2
B-tree checked okay
docdata table structure checked
2018 Apr 07
3
Database corruption after clean rebuild
Javier Garcia <javiertury at gmail.com> writes:
> I've applied the path to notmuch 0.26.1 without success.
>
> $ rm -rf ~/.mail/.notmuch
> $ LD_LIBRARY_PATH=/hidden-path/notmuch-0.26.1/lib/:$LD_LIBRARY_PATH
> ./notmuch new
> Found 20065 total files (that's not much mail).
> Processed 20065 total files in 58s (341 files/sec.).
> Added 19605 new
2018 Apr 07
1
Database corruption after clean rebuild
Unfortunately I can't share my emails without the approval of other
parties. The minimum subsets that trigger the error are in the range of
1000-5000 mails, so asking each and everyone of them is out of my reach.
I tried to replicate the problem using just spam folders without success.
The following is a solid workaround I've stumbled upon. Afew no longer
complains and database corruption
2015 Jul 26
1
Get term from document by position
mple (see attachment).
>
> Attachments get stripped out by the mailing list, so I?ve made a private gist of the two files here: <https://gist.github.com/jaylett/ce8455b37e2b84422346>.
>
> Actually, when I run it I get 0 matches, which would explain why you?re just getting the start of the document. However if I adjust things (match the stemming strategy for TermGenerator to
2010 Jun 24
1
Quickest way to retrieve data for a large match set?
We're using the Perl binding to access Xapian in a simple search of image
metadata (title and keywords). Due to the specification for the search engine,
by default we have to sort the results using a function of the search rank,
age (well, newness) and popularity (rated by sales of the image). As a result,
we have to fetch the complete result set and then calculate a new ranking
based on
2010 Jun 21
1
How to search in many database?
Hi, I'm newbie in xapian. I just use xapian for a few week ago and I would like to know:
How I can search in many database at once time?
Please send some answer to me.
p.s. Sorry about my english.
Regrad
Mr.T
_________________________________________________________________
Hotmail: ??????????????????????????????????????????????????????????????????????
2007 Jan 27
4
Xapian vs Lucene
Hello,
It's probably quite troll-risky to put a title like this, but did anyone
take the trouble to compare Lucene to Xapian and make a list of
differences?
As I told the list at the end of last year, I'm going to have to
integrate an indexing/search engine in the coming weeks or months. It
will be integrated to Dokeos, an open-source e-learning application in
PHP, and at the moment we
2007 Oct 27
2
Remote database search issues
Hi all.
First, a note about remote database connection over Perl. We actually
found an easy way to work around the unwrapped Renote::open issue.. We
use a stub file.
You might say that open_stub in also not wrapped.. which is true...
HOWEVER... looking at the code, we realized that Database::open() opts
to using stub_open if the argument is a string pointing to a stub file
rather than a