Displaying 20 results from an estimated 81 matches for "termlists".
Did you mean:
termlist
2018 Jul 12
1
Error while compacting: Bad position key
Mike Hommey <mh at glandium.org> writes:
> Hi,
>
> When running `notmuch compact` today, it stopped with the following
> output:
>
> Compacting database...
> compacting table postlist
> Reduced by 25% 648656K (2498904K -> 1850248K)
> compacting table docdata
> Reduced by 15% 24K (152K -> 128K)
> compacting table termlist
> Reduced by
2020 Apr 07
2
crash after running notmuch new
Matt <mattator at gmail.com> writes:
> thanks didn't know about xapian-check !
> the output
> ===
> docdata:
> blocksize=8K items=70 firstunused=3 revision=421 levels=0 root=2
> B-tree checked okay
> docdata table structure checked OK
>
> termlist:
> blocksize=8K items=186136 firstunused=62058 revision=421 levels=2 root=12260
> B-tree checked okay
>
2016 Apr 12
2
Xapian 1.3.5 snapshot performance and index size
Olly Betts writes:
> On Mon, Apr 11, 2016 at 09:54:36AM +0200, Jean-Francois Dockes wrote:
> > The question which remains for me is if I should run xapian-compact
> > after an initial indexing operation. I guess that this depends on the
> > amount of expected updates and that there is no easy answer ?
>
> I think it's not obvious whether it's a good plan
2010 Jan 30
2
Failure trying to update document.
Hi list.
I have a specific document that does not handle updates sitting in the
index. What can I do about that?
2010-01-30T13:58:07 Eval failure: Exception: No termlist for
document 287376 at /usr/lib/perl5/Search/Xapian/Enquire.pm line 56.
2010-01-30T13:58:07 job failed. considering retry. is max_retries
of 1000 >= failures of 1?
2010-01-30T13:58:07 job failed: Exception: No
2010 Dec 18
1
Xapian index size 475GB = 170 million documents (URLs)
Xapians,
I am maintaining about two indexes for my search engines which
approximately is each the same size. I would like to share this
knowledge with you, since many of you have never seen Xapian index of
this size. And of course you can search the index by yourself at
- http://myhealthcare.com/
- http://find1friend.com/
I need 2 x 100 million more documents into each index, and I hope it
will
2009 Apr 12
2
Indexing speed benchmark - Xapian, Solr
I came across this benchmark between Xapian & Solr:
http://www.anur.ag/blog/2009/03/xapian-and-solr/
According to the benchmark, a doc set that took Solr 34 min to index took Xapian 7 hours. Solr's index is also much smaller - 2.5GB to Xapian's 8.9GB.
I'm new to Xapian. Just wondering if results like these are typical? Is indexing speed & size a known issue in Xapian? Or is
2019 Feb 03
0
Amount of writes during index creation
On Thu, Jan 31, 2019 at 08:44:44PM +0100, Jean-Francois Dockes wrote:
> I have run a number of tests, with data mostly from a project gutenberg dvd
> and other books, with relatively modest index sizes, from 1 to 24 GB.
>
> Quite curiously, in this zone, with all Xapian versions I tried, the ratio
> from index size to the amount of writes is roughly proportional to the index
>
2018 Mar 29
2
bug: "no top level messages" crash on Zen email loops
On 2018-03-29 04:17:21, Olly Betts wrote:
> On Mon, Mar 19, 2018 at 05:03:21PM -0300, David Bremner wrote:
>> I can confirm this reproduces both the xapian-check and the notmuch-show
>> error. Olly agrees that whatever notmuch is doing wrong, it shouldn't
>> lead to a corrupted database
>
> There was a Xapian bug here, which I fixed on master last week and will
>
2016 May 09
1
Given a document, how do you get its ID? (perl bindings)
I am writing an indexer that will crawl our web site. Following the
recommendation here:
https://trac.xapian.org/wiki/FAQ/UniqueIds
I'm using the URL as the unique ID for each document. I see how to get a
document from the xapian database if I know its URL, but what I need is
also to be able to find out the URL from the document. Does this mean I
need to store the URL in a value in
2018 Mar 19
2
bug: "no top level messages" crash on Zen email loops
Antoine Beaupré <anarcat at orangeseeds.org> writes:
> On 2018-03-19 13:36:49, David Bremner wrote:
>>
>> I can't duplicate that part.
>
> That's very strange. I can reproduce this on my workstation here, but
> taking the tarball I sent in the original message, I can't reproduce
> anymore. So something changed! I suspect it's the
2018 Apr 29
1
Database corruption after clean rebuild
Hi notmuch developers,
I also had this database corruption, I waited for the fix to land
in notmuch 0.26.2, build it, moved the xapian directory away, did
a notmuch new and restored the tags from a dump. But the problem
remains:
~$ xapian-check ~/Mail/.notmuch/xapian
docdata:
blocksize=8K items=10841 firstunused=75 revision=82 levels=1 root=2
B-tree checked okay
docdata table structure checked
2011 Mar 31
0
Xapian Index: 607GB = 219 million of unique documents
It took approximately five days, having single process using one core
CPU and 6GB of memory to build this giant 607GB single Xapian index,
containing 219 million of unique documents (web sites). So far I did
not found any other implementation that would enable me to build such
a single index containing over 200 million documents, while testing
Lucene, Solr, MySQL, Hadoop and Oracle. Probably
2011 May 13
0
Xapian Index 253 million documents = 704G
Xapian Index 253 million documents = 704G
I just build my largest single Xapian index with 253 million unique
documents on single server using single hard disk, less that 8G RAM
and single processor 2.0 GHz. I do not see any search performance
decreases in searching my indexes between 100 million and 250 million,
which indicates a good scalability of Xapian and it looks like, I can
push it easily
2020 Apr 07
0
crash after running notmuch new
On Tue, Apr 07, 2020 at 05:21:47PM -0300, David Bremner wrote:
> Matt <mattator at gmail.com> writes:
[...]
> > termlist:
> > blocksize=8K items=186136 firstunused=62058 revision=421 levels=2 root=12260
> > B-tree checked okay
> > termlist table structure checked OK
> >
> > postlist:
> > blocksize=8K items=2598971 firstunused=61412 revision=421
2011 Jun 10
2
Just starting to experiment with php
I took one of the examples and tried to run against my database
ls -l /data1/mail/db/cur.1
total 1129624
-rw-r--r-- 1 jwl jwl 0 2011-06-09 02:27 flintlock
-rw-r--r-- 1 jwl jwl 28 2011-06-09 02:27 iamchert
-rwxrwxrwx 1 jwl jwl 7258 2011-06-09 02:27 position.baseA
-rwxrwxrwx 1 jwl jwl 7046 2011-06-09 02:27 position.baseB
-rwxrwxrwx 1 jwl jwl 474226688 2011-06-09 02:28
2018 Apr 07
3
Database corruption after clean rebuild
Javier Garcia <javiertury at gmail.com> writes:
> I've applied the path to notmuch 0.26.1 without success.
>
> $ rm -rf ~/.mail/.notmuch
> $ LD_LIBRARY_PATH=/hidden-path/notmuch-0.26.1/lib/:$LD_LIBRARY_PATH
> ./notmuch new
> Found 20065 total files (that's not much mail).
> Processed 20065 total files in 58s (341 files/sec.).
> Added 19605 new
2012 Nov 21
1
about index speed of xapian
hi,
i use xapian to index a txt file, it's size is 268M. i take each line as a document, and each line has two field like 13445511 | 111115151. the recored size is 10000000. the XAPIAN_FLUSH_THRESHOLD set 1000000. it takes 1026544ms to index the file, it is more slower than lucene. The lucene speed is about 40000 records per second.
code:
try
{
Xapian::WritableDatabase
2013 Jun 19
2
Compact databases and removing stale records at the same time
On Wed, Jun 19, 2013, at 03:49 PM, Olly Betts wrote:
> On Wed, Jun 19, 2013 at 01:29:16PM +1000, Bron Gondwana wrote:
> > The advantage of compact - it runs approximately 8 times as fast (we
> > are CPU limited in each case - writing to tmpfs first, then rsyncing
> > to the destination) and it takes approximately 75% of the space of a
> > fresh database with maximum
2011 Jul 19
1
xapian-compact ok, xapian-check failure
Greets,
I've encountered the following while performing test merges (and writing code
to handle errors, etc so things can be automated) and wondering about the best
way to proceed:
xapian-compact -b64k -m src1 src2.... tmp_dst -- works as expected, exit code 0.
xapian-check tmp_dst -- produces the following error for the postlist:
postlist:
baseB blocksize=64K items=28175410
2011 Apr 01
0
Xapian-discuss Digest, Vol 83, Issue 1
I think this is a shining example of how well Xapian works with large
document collections. I was just discussing this with my colleagues here
and one of the issues that came up is that we'd love Xapian to become
really lot more popular but have found that the documentation's a bit
difficult to get into, as is the API.
So I was wondering: do you have any thoughts on improving this and