similar to: Cleaning the index

Displaying 20 results from an estimated 1000 matches similar to: "Cleaning the index"

2013 Aug 27
2
What does collection_freq means?
Hi, all: I am confused with the concept of colletion_freq There's no informations about it on http://xapian.org/docs/glossary.html What does it means? Thanks Regards! -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130827/370cc6a3/attachment.html>
2016 Jan 09
1
synonym expansion for boolean prefixes.
Olly Betts <olly at survex.com> writes: > On Tue, Jan 05, 2016 at 08:43:13AM -0400, David Bremner wrote: >> Olly Betts <olly at survex.com> writes: >> >> > To achieve this with synonyms in a configurable way you'd need to >> > rewrite the synonyms in the database to match the current configuration, >> > so it's not as dynamic as the
2011 Aug 09
3
what is the fastest way to fetch results which are sorted by timestamp ?
what is the fastest way to fetch results which are sorted by timestamp ? i want to use xapian as my search engine , use add_boolean_term(something) and add_value(0,sortable_serialise(get_timestamp())) to a doc. search through enquire.set_weighting_scheme(xapian.BoolWeight()) and enquire.set_sort_by_value(0,True) to ensure that the results are sorted by the timestamp. This method is ok , but
2005 Aug 05
2
uint4 in quartz/flint
We have: ---------------------------------------------------------------------- typedef unsigned long uint4; ---------------------------------------------------------------------- which on an LP64 system is a lie. If uint4 is used on disk (and it's used in the btree implementation) then it would make the databases unportable, which probably isn't a good idea (unless we already have
2010 Mar 22
0
[PATCH] Btrfs: change direct I/O read to not use i_mutex.
This depends on the change to ordered data search. Signed-off-by: jim owens <owens6336@gmail.com> --- fs/btrfs/dio.c | 150 +++++++++++++++++++++++++++++++++++++++----------------- 1 files changed, 104 insertions(+), 46 deletions(-) diff --git a/fs/btrfs/dio.c b/fs/btrfs/dio.c index b6934be..c930ff5 100644 --- a/fs/btrfs/dio.c +++ b/fs/btrfs/dio.c @@ -435,14 +435,81 @@ static void
2004 Aug 23
1
postlist chunking
Postlists are split up into chunks, so that skip_to can avoid reading all the postlist. Currently the chunk threshold is 2048, but this is checked before adding an entry, so the postlist chunk can actually grow a little larger. Something like 2060 at most. Unfortunately this isn't a good threshold with the default blocksize (8192 bytes). Internally the B-tree splits up items with a large
2010 Feb 02
1
Optimal usage of xapian-compact for merging
Greets, I've been wondering, what's the sane/optimal use of xapian-compact when merging many indexes with a view to maximum merging performance? The obvious: - only use -F on the final db. - use -m since I'm merging more than 3 dbs. Best strategy? a) loop: merge batches (of say 50, where the individual db's are small) into a temp index, then merge the (larger) temp into the
2016 Apr 12
2
Xapian 1.3.5 snapshot performance and index size
Olly Betts writes: > On Mon, Apr 11, 2016 at 09:54:36AM +0200, Jean-Francois Dockes wrote: > > The question which remains for me is if I should run xapian-compact > > after an initial indexing operation. I guess that this depends on the > > amount of expected updates and that there is no easy answer ? > > I think it's not obvious whether it's a good plan
2006 Jun 21
0
fcgi_watch (reaper replacement for shared servers)
fcgi_watch is a supplement to (or replacement for) <rails_app>/script/process/reaper. The standard reaper script searches for all processes started with a given pathname using the output from a ps command, and performs an action on them. There are problems with this approach, at least on some shared servers: - The pathname seen from a script is not necessarily the same as the
2013 Jun 19
2
Compact databases and removing stale records at the same time
On Wed, Jun 19, 2013, at 03:49 PM, Olly Betts wrote: > On Wed, Jun 19, 2013 at 01:29:16PM +1000, Bron Gondwana wrote: > > The advantage of compact - it runs approximately 8 times as fast (we > > are CPU limited in each case - writing to tmpfs first, then rsyncing > > to the destination) and it takes approximately 75% of the space of a > > fresh database with maximum
2006 Aug 12
3
Capistrano deployment pathname error
When I try to do a deployment with Capistrano I get the error "No such file to load -- pathname" when Cap tries to execute reaper. Any ideas what''s causing this or how I can troubleshoot it? (Output follows.) Thanks. Evan -----BEGIN OUTPUT----- [evans-powerbook:~/Desktop/asktheguys] evan% clear ; rake remote:deploy (in /Users/evan/Desktop/asktheguys) loading
2007 Mar 29
1
How to restart server programmatically?
I need to be able to restart my server based on a specific database change. I''ve been trying a call to system("script/process/reaper -a graceful -d dispatch.fcgi") but that raises a transaction error which rolls back the save. I wasn''t aware that after_save would rollback. That''s potentially useful but pretty much not right now. Here''s my code: def
2006 Jul 23
1
reaper
Hi there, I have been running the reaper on dreamhost and I''m not sure it is actually restarting the processes will the reaper create new processes? because all the PID''s are the same before and after I run the reaper Restarting [1344] ruby dispatch.fcgi Restarting [10172] ruby dispatch.fcgi Restarting [21592] ruby dispatch.fcgi Restarting [4931] ruby dispatch.fcgi Restarting
2006 May 21
2
Reaper and symbolic links
Hello, I''m trying to restart my application fastcgi processes using reaper. But I get the following error: $sudo -u www ./script/process/reaper Couldn''t find any process matching: /var/www/example/200605200000/public/dispatch.fcgi The problem is that it should not look for 20060520000 but for current which is a symbolic link pointing to 200605200000. I did the
2006 Jun 29
0
Restart/Deploy - Reaper can''t find PID
Hi I''ve been using Capistrano to deploy my application for around 3 months now, and in the last week or so the reaper part of the restart/deploy tasks is failing with the following message: executing task restart * executing "/home/user/myapp/current/script/process/reaper --dispatcher=dispatch.fcgi" servers: ["myapp.com"] [myapp.com] executing command **
2004 May 11
2
"Error reading block xxx: got end of file"
Xapian (0.7.5) is spitting out this error on a regular basis: org.xapian.errors.DatabaseError: Error reading block 136618: got end of=20= file =A0=A0=A0=A0=A0=A0=A0 at=20 org.xapian.XapianJNI.writabledatabase_repalce_document(Native Method) =A0=A0=A0=A0=A0=A0=A0 at=20 org.xapian.WritableDatabase.replaceDocument(WritableDatabase.java:67) I don't have a gdb backtrace, only the Java
2007 Apr 05
1
Re: [Xapian-commits] 8107: trunk/xapian-core/ trunk/xapian-core/backends/
olly wrote: > Log message (7 lines): > backends/database.cc: Database::Internal can't call the > PostingIterator(PostingIterator::Internal*) ctor (at least under > g++ 3.3.5) because it isn't a friend (only class Database is). For the record, Mark just reported this to me under windows so it was a problem there too, but it does work under GCC 4.1. No idea which compiler is
2010 Dec 18
1
Xapian index size 475GB = 170 million documents (URLs)
Xapians, I am maintaining about two indexes for my search engines which approximately is each the same size. I would like to share this knowledge with you, since many of you have never seen Xapian index of this size. And of course you can search the index by yourself at - http://myhealthcare.com/ - http://find1friend.com/ I need 2 x 100 million more documents into each index, and I hope it will
2007 Mar 30
1
Re: FLAC: same features as WavPack
To clarify, I am one of the REAPER developers. To say that FLAC itself is not broken is sort of meaningless--FLAC itself is just a specification. So yes, it is just libFLAC that is broken. I have spent QUITE a bit of time trying to sort out why it fails on 24 bit mode.. If there are some specific alignment requirements of samples, please point me to the documentation of them, but we have
2020 Apr 07
2
crash after running notmuch new
Matt <mattator at gmail.com> writes: > thanks didn't know about xapian-check ! > the output > === > docdata: > blocksize=8K items=70 firstunused=3 revision=421 levels=0 root=2 > B-tree checked okay > docdata table structure checked OK > > termlist: > blocksize=8K items=186136 firstunused=62058 revision=421 levels=2 root=12260 > B-tree checked okay >