search for: recoll

Displaying 20 results from an estimated 72 matches for "recoll".

2012 Dec 23
1
Fwd: Re: Another use for Recoll/Xapian? - AI/Eliza
People, I sent this note to JF at Recoll and he suggested asking here (his response below) - any suggestions? Thanks, Phil. -------- Original Message -------- Subject: Re: Another use for Recoll? - AI/Eliza Date: 2012-12-23 19:22 From: jf at dockes.org To: <phil at pricom.com.au> Philip Rhoades writes: > Jean, > ...
2016 Jan 14
3
Strange index consistency issue
Olly Betts writes: > On Sun, Jan 10, 2016 at 02:53:14AM +0000, Bob Cargill wrote: > > I am the recoll user mentioned in the first post above. I still have a copy > > of the (potentially) corrupted index and I did the requested testing. > > > > I ran delve -t '' ./xapiandb on the index and it returned a very long list > > of document IDs, separated by spaces. I...
2016 Jan 08
2
Strange index consistency issue
Hi, A Recoll user is reporting an index corruption problem. In general, index corruption happens from time to time with Recoll, because of crashes, reboots, misc Recoll bugs, etc. The strange thing here is that xapian-check does not seem to detect anything. In a nutshell, some document numbers seem to point t...
2017 May 17
2
Xapian 1.4.3 "Db block overwritten - are there multiple writers?"
Hi, I have a user reporting the following error during recoll indexing: flush() failed: Db block overwritten - are there multiple writers? "flush() failed" is from recoll; the rest is, I think, the text of the Xapian exception. This is with Xapian 1.4.3 on Linux (I asked for more details, should be coming). I don't think that I've eve...
2011 May 26
0
Desktopsearch "Recoll" for CentOS 5.5 64bit
Hi Folks, is there an RPM package for the desktop search tool "recoll" for CentOS 5.5 64-bit? If yes, where is it? I've tried the Fedora packages from http://www.lesbonscomptes.com/recoll/download.html#rpms but got many dependency errors. Thx, Timothy
2024 Apr 22
2
How to use Xapian Omega directly (i.e., without using `recoll` and `xapiandb`) ... Full Set Of Questions Below:
Dear senior ML members and developers of Xapian Omega, Mr. Olly has helped me cross the bump of the initial learning curve. (ref: https://lists.xapian.org/pipermail/xapian-discuss/2024-April/010034.html) How can I use Xapian Omega directly (i.e., without using `recoll` and `xapiandb`) to index a directory of text files, covering all strings longer than 3 characters, and create an index text file of the kind that typically appears at the end of a book, with locations in specific files, without using a Recoll database? I want to create an extensive list first with Xapian Omega, then have t...
2016 Apr 12
2
Xapian 1.3.5 snapshot performance and index size
...not much luck. I used a script to edit the SEQ_START_POINT value, then rebuild and install Xapian, then run the indexing. Sizes don't change much... Maybe I did something wrong, https://gist.github.com/medoc92/1ad2a232e4b36e2993ce9adc5789a60a The output follows (I edited out the unchanging recoll config dumps). Jf *******LIB***************** Tue Apr 12 10:43:14 CEST 2016 #define SEQ_START_POINT (-10) -rwxr-xr-x 1 root root 30728315 Apr 12 10:43 /usr/lib/libxapian-1.3.so.6 ************************* 452.68user 124.94system 4:42.27elapsed 204%CPU (0avgtext+0avgdata 1055204maxresident)k 0inp...
2017 Dec 07
2
xapian 1.4 performance issue
Hi, I have had reports that Recoll has become unbearably slow in some instances. After inquiry, this happens with Xapian 1.4 only, and the part which does not work any more is the snippets extraction. Recoll builds snippets by partially reconstructing documents out of index contents. For this, after determining a set of document...
2018 Sep 14
3
How to make database build threaded?
On 14/09/2018 at 09:30, Jean-Francois Dockes wrote: > Hi, > > You may be interested in how Recoll does it: > > https://www.lesbonscomptes.com/recoll/idxthreads/threadingRecoll.html > > A few things in the document are slightly obsolete (esp. the last > paragraph: recollindex now does use vfork()), but it's overall quite close > to how the current indexer works. > ...
2008 Dec 21
0
Profiling a recoll stress-test
Hi all, I don't know if you're familiar with recoll, it's a very handy xapian based desktop search engine system. I'm trying to index a really big folder containing lots of files (18M), the disk size is ~220Gig. The files are quite small text files (mean size ~ 1K). The OS is latest leopard. I think the process is io-bound. 63061...
2019 Aug 26
2
Commit error with Xapian 1.4.11
A Recoll user gets the following message while indexing: "Attempted to delete or modify an entry in a non-existent posting list for #bannerholder" The exception happens during a commit call. Xapian version 1.4.11, Debian Buster A little more detail here: https://opensourceprojects.eu/p/recoll1/...
2017 Dec 08
2
xapian 1.4 performance issue
Olly Betts writes: > On Thu, Dec 07, 2017 at 10:29:09AM +0100, Jean-Francois Dockes wrote: > > Recoll builds snippets by partially reconstructing documents out of index > > contents. > > > [...] > > > > The specific operation which has become slow is opening many term position > > lists, each quite short. > > The difference will actually be chert vs...
2016 Jan 10
2
Strange index consistency issue
Olly Betts <olly <at> survex.com> writes: > > You could try: > > delve -t '' ./xapiandb > > That will list the document lengths, so you can see if document 6 is in > that list or not. I am the recoll user mentioned in the first post above. I still have a copy of the (potentially) corrupted index and I did the requested testing. I ran delve -t '' ./xapiandb on the index and it returned a very long list of document IDs, separated by spaces. I then ran delve -t '' ./xapiandb | gr...
2016 Jan 14
2
Strange index consistency issue
...If that bug is not the cause, it would be good to get to the bottom of this - > > > the database shouldn't become corrupt like this. > > > > I remembered something: I could only reproduce issue #645 with separate > > read/write database objects, but this one is with recoll 1.21, which uses a > > single object, so maybe a different problem. > > The underlying bug for #645 was that cursors weren't getting rebuilt in > some situations where they needed to be, and could end up with bad data > in, and that bad data could be stale data. So it's...
2016 Dec 10
6
Plain requirement: desktop search
Just wondering, what exactly is supported/suggested: I need a comprehensive desktop search functionality. Not only searching for file names but also for content and meta data. The environment is EL6.8 / Gnome2. I have noticed that "beagle" is not part of the distro anymore. Any suggestions for such requirement? Thanks! LF
2024 Mar 15
1
Using multiple temporary indexes during updates
On Fri, Mar 15, 2024 at 08:15:55PM +0100, Jean-Francois Dockes wrote: > I have been playing at converting the index update stage of the Recoll indexer to use > multiple temporary indexes and a final merge. > > This yields an improvement factor of almost 3 (on my quad-core CPU), for the total > indexing time for "easy" files like HTML pages. This is nice (!) and I wanted to share my > admiration for the "comp...
2018 Sep 13
2
How to make database build threaded?
Hi everybody, I'm the author of a small C++11 program called XDGSearch. The source code is hosted on Github; for a quick overview you can visit this link https://github.com/frank67/XDGSearch/blob/master/README.md I'm writing to the mailing list because I'd like to split the database build process across multiple threads. Is it possible? If you are a C++ programmer you can take a look at
2019 Jan 21
2
Amount of writes during index creation
Hi, I have had a problem report from a Recoll user about the amount of writes during index creation. https://opensourceprojects.eu/p/recoll1/tickets/67/ The issue is that the index is on SSD and that the amount of writes is significant compared to the SSD life expectancy (index size > 250 GB). From the numbers he supplied, it seems t...
2017 May 22
2
Xapian 1.4.3 "Db block overwritten - are there multiple writers?"
Olly Betts writes: > On Wed, May 17, 2017 at 09:08:32PM +0200, Jean-Francois Dockes wrote: > > I have a user reporting the following error during recoll indexing: > > > > flush() failed: Db block overwritten - are there multiple writers? > > > > "flush() failed" is from recoll, the rest is, I think the text of the Xapian > > exception. > > > > This is with Xapian 1.4.3 on Linux (I as...
2024 Mar 15
1
Using multiple temporary indexes during updates
Hi, I have been playing at converting the index update stage of the Recoll indexer to use multiple temporary indexes and a final merge. This yields an improvement factor of almost 3 (on my quad-core CPU), for the total indexing time for "easy" files like HTML pages. This is nice (!) and I wanted to share my admiration for the "compact()" method. If s...
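The "multiple temporary indexes and a final merge" pattern mentioned in the last result can be illustrated with a minimal sketch. It uses plain Python dictionaries as stand-in indexes and is not Xapian's API or Recoll's actual indexer; the function names, the sample documents and the shard count are all hypothetical. Threads are used only to keep the example short and are not meant to reproduce the reported speedup.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def build_partial_index(docs):
    """Index one shard: map each term to the sorted doc ids containing it."""
    postings = defaultdict(set)
    for doc_id, text in docs:
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


def merge_indexes(partials):
    """Final merge: union the posting lists of all temporary indexes."""
    merged = defaultdict(set)
    for partial in partials:
        for term, ids in partial.items():
            merged[term].update(ids)
    return {term: sorted(ids) for term, ids in merged.items()}


def index_in_shards(documents, shard_count=4):
    """Split documents into shards, index each in a worker, then merge."""
    shards = [documents[i::shard_count] for i in range(shard_count)]
    with ThreadPoolExecutor(max_workers=shard_count) as pool:
        partials = list(pool.map(build_partial_index, shards))
    return merge_indexes(partials)


# Hypothetical sample data, for illustration only.
documents = [
    (1, "recoll uses xapian"),
    (2, "xapian compact merges indexes"),
    (3, "recoll indexes html pages"),
]
index = index_in_shards(documents, shard_count=2)
```

Because each temporary index is built independently, the workers never write to a shared structure; all combining happens in the single merge step at the end.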