Displaying 20 results from an estimated 50000 matches similar to: "omindex alternatives?"
2011 Oct 18
2
patch proposal: omindex library or daemon
Olly (looking at commit logs, I think this is your dept :-)
For apps which re/index files frequently and need format conversion, I'd
like to propose a patch for one of...
Omindex library (thread safe):
Omindex::init(options) // struct Omindex::options { ... }
initialize mime_map, store default options
session = new Omindex::Session(db_pathname)
user threads use different sessions
2013 May 15
1
How to omindex some sub-directories?
Given a directory tree like ...
/foo
|
+-- A
|
+-- B
|
+-- C
... what is the best way to index A and C into a single Xapian database?
AFAIK the alternatives are:
omindex --db /my_db --no-delete /foo /foo/A
omindex --db /my_db --no-delete /foo /foo/B
or
omindex --db /my_A_db /foo /foo/A
omindex --db /my_B_db /foo /foo/B
xapian-compact /my_A_db /my_B_db /my_db
The first alternative does not
2012 Dec 13
1
omindex one file at a time?
Hi, all -- I want to do Plain Old Omindex'ing *but* the mapping
between my documents' filenames and the URLs where I hope search
users to find them is, uh..., strange. The simplest thing (to
me) would be to run omindex for each document, e.g.
omindex --no-delete -U /cool-url-1 /funky/doc/file-blah.pdf
omindex --no-delete -U /cool-url-7 /doc/funky/ohmy/blah-file.txt
... and so on...
2009 Feb 02
2
Ticket #282: omindex-assorted-enhancements.patch woes
I would really like to try out the features in the patch above. But I
can't ever seem to get the resulting omindex.cc to "make".
I tried updating to rev 10801 from the SVN then run /bootstrap but then
I seem to get errors compiling everything when I try and do "make" (I'm
using ubuntu 8.10).
So I thought I'd try an apply the patch to the latest stable version
2010 Dec 15
2
excluding child folders in omindex search
hi there,
is there an option to exclude child folders when running omindex?
For example:
omindex -p --db /var/blah/default --url /something /var/www --exclude
/var/www/ignore
Thanks,
Jeff
2009 Apr 06
2
omindex => Unknown extension
Hi all,
I'm having a recurrent problem with Omega's indexing.
When I run omindex, it sometimes misses to recognize the extension of
some files (.doc, .pdf) and skips them. In the same run, omindex is
otherwise perfectly able to index other files with same extensions. The
reason is not clear but it should occur before it selects a content
converter since for example, if I manually run
2009 May 19
4
omindex options
Hi.
I am writing a python equivalent of omindex (we are using scriptindex
currently - but I wanted to use omindex instead, and extend it to work with
our internal file format.. BUT did not want to compile code if possible...
so anyway).
I have tried to keep the code as close to possible to the omindex native
code, but am facing a bit of confusion: what exactly is the reason for
omindex to take
2009 Jun 20
3
omindex hangs while scanning
Hello,
I was looking for a search engine for a small internal documentation
site and found xapian and
omega. Downloaded and compiled it using msys and ming on a german
windows xp system. Finally
installed apache on the same box.
Following the omega example I copied the book to .../apache/htdocs and
startet the omindex
which hang up on the first document found. Even on very short doc with
2006 Oct 02
1
Omindex.cc BSD bug
Hi guys:
I was trying to index a large set of PDF documents using omindex
and the system started to run out of forks (sh: fork temporarily
unavailable) making the system unusable and probably skipping documents.
I'm using MAC Osx Server 10.4.3 (Darwin/BSD) and GCC 4.0.
The problem: On function stdout_to_string a popen is called, but is not
closed properly (according the popen
2017 Apr 23
2
Question about the ticket #743 omindex: delay libmagic checks
>
> I'd suggest to start with you just look at moving the libmagic check after
> the filesize checks, so you don't need to get into whether libmagic or
> the database check is cheaper on average.
hi, Olly, I have moved the libmagic check after the filesize check directly,
https://github.com/caiyulun/xapian/commit/3a97d9ee5397fa900a473aa9b3d8eeb720177a4e
can you provide
2005 Mar 31
1
omindex and scriptindex question
Hi,
I was researching indexing of text in omindex and scriptindex.
While indexing text with omindex.cc possition of terms is saved with gap.
This is not happening with scriptindex.cc
While this is happening ?
Another question is why in omindex.cc the term possition starts with 0 while
in scriptindex it starts from 1 ?
Code snippet from omindex.cc
// Add postings for terms to the document
2006 Aug 20
1
omindex patch
Attached is my rather largish omindex.cc patch with ChangeLog.
It needs autoreconf to update configure and the Makefiles.
Note that unrar is not patent infected, only rar, the compressor.
I've put some AC_PATH_PROG checks into configure for all helpers.
The patch is not yet complete.
2006-08-18 15:13:32 Reini Urban <reinhard.urban at avl.com>
omega-0.9.6b:
* omindex.cc: last_mod as
2011 Apr 27
2
Omindex: what are the default numbered indexes?
> -----Original Message-----
> Date: Tue, 26 Apr 2011 13:35:20 +0100
> From: James Aylett <james-xapian at tartarus.org>
> Subject: Re: [Xapian-discuss] Omindex: what are the default numbered
> indexes?
> To: <xapian at catcons.co.uk> <xapian at catcons.co.uk>
> Cc: 'Xapian Discussion' <xapian-discuss at lists.xapian.org>
> Message-ID:
2007 Jul 12
1
omega: omindex behaviour with duplicate files
Hi all
I need a little clarification with regard to Omega's behaviour with
'duplicate' files when running 'omindex'.
How is a duplicate recognised? Is it simply by file path? How is an
unmodified file detected, if at all?
I would like to set up subversion post-commit hook to update my index.
If possible I would like to just update the index with the newly
commited files.
2017 Apr 20
2
Question about the ticket #743 omindex: delay libmagic checks
Hi,
I'm working on the ticket #743 omindex: delay libmagic checks
<https://trac.xapian.org/ticket/743>. As the ticket's
Description mention, the call to libmagic is expensive than call the stat,
so we can check the size by call the stat to get size before call
libmagic to get a mime type.
But how about the timestamps check? since timestamps check need to iterate
the DB to check if
2012 Dec 29
3
omindex killed
I'm finding that omindex is consistently ending prematurely when
indexing certain files. The last output looks like this:
[Entering directory /compounds/Acetic_acid]
Indexing "/MATLAB/compounds/Acetic_acid/AACID_50T.TXT" as text/plain ...
added.
Indexing "/MATLAB/compounds/Acetic_acid/AACID_50T.pdf" as
application/pdf ... "pdftotext -enc UTF-8
2004 Dec 10
0
Omindex and symlinks
I've just been playing with using omindex to build an index of the
documentation in /usr/share/doc on my Debian workstation. It's all
working pretty well - indexing took only a few minutes, reindexing
appears to work acceptably, and the search results are okay. The main
problem I'm seeing is a lot of duplicate results due to directory
symlinks. Omindex's current behaviour is to
2019 Jun 14
2
Text-Extraction Libraries for Omindex
This is a list with some libraries that I have been looking at.
The idea is to discuss the advantages and disadvantages of adding some of
these libraries to Xapian.
If anyone knows another library that could be add to the list it would be
great!
Libfreexl:
* For Excel (.xls)
* Last release: 2018-02
* Info: gaia-gis.it/fossil/freexl/index
* License: MPL tri-license
2024 Apr 22
1
Fwd: Is there a front-end for using xapian-omega rather than the terminal? Could a Xapian database be accessed from web-browsers?
On Thu, Apr 18, 2024 at 03:46:53PM +0530, Susmita/Rajib wrote:
> On Thu, 18 Apr 2024 at 06:24, Olly Betts <olly at survex.com> wrote:
> > You may need to configure apache to tell it to use this for a URL on
> > your site.
>
> How Mr. Betts? Suppose I have a ready open document text file, a
> LibreOffice text file. say BettsBio.odt, for which I want to create an
>
2012 Apr 27
4
GSoC xapian node binding
Posting recent offline discussion...
On Fri, Apr 27, 2012 at 10:55 AM, Marius Tibeica <mtibeica at gmail.com> wrote:
> Hi Liam,
>
> I've added the Enquire class and designed a query spec structured as a JS
> object. Hope you like it :)
> I'll probably be off a few days (there is a national holiday Tuesday which
> means i have a long weekend :D) but maybe I'll