Displaying 20 results from an estimated 1000 matches similar to: "confusion about term prefixes"
2012 Dec 29
3
omindex killed
I'm finding that omindex is consistently ending prematurely when
indexing certain files. The last output looks like this:
[Entering directory /compounds/Acetic_acid]
Indexing "/MATLAB/compounds/Acetic_acid/AACID_50T.TXT" as text/plain ...
added.
Indexing "/MATLAB/compounds/Acetic_acid/AACID_50T.pdf" as
application/pdf ... "pdftotext -enc UTF-8
2012 Dec 30
1
combining databases for omega
From the documentation I've read, omega can read from multiple
databases, but I'm not sure how to go about this.
I have three databases created using omindex, currently located at
/var/lib/xapian-omega/data/share, /var/lib/xapian-omega/data/users, and
/var/lib/xapiax-omega/data/management
The quickstart guide says that in omega.conf database_dir should point
to the directory
2012 Dec 20
1
omega with samba share
Hello,
I have installed and configured omega to index samba shares on a debian server. I would like to know if it's possible to change the HREF links on the search results page to begin "file://" instead of "http://"
I had a look at the templates files and there was no obvious solution that I could see.
Thanks.
--
Chris Purves
Visit my blog: http://chris.northfolk.ca
2013 Apr 05
1
problems with indexing xlsx files
Hello,
I have a number of Excel .xlsx files that aren't indexed properly. To illustrate, I have a file called "this is a test.xlsx". It consists of four cells:
| this |
| is |
| a |
| test |
It gets indexed but I am unable to search for it.
I was able to determine the index number and use delve to see the term list:
#delve users -r 16496
Term List for record #16496:
2010 Feb 26
2
[LLVMdev] RegisterScavenging on targets without subregisters
No, I wasn't having a management lobotomy moment. If the target's registers
have no subregisters, SubUsed is false and the assert gets tripped.
Ok, back to the original question: What was the original intent in this code
(lines 186-193 in lib/CodeGen/RegisterScavenging.cpp)?
-scooter
On Thu, Feb 25, 2010 at 7:00 PM, Scott Michel <scooter.phd at gmail.com> wrote:
> Ugh.
2005 Jan 14
0
Term prefixes (was: Xapian Feedback)
I wrote:
> I think it's a bug. Or at least QueryParser uses a rather delicate rule
> for when to add a ":" between the prefix and the term, which scriptindex
> doesn't implement. The rule is undocumented (except in the code) so
> it's arguable who is correct.
I've been looking at this some more.
We need some way to distinguish the term prefix from the term
2011 Oct 18
2
patch proposal: omindex library or daemon
Olly (looking at commit logs, I think this is your dept :-)
For apps which re/index files frequently and need format conversion, I'd
like to propose a patch for one of...
Omindex library (thread safe):
Omindex::init(options) // struct Omindex::options { ... }
initialize mime_map, store default options
session = new Omindex::Session(db_pathname)
user threads use different sessions
2012 Dec 13
1
omindex one file at a time?
Hi, all -- I want to do Plain Old Omindex'ing *but* the mapping
between my documents' filenames and the URLs where I hope search
users to find them is, uh..., strange. The simplest thing (to
me) would be to run omindex for each document, e.g.
omindex --no-delete -U /cool-url-1 /funky/doc/file-blah.pdf
omindex --no-delete -U /cool-url-7 /doc/funky/ohmy/blah-file.txt
... and so on...
2014 Mar 11
2
[GSOC 2014] Indexing INEX dataset
On Tue, Mar 11, 2014 at 12:02:15PM +0100, Parth Gupta wrote:
> During the indexing with omindex, only you need to make sure is indexing
> with prefix 'S' for title as explained here in Letor documentation:
> xapian-letor/docs/letor.rst
>
> Previously when I edited omindex.cc it was modified as can be seen
>
2016 Sep 27
1
omega issues/notes
All,
I've run into a couple of things using omega/omindex under cygwin. I don't
think I'd attribute them to xapian, omega or omindex, but wanted to get
them out to the list so that if anyone else should run into these things
down the road, hopefully someone will remember and be able to help.
1) after compiling and building omega, and doing make install, I get a set
violation when
2016 Jan 11
0
Security permissions issues after changing idmap backend from RID to AUTORID
Thanks Michael,
Please see the inline answers.
> On Jan 10, 2016, at 5:16 PM, Michael Adam <obnox at samba.org> wrote:
>
> On 2016-01-10 at 17:58 +0000, Rowland penny wrote:
>> On 10/01/16 17:05, Partha Sarathi wrote:
>>>
>>>> This could have a lot to do with the fact that idmap_rid &
>>>> idmap_autorid calculate the uids differently
2006 Aug 20
1
omindex patch
Attached is my rather largish omindex.cc patch with ChangeLog.
It needs autoreconf to update configure and the Makefiles.
Note that unrar is not patent infected, only rar, the compressor.
I've put some AC_PATH_PROG checks into configure for all helpers.
The patch is not yet complete.
2006-08-18 15:13:32 Reini Urban <reinhard.urban at avl.com>
omega-0.9.6b:
* omindex.cc: last_mod as
2005 Mar 31
1
omindex and scriptindex question
Hi,
I was researching indexing of text in omindex and scriptindex.
While indexing text with omindex.cc possition of terms is saved with gap.
This is not happening with scriptindex.cc
While this is happening ?
Another question is why in omindex.cc the term possition starts with 0 while
in scriptindex it starts from 1 ?
Code snippet from omindex.cc
// Add postings for terms to the document
2009 Feb 02
2
Ticket #282: omindex-assorted-enhancements.patch woes
I would really like to try out the features in the patch above. But I
can't ever seem to get the resulting omindex.cc to "make".
I tried updating to rev 10801 from the SVN then run /bootstrap but then
I seem to get errors compiling everything when I try and do "make" (I'm
using ubuntu 8.10).
So I thought I'd try an apply the patch to the latest stable version
2013 May 15
1
How to omindex some sub-directories?
Given a directory tree like ...
/foo
|
+-- A
|
+-- B
|
+-- C
... what is the best way to index A and C into a single Xapian database?
AFAIK the alternatives are:
omindex --db /my_db --no-delete /foo /foo/A
omindex --db /my_db --no-delete /foo /foo/B
or
omindex --db /my_A_db /foo /foo/A
omindex --db /my_B_db /foo /foo/B
xapian-compact /my_A_db /my_B_db /my_db
The first alternative does not
2009 May 19
4
omindex options
Hi.
I am writing a python equivalent of omindex (we are using scriptindex
currently - but I wanted to use omindex instead, and extend it to work with
our internal file format.. BUT did not want to compile code if possible...
so anyway).
I have tried to keep the code as close to possible to the omindex native
code, but am facing a bit of confusion: what exactly is the reason for
omindex to take
2004 Dec 17
2
Omega changes
I propose making a few changes to the way omega (and omindex) operate.
I'm posting these to the list before doing so to check if they'll cause
obvious problems for anyone.
1) Configuration handling for omega. Omega has a configuration file,
which specifies where databases, templates and logfiles are to be found.
It currently looks for this configuration file in its current working
2017 Apr 23
2
Question about the ticket #743 omindex: delay libmagic checks
>
> I'd suggest to start with you just look at moving the libmagic check after
> the filesize checks, so you don't need to get into whether libmagic or
> the database check is cheaper on average.
hi, Olly, I have moved the libmagic check after the filesize check directly,
https://github.com/caiyulun/xapian/commit/3a97d9ee5397fa900a473aa9b3d8eeb720177a4e
can you provide
2009 Apr 06
2
omindex => Unknown extension
Hi all,
I'm having a recurrent problem with Omega's indexing.
When I run omindex, it sometimes misses to recognize the extension of
some files (.doc, .pdf) and skips them. In the same run, omindex is
otherwise perfectly able to index other files with same extensions. The
reason is not clear but it should occur before it selects a content
converter since for example, if I manually run
2017 Apr 20
2
Question about the ticket #743 omindex: delay libmagic checks
Hi,
I'm working on the ticket #743 omindex: delay libmagic checks
<https://trac.xapian.org/ticket/743>. As the ticket's
Description mention, the call to libmagic is expensive than call the stat,
so we can check the size by call the stat to get size before call
libmagic to get a mime type.
But how about the timestamps check? since timestamps check need to iterate
the DB to check if