Hi all,

I'm just testing out the capabilities of Xapian and Omega.

Environment: Fedora 15
Disk to be indexed: 2GB, FAT16 filesystem, named "New Volume"

When I add a text file to the disk by right-clicking in Fedora and choosing Create New -> Text File, the system creates the text file as expected. I added some content/words; however, xapian-omega will not index it:

    [root@Eagle1 /]# omindex --db /var/lib/omega/data/default --url /media/NEW\ VOLUME/ /media/NEW\ VOLUME/
    Text File: Skipping - Unknown extension and unrecognised format

The file it's skipping over is a true text file:

    [root@Eagle1 NEW VOLUME]# file Text\ File
    Text File: ASCII text

But if I rename the file to "Text File.txt", it is indexed.

This seems like a bug to me, since the indexer shouldn't care what the file is called; it should simply index the content (so long as it fits within the guidelines of files it's designed to index).

Am I doing something wrong?

Kind regards,
Encee.

On Fri, Jul 01, 2011 at 01:36:28PM -0700, The Dog's Bollix wrote:
> The system creates the text file as expected. I added some
> content/words, however, xapian-omega will not index it:
>
> [root@Eagle1 /]# omindex --db /var/lib/omega/data/default --url /media/NEW\ VOLUME/ /media/NEW\ VOLUME/
> Text File: Skipping - Unknown extension and unrecognised format
>
> The file it's skipping over is a true text file:
>
> [root@Eagle1 NEW VOLUME]# file Text\ File
> Text File: ASCII text

You don't seem to say what version you're using, but if it's Omega 1.2.4 or later, and you've compiled it with libmagic support, this ought to work.

What does this report?

    file --mime-type Text\ File

And if you're using Omega >= 1.2.4, check if it's built with libmagic support:

    ldd `which omindex`|grep libmagic

If not, you'll need to install the appropriate package (probably libmagic-devel, but I'm not very familiar with Fedora) and rebuild Omega.

Cheers,
Olly
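
The two checks suggested in the reply can be gathered into a small shell sketch. This is only an illustration: the function names (has_extension, report_extensionless, linked_with_libmagic) are hypothetical helpers, not part of Omega, and the sketch assumes that file and ldd are installed and that omindex is a dynamically linked binary on the PATH.

```shell
#!/bin/sh
# Hypothetical helpers for diagnosing "Unknown extension" skips.
# None of these names come from Omega itself.

# Succeed if the file name (directory part stripped) has a non-empty
# extension, e.g. "Text File.txt" but not "Text File" or ".hidden".
has_extension() {
    name=${1##*/}
    case $name in
        ?*.?*) return 0 ;;
        *)     return 1 ;;
    esac
}

# For each regular file in directory $1 without an extension, print
# the MIME type that file(1) detects from its content.
report_extensionless() {
    for path in "$1"/*; do
        [ -f "$path" ] || continue
        if has_extension "$path"; then
            continue
        fi
        printf '%s: %s\n' "$path" "$(file --mime-type -b "$path")"
    done
}

# Succeed if the program named $1 is dynamically linked against libmagic.
linked_with_libmagic() {
    ldd "$(command -v "$1")" 2>/dev/null | grep -q libmagic
}
```

After sourcing the snippet, something like `report_extensionless "/media/NEW VOLUME"` lists the extensionless files together with their detected MIME types, and `linked_with_libmagic omindex && echo yes` mirrors the ldd check from the reply.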