Hello,

I am currently looking for an IR framework for a Perl-based project. The Xapian Perl module seems up to date and nearly complete, so Xapian looks very interesting. (The only alternative I found was Lucene, but its Perl support does not feel as mature yet.)

Before diving into Xapian, I would like to know whether there is snippet highlighting support for the result list. (The searched words and fuzzily related words should be marked up.) I could not find any information (classes/methods) about it. Some mailing list postings suggest that this is on the to-do list but not yet available (only in the Omega tool/project).

So: is there highlighting support in Xapian?

Oh, btw: searching on xapian.org brings up an error:

"Warning: virtual() [function.virtual]: Unable to include '/omega.cgi?FMT=xapian.org&P=highlight' - request execution failed in /data/srv/www/xapian.org/search.php on line 13"

Also, trying to find the wishlist in Bugzilla (via the menu entry "bugs" and "Search existing bug reports") brought up a 404 error: "The requested URL /cgi-bin/bugzilla/query.cgi was not found on this server."

Thanks,
Gert
Hi Gert

On 7 Apr 2008, at 12:20, Gert Brinkmann wrote:
> Hello,
>
> I am currently looking for an IR-Framework for a perl based project. The
> xapian perl module seems very well up to date and nearly complete, so
> xapian looks very interesting. (I only did find lucene as an alternativ,
> but where perl support is not so grown up yet as it feels.)
>
> Before diving into xapian I would like to know if there is snippet
> highlighting support for the result list. (The searched word(s) and
> fuzzily related words should be marked up.) I could not find information
> (classes/methods) about it. Some mailing list postings seem that this is
> on the todo-list but not yet available (only in the tool/project omega).
>
> So: Is there highlighting support in xapian?

Xapian currently does not do highlighting, though it's possible to add it yourself. If you look back through last month's posts, you'll see I posted some C++ code that could do what you're asking. This is what I am currently using as my solution. You should be able to convert it to Perl easily enough.
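To give a rough idea of what "adding it yourself" can look like, here is a minimal Python sketch. It is not the C++ code mentioned above and it does not use any Xapian API: it simply post-processes the text of a result document. The names `crude_stem` and `highlight` are made up for this illustration, and `crude_stem` is only a stand-in for a real stemmer; in practice you would want to stem with the same algorithm that was used at index time, so that "fuzzily related" word forms match consistently.

```python
import html
import re

WORD_RE = re.compile(r"\w+")


def crude_stem(word):
    """Hypothetical stand-in for a real stemmer: lowercase the word and
    strip one common English suffix if at least 3 characters remain."""
    word = word.lower()
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def highlight(text, query_terms, open_tag="<b>", close_tag="</b>"):
    """Return HTML-escaped text in which every word that shares a crude
    stem with one of the query terms is wrapped in open_tag/close_tag."""
    stems = {crude_stem(term) for term in query_terms}
    out = []
    pos = 0
    for match in WORD_RE.finditer(text):
        # Copy (and escape) whatever sits between the previous word and this one.
        out.append(html.escape(text[pos:match.start()]))
        word = match.group(0)
        if crude_stem(word) in stems:
            out.append(open_tag + html.escape(word) + close_tag)
        else:
            out.append(html.escape(word))
        pos = match.end()
    # Copy the tail after the last word.
    out.append(html.escape(text[pos:]))
    return "".join(out)


if __name__ == "__main__":
    print(highlight("Searching the index finds searched words.", ["search"]))
```

The same loop ports to Perl almost line for line (a regex over the text plus a hash of stemmed query terms). Cutting the text down to a short snippet around the first match would be a separate step on top of this.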
On Mon, Apr 07, 2008 at 01:20:32PM +0200, Gert Brinkmann wrote:
> Oh, btw: Searching on xapian.org brings up an error:
> "Warning: virtual() [function.virtual]: Unable to include
> '/omega.cgi?FMT=xapian.org&P=highlight' - request execution failed in
> /data/srv/www/xapian.org/search.php on line 13"
>
> Also when trying to find the wishlist in bugzilla (via the menu entry
> "bugs" and "Search existing bug reports") brought up an 404 error: "The
> requested URL /cgi-bin/bugzilla/query.cgi was not found on this server."

Thanks for reporting, but we're already aware of these problems:

http://thread.gmane.org/gmane.comp.search.xapian.general/5976
http://thread.gmane.org/gmane.comp.search.xapian.general/6006

We've restored everything we can without getting data off the old server. I'm hoping everything will be working properly again this week.

Cheers,
Olly
Gert Brinkmann wrote on 4/7/08 6:20 AM:
> Hello,
>
> I am currently looking for an IR-Framework for a perl based project. The
> xapian perl module seems very well up to date and nearly complete, so
> xapian looks very interesting. (I only did find lucene as an alternativ,
> but where perl support is not so grown up yet as it feels.)
>

For the sake of completeness, you've missed some Perl projects out there if all you found was Lucene. Google for 'kinosearch' for starters.

> Before diving into xapian I would like to know if there is snippet
> highlighting support for the result list. (The searched word(s) and
> fuzzily related words should be marked up.)

I post this every time the subject comes up so forgive me if you've seen it in the archives already. Check out http://search.cpan.org/dist/Search-Tools/ -- it is IR-library independent.

--
Peter Karman . http://peknet.com/ . peter at peknet.com