similar to: Xapian document matching

Displaying 20 results from an estimated 700 matches similar to: "Xapian document matching"

2012 Dec 08
2
Want to contribute code to the Xapian project
Hey guys,I am a 3rd year Computer Science undergrad student.I a extremely interested in contributing code to the XAPIAN project. The work you people do sounds extremely fascinating and interesting.Can someone just give me a brief overview of how to proceed ?. I Can code in C,C++ and Python and have experience in Natural Lanuage Processing.Am also quite comfortable with NLTK and using Wordnet.Am
2011 Jun 14
2
Facets with Multiple Values
I'm trying to determine the best way to index facets with multiple values. For example, let's say we have several books and we want an Author facet where one can filter the books by the author. The problem is each book can have multiple authors. Is there a way to index multiple author names into a single value slot for each book? I see there is a Xapian::StringListSerialiser class that
2013 Mar 06
1
Xapian web fronends
Hi, Is there any php or other web (not cli) front end (same as omega) for Xapian index? I would like to use Xapian for an intranet search portal, and modify the style of it. Many thanks, Omer
2010 Apr 16
2
best practices - combining sql database and xapian, size of database?
Newbie-alert: I'm just getting started on a new project involving a full text search requirement, and my initial investigation points to xapian being the way to go. Two questions: - eventually I'll most likely be indexing towards 50 million documents - is this reasonable to expect or attempt with xapian? - each of my documents come with a set of attributes. These are easily stored
2005 Sep 08
1
C API
Hi. I am one of the developers of Swish-e (http://swish-e.org/), an indexing/search tool similar to Xapian. I am currently researching future development directions for the Swish-e project. Three of our most often-requested features are UTF-8 support, incremental indexing, and large (multimillion) doc sets, all of which seem to be ably handled in the Xapian library. So one possible
2010 Jul 09
3
multiple value with the same number?
I am working on a system for indexing email, and it would be nice to be able to store multiple values for the same value_no - for instance, if an email has multiple attachments, I might want to attach a value for each attachment, but a value with the same number. One option would be store the three values as one, e.g. comma-separated, but I was just wondering how others have dealt with such a
2009 Nov 15
1
setting ValueRangeProcessor at runtime
I want to set a variable number and type of ValueRangeProcessors at run time based on a configuration file. But I seem to be running into (what I think is) a C++ scope issue. I tried just a simple test to see if I could add 5 VPs in a loop. Xapian::QueryParser qparser; Xapian::Query query; int vp = 0; while(vp < 5) { Xapian::StringValueRangeProcessor sproc(vp++);
2014 Jan 27
4
Perl Search::Xapian
Hi, Trying to learn Search::Xapian and be better at perl at the same time, I'm stuck, at the DB_CREATE_OR_OPEN error. Perl says this: ~/dev/sandbox/Xapian-perl$ ./Index1-Xap.pl 100-objects-v1.csv db "db" is not exported by the Search::Xapian module Can't continue after import errors at ./Index1-Xap.pl line 7. BEGIN failed--compilation aborted at ./Index1-Xap.pl line 7. What I
2010 Jan 01
1
Document values vs data
In a recent post, someone asked about storing "metadata" in a document. My guess would have been to use add_value. Olly's recommendation was to use set_data. What are the general guidelines for deciding whether to use values or data in a document? Garrett
2010 Feb 22
2
Accents problem
Hi, I've a problem with the accents. How do I configure Xapian to ignore the accents in the search/indexes? Thanks, Nuno Mariz
2010 Dec 15
2
excluding child folders in omindex search
hi there, is there an option to exclude child folders when running omindex? For example: omindex -p --db /var/blah/default --url /something /var/www --exclude /var/www/ignore Thanks, Jeff
2015 Jun 20
1
Xapian with Json
Hi ! I?m newbie with Xapian ! I?m looking for the best way to index json data to be able to search by name/value pair and Full Text Search using only Xapian. I did haven?t success to found a complete or a good documentation about json index and search. Do you know some documentation about ? Another problem that I have is that: In the same name/value pair sometimes I have different data types
2009 Dec 17
4
Text snippets
Hello, Is there advancements in snippeting? (Besides what mentioned in the wiki.) I think extracting snippets is clearly IR task. And I hope Xapian will provide at least helpers to do that. I have set of documents up to 5M of extracted text and 1M in average (they are even bigger pdfs but I pre-extracted text into some sort of text cache, pdftotext is very slow). To parse ~1M documents on the fly
2009 Dec 01
0
SWISH::Prog::Xapian
Early[0] implementations of Swish3 for Perl are now available on CPAN. http://search.cpan.org/dist/SWISH-Prog-KSx/ http://search.cpan.org/dist/SWISH-Prog-Xapian/ These Perl implementations offer KinoSearch[1] or Xapian[2] as alternate backend libraries. SWISH::Prog::Xapian should build indexes compatible with the swish_xapian tool that comes with libswish3. Feedback welcome. [0] They are
2011 Jan 18
1
Loading wordnet in R
I ve installed wordnet 2.1 and R 2.11.0 on windows 7. Whenver i try to load wordnet in R, i get an error initDict() FALSE cannot find wordnet 'dict' directory, Please set the WNHOME variable to its parent. I have tried setting WNHOME to "C:\Program Files (x86)\WordNet\2.1\dict","C:\\Program Files (x86)\\WordNet\\2.1\\dict","C:\\\\Program Files
2008 Sep 30
1
problem loading WordNet into R
Hello, I am hoping someone can assist me with a problem I have trying to load the WordNet dictionary into R. Below is the error message I receive: > library(wordnet) > initDict() [1] FALSE Warning message: In initDict() : cannot find WordNet 'dict' directory: please set the environment variable WNHOME to its parent I have entered the Variable value via: control -> panel
2009 Aug 20
1
Problems with loading 'wordnet' in a standalone pc
Hi group, I have the following error code after submitting library(wordnet) in a standalone pc. rJava is already in place. Error in .jpackage(pkgname,lib.loc=libname) unused argument(s) (lib.loc=libname) Error: .onLoad failed in 'loadNamespace' for 'wordnet' Error: package/namespace load failed for 'wordnet' My guess is I miss something from the namespace but
2007 Jan 27
4
Xapian vs Lucene
Hello, It's probably quite troll-risky to put a title like this, but did anyone take the trouble to compare Lucene to Xapian and make a list of differences? As I told the list at the end of last year, I'm going to have to integrate an indexing/search engine in the coming weeks or months. It will be integrated to Dokeos, an open-source e-learning application in PHP, and at the moment we
2009 Aug 15
1
extracting text from wordnet using synonym()
Dear group, There are postings on extracting partial string using sub()/gsub()/gsubfn() but I still couldn't figure this one out. Supposed I am pulling the synonyms of "help" using synonyms("help") from wordnet and get the followings: > synonyms("help") [1] "c(\"aid\", \"assist\", \"assistance\", \"help\")"
2006 Jul 10
3
Plurals and synonym lists
I want to correct spelling errors automatically. I have used search in the past where I can pass an argument through standard search to correct a word with up to 2 spelling errors for example or do the more Google like "Did ya mean?". In this case I just want to change it automatically and search. I am not too interested in specifying the number of characters it is out by. What is