Displaying 20 results from an estimated 8000 matches similar to: "C API"
2009 Dec 01
0
SWISH::Prog::Xapian
Early[0] implementations of Swish3 for Perl are now available on CPAN.
http://search.cpan.org/dist/SWISH-Prog-KSx/
http://search.cpan.org/dist/SWISH-Prog-Xapian/
These Perl implementations offer KinoSearch[1] or Xapian[2] as alternate backend
libraries.
SWISH::Prog::Xapian should build indexes compatible with the swish_xapian tool
that comes with libswish3.
Feedback welcome.
[0] They are
2009 Nov 15
1
setting ValueRangeProcessor at runtime
I want to set a variable number and type of ValueRangeProcessors at run time
based on a configuration file. But I seem to be running into (what I think is) a
C++ scope issue.
I tried just a simple test to see if I could add 5 VPs in a loop.
Xapian::QueryParser qparser;
Xapian::Query query;
int vp = 0;
while(vp < 5) {
Xapian::StringValueRangeProcessor sproc(vp++);
2011 Jun 14
2
Facets with Multiple Values
I'm trying to determine the best way to index facets with multiple
values. For example, let's say we have several books and we want an
Author facet where one can filter the books by the author. The problem
is each book can have multiple authors. Is there a way to index
multiple author names into a single value slot for each book?
I see there is a Xapian::StringListSerialiser class that
2007 Apr 30
1
Xapian document matching
Hi, i'm wondering is there a possibility to do like ABCSok do
(http://nyheter.abcsok.no/), to make "Main article" and "Same articles"
collapsed to it.
Like on http://news.google.com/?hl=en the same thing. "Parent" and "same
article on other sites" (they do differ from each other a little bit).
Maybe somebody know how to do that thing or where to read
2013 Mar 06
1
Xapian web fronends
Hi,
Is there any php or other web (not cli) front end (same as omega) for
Xapian index?
I would like to use Xapian for an intranet search portal, and modify the
style of it.
Many thanks,
Omer
2010 Apr 16
2
best practices - combining sql database and xapian, size of database?
Newbie-alert: I'm just getting started on a new project involving a
full text search requirement, and my initial investigation points to
xapian being the way to go.
Two questions:
- eventually I'll most likely be indexing towards 50 million
documents - is this reasonable to expect or attempt with xapian?
- each of my documents come with a set of attributes. These are easily
stored
2014 Jan 27
4
Perl Search::Xapian
Hi,
Trying to learn Search::Xapian and be better at perl at the same time,
I'm stuck, at the DB_CREATE_OR_OPEN error. Perl says this:
~/dev/sandbox/Xapian-perl$ ./Index1-Xap.pl 100-objects-v1.csv db
"db" is not exported by the Search::Xapian module
Can't continue after import errors at ./Index1-Xap.pl line 7.
BEGIN failed--compilation aborted at ./Index1-Xap.pl line 7.
What I
2012 Dec 08
2
Want to contribute code to the Xapian project
Hey guys,I am a 3rd year Computer Science undergrad student.I a extremely
interested in contributing code to the XAPIAN project. The work you people
do sounds extremely fascinating and interesting.Can someone just give me a
brief overview of how to proceed ?. I Can code in C,C++ and Python and
have experience in Natural Lanuage Processing.Am also quite comfortable
with NLTK and using Wordnet.Am
2010 Jul 09
3
multiple value with the same number?
I am working on a system for indexing email, and it would be nice to be
able to store multiple values for the same value_no - for instance, if
an email has multiple attachments, I might want to attach a value for
each attachment, but a value with the same number.
One option would be store the three values as one, e.g. comma-separated,
but I was just wondering how others have dealt with such a
2010 Feb 22
2
Accents problem
Hi,
I've a problem with the accents. How do I configure Xapian to ignore
the accents in the search/indexes?
Thanks,
Nuno Mariz
2010 Dec 15
2
excluding child folders in omindex search
hi there,
is there an option to exclude child folders when running omindex?
For example:
omindex -p --db /var/blah/default --url /something /var/www --exclude
/var/www/ignore
Thanks,
Jeff
2009 Dec 17
4
Text snippets
Hello,
Is there advancements in snippeting? (Besides what mentioned in the wiki.) I think extracting snippets is clearly IR task. And I hope Xapian will provide at least helpers to do that. I have set of documents up to 5M of extracted text and 1M in average (they are even bigger pdfs but I pre-extracted text into some sort of text cache, pdftotext is very slow). To parse ~1M documents on the fly
2007 Jan 27
4
Xapian vs Lucene
Hello,
It's probably quite troll-risky to put a title like this, but did anyone
take the trouble to compare Lucene to Xapian and make a list of
differences?
As I told the list at the end of last year, I'm going to have to
integrate an indexing/search engine in the coming weeks or months. It
will be integrated to Dokeos, an open-source e-learning application in
PHP, and at the moment we
2015 Jun 20
1
Xapian with Json
Hi !
I?m newbie with Xapian !
I?m looking for the best way to index json data to be able to search by name/value pair and Full Text Search using only Xapian.
I did haven?t success to found a complete or a good documentation about json index and search.
Do you know some documentation about ?
Another problem that I have is that: In the same name/value pair sometimes I have different data types
2010 Jan 01
1
Document values vs data
In a recent post, someone asked about storing "metadata" in a
document. My guess would have been to use add_value. Olly's
recommendation was to use set_data.
What are the general guidelines for deciding whether to use values or
data in a document?
Garrett
2006 Mar 17
1
omega crawler: ht://dig or wget?
At wiki page: http://wiki.xapian.org/Omega
I added a comment that ht://Dig looks like dead.
Does anybody really use it?
>From brief glance at docs I had a feeling it is not easy to configure.
Maybe better crawler is GNU wget? Mature, stable, maintained?
--
Peter Masiar
2001 Sep 30
1
wine won't load anything
I have been able to install Starcraft and Swish 2.0 using wine (under
Mandrake 8.0) but when I try to run either program I get a "can't load
*.dll" error. Both errors are from dll files that the are in the program
directory itself. I have attempted to specify these dlls like all of the
default dlls in the wine config file but it still can't find them. Anyone
know if I have
2002 Jan 24
1
Re: coding factor replicates
How about this. Its not as 'swish' as Doug's response, but it might be a
little simpler to think about and adapt...
# show the test vector
> tmp
[1] A B C B C A C B A A B
Levels: A B C
>
# create a named vector to store the number of occurences of each level
> index <- numeric(length=length(levels(tmp)))
> names(index) <- levels(tmp)
> index
A B C
[1] 0 0
2006 Apr 11
3
Robust Search Solution (with CentOS 4.3)
I've got about 10,000 docs I'd like to devise a
search/index for. I found a perl script called
Perlfect that can do that on an old P3 but at the
astronomical time of 7 hours. Another script(cgi/perl)
at hotscripts can do the same but allows the "rm -rf
/" exploit. DoH!?
Is there anything perl/flatfile that can search/index
faster? This is a nice job for an aging P3 in the
2002 Jan 24
0
Re: coding factor replicates
> -----Original Message-----
> From: Douglas Bates [mailto:bates at stat.wisc.edu]
> Sent: Thursday, January 24, 2002 8:55 AM
> To: Uwe Ligges
> Cc: Brad Buchsbaum; r-help at stat.math.ethz.ch
> Subject: Re: [R] Re: coding factor replicates
>
> Douglas Bates <bates at cs.wisc.edu> writes:
>
> > Uwe Ligges <ligges at statistik.uni-dortmund.de>