Displaying 20 results from an estimated 800 matches similar to: "How to search in many database?"
2020 Aug 27
4
Xapian on Android?
Friends,
I would like to hear from anyone who has experience deploying Xapian on Android. I'm new to Xapian, but I know it is used by a couple partners for offline projects on Linux and Windows.
Our small nonprofit, WiderNet, provides off-line access to thousands of Web sites for people who lack Internet connectivity (www.widernet.org). Over 2,000 universities, schools, health care sites,
2012 Jan 20
3
get_docid???
my $mset = $enq->get_mset($nstart,$nrecords);
for(my $mit=$mset->begin(); $mit != $mset->end();$mit++) {
my $doc = $mit->get_document();
my $dat = $doc->get_data();
my $id = $doc->get_docid();
}
[Fri Jan 20 10:35:06 2012] newmail.cgi: Can't locate
auto/Search/Xapian/Document/get_docid.al in @INC (@INC contains:
/etc/perl
2018 Jun 20
2
Welcome to the "Xapian-discuss" mailing list
Hi,
I'm new to Xapian and wanted to know if it has a specific feature. I want
to be able to check the relation between two terms on a page based on how
close they are together on the page. I want to use a combination of n-gram
based labeling and the "slop" feature found in Elasticsearch. Does Xapian
have this/a similar feature? I haven't been able to find any programs that
have
2009 Apr 23
1
PHP Total document
I was also wondering if someone could tell me how to extract the total
number of documents contained in a database via PHP.
Thanks,
Frank
2011 May 30
1
How to check docid
I have a bit of code (Python) to delete a number of documents:
for f in Flist: xapian_store.delete_document(f.pri_key)
in which I am using a unique primary key from an SQL database as the docid for the Xapian database.
The problem I have is that some of the documents may not have been created - so I get an error. Now I could just ignore the error (try-recover), but what would be the
2004 Dec 21
1
Search::Xapian add_database'd search results are odd?
Sorry if this is the wrong forum to discuss Search::Xapian issues --
this just seems like the best place..
Anyways, I've been testing out using $db->add_database() when searching,
and it seems like the docids I'm getting out of it are incorrect, almost
as though they're "double" what they should be (numerically)...
the docids that exist should be around 950,000 and
2020 Feb 19
2
prioritizing aggregated DBs
Olly Betts <olly at survex.com> wrote:
> On Sat, Feb 08, 2020 at 06:04:42PM +0000, Eric Wong wrote:
> > Olly Betts <olly at survex.com> wrote:
> > > On Fri, Feb 07, 2020 at 09:33:08PM +0000, Eric Wong wrote:
> > > > Or would I fiddle with wdf_inc for all ->index_text and ->add_term
> > > > calls on a per-DB basis?
> > >
>
2017 Apr 09
3
Omega: Missing support for newer weighting schemes
On Sun, Apr 09, 2017 at 11:34:07PM +0530, Vivek Pal wrote:
> > Each scheme already has a human-readable name, and Xapian::Registry
> > can map that to an "examplar" object of the right type, so we
> > could take a string like "bm25 1 0.8", see the first word is "bm25"
> > and get a BM25Weight object, then call parse_params("1 0.8") on
2007 Sep 30
1
Perl example of using termitrator?
I'm having trouble translating from C++ to perl objects.
The TermIterator class looks like to get a set of terms in a document
you might have C++ code like:
Enquire::TermIterator termIt =enquire->get_matching_terms_begin(id);
for(;termIt != enquire->get_matching_terms_end(id);termIt++) {
string term = *termIt;
}
Or something similar. However when I attempt to translate that
2013 Oct 30
2
Lucene 3.6.2 backend for xapian (#25)
[Replying to xapian-devel, as I think a wider audience would be useful]
On Mon, Oct 21, 2013 at 11:24:51PM +0800, jiangwen jiang wrote:
> yes, it's less efficient. Lucene database has multiple segments, each
> segment can treat as a independent database. The same term may exists in >=
> 1 segments.
Sorry for taking a while to respond - I've been both busy and mulling
this
2017 Apr 12
4
Omega: Missing support for newer weighting schemes
> Each scheme already has a human-readable name, and Xapian::Registry
> can map that to an "examplar" object of the right type, so we
> could take a string like "bm25 1 0.8", see the first word is "bm25"
> and get a BM25Weight object, then call parse_params("1 0.8") on it to
> create the correct Weight object (broadly similar to how
2008 Jan 04
2
Agents and AddQueueMember
Hi,
I have callcenter running with v 1.2 with AgentCallbackLogin and now
trying to move to 1.4 using the example doc,
doc/queues-with-callback-members.txt. From what I understand the basic
idea in the example is to
1. Authenticate a caller with VMAuthenticate
2. Get his SIP Channel number
3. Use
2013 Aug 26
2
Perl interface isn't working in 1.2.x
On 08/25/2013 05:02 PM, Olly Betts wrote:
> So the simple fix is
> probably just to install the perl-Search-Xapian RPM instead.
Thanks, the Centos 6 repos don't have that rpm and the
http://xapian.org/download page seems to only cover the XS bindings, if
I am reading this correctly:
But I was able to remove the rpm packages and compile and install the
core and swig from source.
2004 Sep 01
1
Omega cgi question
IRIX 6.5.22, compiled omega.0.8.1 with native MIPS compiler.
omega.cgi isn't running from apache. From the log:
[Wed Sep 1 10:04:36 2004] [error] [client 198.149.32.31] malformed
header from script. Bad header=Caught unknown exception: /var/www/cgi-
bin/omega.cgi
This is with a url of http://<machine>/cgi-bin/omega.cgi?DB=/open/
When I run it from the command line with no
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2017 Apr 08
2
Omega: Missing support for newer weighting schemes
On Sat, Apr 08, 2017 at 09:11:22PM +0100, James Aylett wrote:
> On 8 Apr 2017, at 19:15, Vivek Pal <vivekpal.dtu at gmail.com> wrote:
>
> >> and the details of which weighting schemes were available in which version
> >> isn't a key part of the $set command itself.
> >
> > Do you suggest dropping that piece of information out? Since the reason behind
2006 Mar 29
1
Using boolean terms in PHP bindings
OK, I'm indexing my data with the scriptindex. I want to be able to
restrict the search by the category field. Do I need to do anything to
the data itself? Like, literally prefix it with the characters "XC"?
Below is my indexor for scriptindex and the my php code...
document_id : field=ref unique=Q boolean=Q
search_id : field=document_id index=S
document_title : field=title
2017 Apr 13
2
Omega: Missing support for newer weighting schemes
On Mon, Apr 10, 2017 at 11:47:36PM +0530, Vivek Pal wrote:
> > No, use Xapian::Registry to find the weighting scheme from the name
> > like how Weight::unserialise() does (otherwise every caller would need
> > code similar to that above).
>
> Okay, I looked into Xapian::Registry and it seems you are referring to using
> the get_weighting_scheme method? (which expects a
2013 Mar 05
1
Remote database & local database, and adding new weight found vtable error
Hello, guys.
Q1.
now I have load all the docid and its document data into a dictionary for
faster loading data instead of calling
Xapian::MSetIterator i;
i.get_document().get_data();
but I was happened to discover that the dictionaries got by such two method
were different:
both methods use DB1, DB2
method 1:
Xapian::Database db = Xapian::Database(the path of DB1);
Xapian::Database db2 =
2010 Jul 26
2
related documents
Hi All,
I would like to take a doc in the xapian DB and find all related
documents by relevance e.g. so when you view one document it says
"Related entries X Y Z".
I'm aware of the "Morelikethis" Lucene plugin that is supposed to do
something like this, by generating a query from a document based on term
frequency.
Has anyone developed a tool to generate a query from a