Displaying 20 results from an estimated 200 matches similar to: "Question about Document and TermIterator.get_termfreq()"
2009 Feb 12
1
problem when using xapian's static libs in windows
I have downloaded the source (1.10) from the internet
and built it into a lib.
Then I created a project as the helpdoc said.
I am using VC2005 (VC8).
The source in my test project is as follows (copied from the helpdoc):
#include <xapian.h>
#include <iostream>
using namespace std;
int main(int argc, char **argv)
{
// Simplest possible options parsing: we just require three or more
2009 Jan 27
1
Segmentation fault in MSetIterator get_weight
Hi,
I'm using Xapian with C# and Mono and I'm having a segfault in get_weight.
When I print the index variable, the value is clearly too high.
I think something is writing over it. Do you have any idea how I could
trace where the segmentation fault begins?
Thanks,
--
Yann
2012 Sep 19
1
java-swig TermIterator
Hello,
Been using Xapian and the Java bindings for years, all was working
great, and I all of a sudden decided to upgrade to the latest 1.2.12 and
use the new java-swig bindings instead of the old hand-crafted JNI which
I think have been deprecated now.
I'm struggling with the new design of the TermIterator. More
specifically, I can't tell when I've reached the end of the list of
2005 Feb 25
2
Bug in TermIterator::skip_to() ?
Hi all,
I've been toying with xapian (mostly using the Python bindings) and I
think I've hit a bug in the TermIterator::skip_to() method (or maybe
in QuartzAllTermsList::skip_to()).
I've attached a c++ source file that demonstrates the issue. In short,
if you have a WritableDatabase, ask for the all-terms TermIterator
with db.allterms_begin(), and then skip_to() a word that is itself
2006 Jan 30
1
More than one Index?
Morning All,
I use scriptindex to build my database and the PHP bindings to pull it
all out.
Is it possible to have more than one index but select what the bindings
search on?
So at the moment I index property addresses, I would also like to index
property descriptions for more advanced searching but only as an
optional extra...probably in an extra search box.
Also I would like to analyse the
2023 Aug 28
1
DatabaseModifiedError while iterating on mset
Olly Betts <olly at survex.com> wrote:
> On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> > I'm already retrying the ->get_mset operations; but now I'm
> > wondering where I'd hit DatabaseModifiedErrors while inside a
> > Xapian::MSetIterator loop.
> >
> > I assume ->get_document is a place where it gets thrown;
> > but
2012 Apr 27
4
GSoC xapian node binding
Posting recent offline discussion...
On Fri, Apr 27, 2012 at 10:55 AM, Marius Tibeica <mtibeica at gmail.com> wrote:
> Hi Liam,
>
> I've added the Enquire class and designed a query spec structured as a JS
> object. Hope you like it :)
> I'll probably be off a few days (there is a national holiday Tuesday which
> means i have a long weekend :D) but maybe I'll
2015 Jul 23
1
Get term from document by position
Hello. Is there any FAST way to get a term from a Xapian document by its position, something like
std::string term = Xapian::Document::GetTermByPosition(int position) ?
Below I have described the task that I am trying to solve, in case somebody is interested.
============================================================================
When displaying search results, I would like to
2007 Nov 08
1
Perl make test fails on threads in rhel5
Hi all,
I've tried building RPMs for RHEL5 and hit this problem in Search::Xapian:
make test fails on test 37:
ok 34 - check PositionIterator
ok 35 - create TermIterator
ok 36 - check TermIterator
dubious
Test returned status 0 (wstat 11, 0xb)
DIED. FAILED tests 37-65
Failed 29/65 tests, 55.38% okay
$ xapian-config --version
xapian-config - xapian-core 1.0.4
$ cat
2023 Dec 01
1
termlist_begin ordering in older versions
Hey, I noticed commit 145503bbe4a5bf702cd13cb2e592111e8d7ca89a
(Reimplement Database and WritableDatabase, 2017-10-05) added
the phrase:
"The terms are returned ascending string order (by byte value)"
for termlist_begin. Is that also true for the 1.4 (or even 1.2) series?
Also, is allterms_begin also the same w.r.t. ordering?
Thanks.
2023 Aug 27
1
DatabaseModifiedError while iterating on mset
On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> I'm already retrying the ->get_mset operations; but now I'm
> wondering where I'd hit DatabaseModifiedErrors while inside a
> Xapian::MSetIterator loop.
>
> I assume ->get_document is a place where it gets thrown;
> but once a document is retrieved, can iterating through
> terms in one document
2014 Aug 28
1
xapian golang binding
Hi,
I would like to create a golang binding for xapian. Has anyone made the
necessary swig files for it? If not can someone who has experience with
swig offer tips or ready-made files?
Cheers,
Marius
PS1: A sample of how the binding will be used:
https://gist.github.com/mtibeica/7d2c680046b4bd0da751#file-quickstart-go
PS2: I wrote the nodejs binding for xapian (found on
2023 Aug 23
1
DatabaseModifiedError while iterating on mset
I'm already retrying the ->get_mset operations; but now I'm
wondering where I'd hit DatabaseModifiedErrors while inside a
Xapian::MSetIterator loop.
I assume ->get_document is a place where it gets thrown;
but once a document is retrieved, can iterating through
terms in one document (using TermIterator) also throw DB modified?
I'm dumping multiple terms per-document to a
2010 Oct 21
2
In-memory databases vs PHP Bindings
I can't quite connect the dots on this, perhaps someone can help. I'm
simply trying to create an in-memory database comprising a single document,
so that I can run a load of queries against it and see if any of them match
the new document (this is to enable users to have 'subscriptions' to saved
searches and be alerted every time a new item is published that matches
their
2012 Aug 06
1
Question about serialise
Hi,
Do the serialise methods of Document and Query produce a valid UTF-8 string?
My problem is with the Node.JS binding. I serialise an object, convert the
string to a javascript string, convert the javascript string to a std
string and then try to unserialise. The error I get from Xapian is "Bad
encoded length: no data".
Thanks,
Marius
2011 Mar 07
1
Set Term Frequency for a Query
Hello,
I have a problem when trying to define a query and set each term's
"term frequency" with the classical constructor
Xapian::Query<http://xapian.org/docs/apidoc/html/classXapian_1_1Query.html#f396e213df0d8bcffa473a75ebf228d6>(const
std::string &tname_,
2013 Oct 30
2
Lucene 3.6.2 backend for xapian (#25)
[Replying to xapian-devel, as I think a wider audience would be useful]
On Mon, Oct 21, 2013 at 11:24:51PM +0800, jiangwen jiang wrote:
> Yes, it's less efficient. A Lucene database has multiple segments, and each
> segment can be treated as an independent database. The same term may exist
> in >= 1 segments.
Sorry for taking a while to respond - I've been both busy and mulling
this
2016 May 05
2
GSoC 2016 - Introduction
Hello,
Thanks James for the reply. That cleared a few things up. Apologies for
replying late because of exams going on.
I was going through the previous clustering API to understand how it worked,
and it seems that the approach for constructing the termlists which
are used for distance metrics uses TF-IDF weighting with cosine similarity,
which is very similar to the approach I would need
2007 Sep 30
1
Perl example of using termitrator?
I'm having trouble translating from C++ to Perl objects.
From the TermIterator class, it looks like to get a set of terms in a document
you might have C++ code like:
Enquire::TermIterator termIt = enquire->get_matching_terms_begin(id);
for (; termIt != enquire->get_matching_terms_end(id); termIt++) {
    string term = *termIt;
}
Or something similar. However when I attempt to translate that
2006 Mar 02
2
Different Collation (utf8_slovak_ci, utf8_danish_ci, latin1_german1_ci) etc.
One issue left for me to figure out is that in different languages there are
different characters and Xapian takes only English characters.
Therefore many words entered by users that contain their own language's special
characters will not return any results. MySQL offers different collations ...
Now when I see how much data Xapian can search perhaps I could expand my
index spider different European