Displaying 20 results from an estimated 100 matches similar to: "Indexing more than 15 billion documents"
2005 Jul 20
1
docid type redifine
Hello all.
I need to redefine a docid type (and all dependent types) like this: typedef unsigned long long docid;
I think it would be enough to edit "include/xapian/types.h", but it isn't so.
1) I've added :
string
om_tostring(unsigned long long val)
{
CONVERT_TO_STRING("%llu")
}
in common/utils.{h,cc}
2) In include/enquire.h (line 438) I've found the
2007 Jul 24
1
Xapian::DocNotFoundError on replace_document? (Called from Search::Xapian)
Hello,
I'm using Xapian 1.0.2 (flint) and matching Search::Xapian.
I'm getting:
terminate called after throwing an instance of
'Xapian::DocNotFoundError', which dumps core.
at first it was after adding my 2nd document (to an empty db, although
I don't know if that has any bearing) to the database with a
replace_document() call.
I shifted the first document off the
2015 Mar 11
2
stub-file and get_doccount
Hello,
I switched from one big index to a stub file with many indexes and am running
into a problem.
I have a tool to fetch a random document via:
get_doccount
random id up to get_doccount
get_document with that id
After changing to a stub file this fails. Is there a nice way to get a
random document from a stub file?
Best regards,
Felix Ostmann
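A likely reason the approach above breaks with a stub file: to my understanding, when Xapian combines several databases it interleaves their document ids, so the combined ids are not a dense range from 1 to get_doccount(). The sketch below models that mapping in plain C++ without calling Xapian. The function name combined_docid is hypothetical, and the formula is an assumption about the interleaving scheme rather than something stated in the post.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model (not a Xapian API call): the docid a document would get
// in the combined database, assuming ids are interleaved across shards.
//   shard_docid: 1-based docid inside its own shard
//   shard_index: 0-based position of the shard in the stub file
//   num_shards:  number of databases listed in the stub file
std::uint64_t combined_docid(std::uint64_t shard_docid,
                             std::uint64_t shard_index,
                             std::uint64_t num_shards) {
    assert(num_shards > 0 && shard_index < num_shards && shard_docid >= 1);
    return (shard_docid - 1) * num_shards + shard_index + 1;
}
```

Under this model, two shards holding 10 documents and 1 document give a document count of 11, yet the highest combined docid is combined_docid(10, 0, 2) = 19 and ids such as 4 and 6 do not exist. One possible workaround is to draw the random id from the range up to get_lastdocid() and retry whenever get_document() throws Xapian::DocNotFoundError.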
2013 Aug 27
2
What does collection_freq mean?
Hi, all:
I am confused by the concept of collection_freq.
There's no information about it on http://xapian.org/docs/glossary.html
What does it mean?
Thanks
Regards!
2014 Aug 07
2
Global number of matches
Hi,
Is there a way to get the number of all possible matches for a specific
query?
Some API call like
Xapian::doccount Xapian::Enquire::get_number_of_matches();
Best wishes
Wilm
2016 Jul 30
2
Letor: returning MSet after re-ranking
>
>
> I'd prefer to avoid adding things to the public API that don't get
> used by end users. However because LTR is outside the Xapian build
> tree, we can't easily give it privileged access to Xapian internals.
>
Sorry for a delayed response.
The way I was thinking of performing reranking with updated weights was to
add a class MSetRanker (basically containing a
2010 Aug 18
16
[PATCH 00 of 16] libxl: autogenerate type definitions and destructor functions
The series introduces auto-generation of the type definitions used in
the libxl interface followed by auto-generation of a destructor
function for each type. In the future it may be possible to use the
related data structures for other purposes, for example auto-generation
of the functions to marshal between C and language binding data types.
tools/_libxl_types.h should be identical both before
2009 Jan 27
1
Segmentation fault in MSetIterator get_weight
Hi,
I'm using Xapian with C# and Mono and I'm getting a segfault in get_weight.
When I print the index variable, the value is clearly too high.
I think something is writing over it. Do you have any idea how I could
trace the origin of the segmentation fault?
Thanks,
--
Yann
2014 Mar 06
2
Regarding GSOC 2014
Sir,
I am a 4th-year undergraduate student pursuing my BTech in CSE at IIIT
Hyderabad, India.
I am interested in applying for Xapian in GSoC 2014. I have gone through
this year's ideas page and am interested in applying for the 'posting list encoding
improvements' project.
I am good at C/C++ and Python, which is one of the requirements. I have gone
through the Information Retrieval and
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2006 Mar 27
4
Remote databases and daemons
I've looked over the docs on remote backends, the protocol, and a bit
of the c++ for doing distributed and remote searches. I've got a
couple of questions:
* The remote protocol is usable only as a Database, not as a
WriteableDatabase -- is this correct? So, if I don't want my
application to have a copy of the database on the same machine I'll
need to write an indexer daemon on
2010 Sep 09
2
[PATCH]: add libxl python binding
Changes since last time:
- split auto-generated code into c and h files
- un-break the build system
- fix ocaml binding due to libxl API change
- lots of tidy-ups too numerous to mention
Please consider and apply :)
-----8<---------------------------------------------------------------
Introduce python binding for libxl. The binding is not yet complete but
list_domains,
2014 Dec 18
4
Replace atoi and atol with strtol strtoul:Need Help
Hello,
I came across the file *omega.cc*, which is in the directory
*xapian-applications/omega/*.
In this file, atoi is used in the *percentage relevance cutoff* (line 293).
As the percentage lies between 0 and 100, there is no need to replace atoi. But do
we need to check for errors?
The second use is in *collapsing* (line 301), in which we collapse a set of
documents under a key; the range of this key has not
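For readers wondering what "checking for errors" could look like when atoi is replaced, here is a minimal sketch using std::strtoul. The helper name parse_uint32 and its bool-plus-out-parameter shape are hypothetical and not taken from omega.cc; the 32-bit target range is an assumption chosen for illustration.

```cpp
#include <cerrno>
#include <cstdint>
#include <cstdlib>
#include <limits>

// Hypothetical helper (not from omega.cc): parse a decimal string into a
// 32-bit unsigned value, returning false on any input atoi would silently
// accept or mangle. `out` is only written on success.
bool parse_uint32(const char* text, std::uint32_t& out) {
    // Reject null or empty input, and anything not starting with a digit.
    // strtoul itself would skip whitespace and accept a sign such as "-1".
    if (text == nullptr || *text < '0' || *text > '9') return false;

    errno = 0;
    char* end = nullptr;
    unsigned long value = std::strtoul(text, &end, 10);

    if (errno == ERANGE) return false;   // overflowed unsigned long
    if (*end != '\0') return false;      // trailing garbage such as "12x"
    if (value > std::numeric_limits<std::uint32_t>::max()) return false;

    out = static_cast<std::uint32_t>(value);
    return true;
}
```

A caller would check the return value before using the result, for example `std::uint32_t key; if (parse_uint32(arg, key)) { ... }`, and fall back to a default or report an error otherwise.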
2007 May 07
0
ISDN with Billion
Hello again.
I can't configure the Billion PCI in my ISDN.
I want to know if AsteriskNow and the TrixBox LiveCDs configure it automatically.
Thanks to all
2007 May 29
0
Billion on Debian Etch
Hello everybody, I have been stuck on the same problem for 20 days and I can't configure the card.
I want to know if someone has the Billion ISDN card on Debian Etch,
because everybody tells me to do one thing, then another, but no one has the
same configuration.
If someone has the same configuration (Billion + Debian Etch), can you help?
Which packages should I install, and which steps should I follow?
Thanks to all and have a
2018 Oct 31
0
Would RHEL, CentOS, and Fedora Remain Open Source/Free Software After IBM Buys Red Hat for $34 Billion?
On 10/31/18 12:44 AM, Turritopsis Dohrnii Teo En Ming wrote:
> Good morning from Singapore,
>
> This is of paramount importance. Would Red Hat Enterprise Linux (RHEL), CentOS, and Fedora remain open source/free software after IBM buys Red Hat for $34 Billion?
yes, because closing the code is the same as burning $34 Billion.
2018 Oct 31
0
Would RHEL, CentOS, and Fedora Remain Open Source/Free Software After IBM Buys Red Hat for $34 Billion?
On 10/30/2018 9:12 PM, Turritopsis Dohrnii Teo En Ming wrote:
> Why do you say so?
>
> On 10/31/18 12:44 AM, Turritopsis Dohrnii Teo En Ming wrote:
>> Good morning from Singapore,
>>
>> This is of paramount importance. Would Red Hat Enterprise Linux (RHEL), CentOS, and Fedora remain open source/free software after IBM buys Red Hat for $34 Billion?
> yes, because
2014 Dec 19
2
Replace atoi and atol with strtol strtoul:Need Help
On Thu, Dec 18, 2014 at 11:38:42PM +0000, James Aylett wrote:
> Collapsing is done using a valueno (you can tell because collapse_key
> is of type Xapian::valueno), which is a 32 bit unsigned integer (see
> here:
> https://getting-started-with-xapian.readthedocs.org/en/latest/concepts/indexing/values.html).
> For 32 bit unsigned you need unsigned long.
Xapian::valueno is actually a
2005 Aug 12
4
Billion BRI PCI card
I'm looking at experimenting with asterisk with an ISDN BRI and ISDN
phones (since I have these already).
I saw that the Billion card was cheap and could be used in either TE or
NT modes.
I have the following question which I couldn't answer by reading through
the manual. Maybe someone has experience of using this card and can help
me out.
when using in NT mode does the card require