similar to: Indexing more than 15 billion documents

2005 Jul 20
1
docid type redefine
Hello all. I need to redefine the docid type (and all dependent types) like this: typedef unsigned long long docid; I thought it would be enough to edit "include/xapian/types.h", but it isn't. 1) I've added: string om_tostring(unsigned long long val) { CONVERT_TO_STRING("%llu") } in common/utils.{h,cc} 2) In include/enquire.h (line 438) I've found the
2007 Jul 24
1
Xapian::DocNotFoundError on replace_document? (Called from Search::Xapian)
Hello, I'm using Xapian 1.0.2 (flint) and the matching Search::Xapian. I'm getting: terminate called after throwing an instance of 'Xapian::DocNotFoundError', which dumps core. At first it happened after adding my 2nd document (to an empty db, although I don't know if that has any bearing) to the database with a replace_document() call. I shifted the first document off the
2015 Mar 11
2
stub-file and get_doccount
Hello, I switched from one big index to a stub file with many indexes and am running into a problem. I have a tool to fetch a random document via: get_doccount, a random id up to get_doccount, then get_document with that id. After changing to a stub file this fails. Is there a nice way to get a random document from a stub file? Best regards, Felix Ostmann
2013 Aug 27
2
What does collection_freq mean?
Hi, all: I am confused by the concept of collection_freq. There's no information about it at http://xapian.org/docs/glossary.html. What does it mean? Thanks. Regards!
2014 Aug 07
2
Global number of matches
Hi, is there a way to get the number of all possible matches for a specific query? Some API call like: Xapian::doccount Xapian::Enquire::get_number_of_matches(); Best wishes, Wilm
2016 Jul 30
2
Letor: returning MSet after re-ranking
> I'd prefer to avoid adding things to the public API that don't get > used by end users. However because LTR is outside the Xapian build > tree, we can't easily give it privileged access to Xapian internals. > Sorry for the delayed response. The way I was thinking of performing reranking with updated weights was to add a class MSetRanker (basically containing a
2010 Aug 18
16
[PATCH 00 of 16] libxl: autogenerate type definitions and destructor functions
The series introduces auto-generation of the type definitions used in the libxl interface followed by auto-generation of a destructor function for each type. In the future it may be possible to use the related data structures for other purposes, for example auto-generation of the functions to marshal between C and language binding data types. tools/_libxl_types.h should be identical both before
2009 Jan 27
1
Segmentation fault in MSetIterator get_weight
Hi, I'm using Xapian with C# and Mono and I'm getting a segfault in get_weight. When I print the index variable, the value is clearly too high. I think something is writing over it. Do you have any idea how I could trace where the segmentation fault originates? Thanks, -- Yann
2014 Mar 06
2
Regarding GSOC 2014
Sir, I am a 4th-year undergraduate student pursuing my BTech in CSE at IIIT Hyderabad, India. I am interested in applying to Xapian for GSoC 2014. I have gone through this year's ideas page and am interested in applying for the 'posting list encoding improvements' project. I am good at C/C++ and Python, which is one of the requirements. I have gone through the Information Retrieval and
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote: > In other words, is it possible to avoid duplicates if new > documents are inserted into the DB by another process in-between > ->get_mset calls when reusing Xapian::Enquire objects? The Database object itself effectively does (it works in a snapshot of the state of the database when you open it, or last called reopen() which
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new documents are inserted into the DB by another process in-between ->get_mset calls when reusing Xapian::Enquire objects? I do some expensive processing on each mset window, so I always limit the results to limit heap usage even if I'm planning on going through a big chunk of the DB: $mset = $enq->get_mset(0, 1000);
2006 Mar 27
4
Remote databases and daemons
I've looked over the docs on remote backends, the protocol, and a bit of the C++ for doing distributed and remote searches. I've got a couple of questions: * The remote protocol is usable only as a Database, not as a WritableDatabase -- is this correct? So, if I don't want my application to have a copy of the database on the same machine I'll need to write an indexer daemon on
2010 Sep 09
2
[PATCH]: add libxl python binding
Changes since last time: - split auto-generated code into c and h files - un-break the build system - fix ocaml binding due to libxl API change - lots of tidy-ups too numerous to mention Please consider and apply :) -----8<--------------------------------------------------------------- Introduce python binding for libxl. The binding is not yet complete but list_domains,
2014 Dec 18
4
Replace atoi and atol with strtol strtoul:Need Help
Hello, I came across the file omega.cc, which is in the directory xapian-applications/omega/. In this file, atoi is used in the percentage relevance cutoff (line 293); as a percentage lies between 0 and 100, there is no need to modify atoi. But do we need to check for errors? The second use is in collapsing (line 301), in which we collapse a set of documents under a key; the range of this key has not
2007 May 07
0
ISDN with Billion
Hello again. I can't configure the Billion PCI in my ISDN. I want to know if AsteriskNow and the TrixBox LiveCDs configure it automatically. Thanks to all
2007 May 29
0
Billion on Debian Etch
Hello everybody, I have been stuck on the same issue for 20 days and I can't configure it. I want to know if someone has the Billion ISDN card on Debian Etch, because everybody tells me to do one thing, then another, but no one has the same configuration. If someone has the same configuration (Billion + Debian Etch), can you help? Which packages should I install and what steps should I follow? Thanks to all and have a
2018 Oct 31
0
Would RHEL, CentOS, and Fedora Remain Open Source/Free Software After IBM Buys Red Hat for $34 Billion?
On 10/31/18 12:44 AM, Turritopsis Dohrnii Teo En Ming wrote: > Good morning from Singapore, > > This is of paramount importance. Would Red Hat Enterprise Linux (RHEL), CentOS, and Fedora remain open source/free software after IBM buys Red Hat for $34 Billion? yes, because closing the code is the same as burning $34 Billion.
2018 Oct 31
0
Would RHEL, CentOS, and Fedora Remain Open Source/Free Software After IBM Buys Red Hat for $34 Billion?
On 10/30/2018 9:12 PM, Turritopsis Dohrnii Teo En Ming wrote: > Why do you say so? > > On 10/31/18 12:44 AM, Turritopsis Dohrnii Teo En Ming wrote: >> Good morning from Singapore, >> >> This is of paramount importance. Would Red Hat Enterprise Linux (RHEL), CentOS, and Fedora remain open source/free software after IBM buys Red Hat for $34 Billion? > yes, because
2005 Aug 12
4
Billion BRI PCI card
I'm looking at experimenting with Asterisk with an ISDN BRI and ISDN phones (since I have these already). I saw that the Billion card was cheap and could be used in either TE or NT mode. I have the following question, which I couldn't answer by reading through the manual. Maybe someone has experience of using this card and can help me out. When used in NT mode, does the card require
2014 Dec 19
2
Replace atoi and atol with strtol strtoul:Need Help
On Thu, Dec 18, 2014 at 11:38:42PM +0000, James Aylett wrote: > Collapsing is done using a valueno (you can tell because collapse_key > is of type Xapian::valueno), which is a 32 bit unsigned integer (see > here: > https://getting-started-with-xapian.readthedocs.org/en/latest/concepts/indexing/values.html). > For 32 bit unsigned you need unsigned long. Xapian::valueno is actually a