similar to: Indexing more than 15 billion documents

Displaying 20 results from an estimated 100 matches similar to: "Indexing more than 15 billion documents"

2005 Jul 20
1
docid type redifine
Hello all. I need to redefine a docid type (and all dependent types) like this: typedef unsigned long long docid; I think it would be enough to edit "include/xapian/types.h", but it isn't so. 1) I've added : string om_tostring(unsigned long long val) { CONVERT_TO_STRING("%llu") } in common/utils.{h,cc} 2) In include/enquire.h (line 438) I've found the
2007 Jul 24
1
Xapian::DocNotFoundError on replace_document? (Called from Search::Xapian)
Hello, I'm using Xapian 1.0.2 (flint) and matching Search::Xapian. I'm getting: terminate called after throwing an instance of 'Xapian::DocNotFoundError', which dumps core. at first it was after adding my 2nd document (to an empty db, although I don't know if that has any bearing) to the database with a replace_document() call. I shifted the first document off the
2015 Mar 11
2
stub-file and get_doccount
Hello, i switched from one big index to a stub file with many indexes and running into a problem. i have a tool to fetch a random document via: get_doccount random id up to get_doccount get_document with that id after changing to stub file this failes. Is there a nice way to get a random document from a stub file? ?MfG? Felix Ostmann
2013 Aug 27
2
What does collection_freq means?
Hi, all: I am confused with the concept of colletion_freq There's no informations about it on http://xapian.org/docs/glossary.html What does it means? Thanks Regards! -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130827/370cc6a3/attachment.html>
2014 Aug 07
2
Global number of matches
Hi, is there a way to get the number of all possible matches for a specific query? Some api call like Xapian::doccount Xapian::Enquire::get_number_of_matches(); Best wishes Wilm
2016 Jul 30
2
Letor: returning MSet after re-ranking
> > > I'd prefer to avoid adding things to the public API that don't get > used by end users. However because LTR is outside the Xapian build > tree, we can't easily give it privileged access to Xapian internals. > Sorry for a delayed response. The way I was thinking of performing reranking with updated weights was to add a class MSetRanker (basically containing a
2010 Aug 18
16
[PATCH 00 of 16] libxl: autogenerate type definitions and destructor functions
The series introduces auto-generation of the type definitions used in the libxl interface followed by auto-generation of a destructor function for each type. In the future it may be possible to use the related data structures for other purposes, for example auto-generation of the functions to marshal between C and language binding data types. tools/_libxl_types.h should be identical both before
2009 Jan 27
1
Segmentation fault in MSetIterator get_weight
Hi, I'm using xapian with c# and mono and i'm having a segfault in get_weight. When i print the index variable, the value is clearly too high. I think something write over it. Do you have any idea on how i could trace the beginning of the segmentation fault ? Thanks, -- Yann
2014 Mar 06
2
Regarding GSOC 2014
Sir, I am a 4th yr undergraduate student pursuing my BTech in CSE at IIIT Hyderbad, India. I am interested in applying for Xapian in Gsoc 2014. I had gone through this year's idea page and interested in applying for 'posting list encoding improvements' project. I am good at C/C++,python; which is one of the requirement. I had done gone through the information Retrieval and
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote: > In other words, is it possible to avoid duplicates if new > documents are inserted into the DB by another process in-between > ->get_mset calls when reusing Xapian::Enquire objects? The Database object itself effectively does (it works in a snapshot of the state of the database when you open it, or last called reopen() which
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new documents are inserted into the DB by another process in-between ->get_mset calls when reusing Xapian::Enquire objects? I do some expensive processing on each mset window, so I always limit the results to limit heap usage even if I'm planning on going through a big chunk of the DB: $mset = $enq->get_mset(0, 1000);
2006 Mar 27
4
Remote databases and daemons
I've looked over the docs on remote backends, the protocol, and a bit of the c++ for doing distributed and remote searches. I've got a couple of questions: * The remote protocol is usable only as a Database, not as a WriteableDatabase -- is this correct? So, if I don't want my application to have a copy of the database on the same machine I'll need to write an indexer daemon on
2010 Sep 09
2
[PATCH]: add libxl python binding
Changes since last time: - split auto-generated code in to c and h files - un-break the build system - fix ocaml binding due to libxl API change - lot''s of tidy-ups too numerous to mention Please consider and apply :) -----8<--------------------------------------------------------------- Introduce python binding for libxl. The binding is not yet complete but list_domains,
2014 Dec 18
4
Replace atoi and atol with strtol strtoul:Need Help
Hello, I came across the file *omega.cc* which is in directory* xapain-application/omega/* In this file , atoi is used in *Percentage Relevance cutoff *(293 line no) as Percentage lies between 0-100 their is no need to modify atoi . But do we need to check for error's ? Second Implementation is in *collapsing* (301) in which we collapse set of document under a key,range of this key has not
2007 May 07
0
ISDN with Billion
Hello again. I can't configure the Billion PCI in my ISDN. I want to know if AsteriskNow and the TrixBox LiveCDs configure it automatically. Thanks to all -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.digium.com/pipermail/asterisk-users/attachments/20070507/8882b000/attachment.htm
2007 May 29
0
Billion on Debian Etch
Hello everybody, I am 20 days with the same item and I can't configure it. I want to know if someone has the Billion ISDN card on a Debian Etch, because everybody tells me to do that, then the other one but no one has the same configuration. If some one has the same configuration (Billion + Debian Etc), can you help? What packages install and what steps continue. Thanks to all and have a
2018 Oct 31
0
Would RHEL, CentOS, and Fedora Remain Open Source/Free Software After IBM Buys Red Hat for $34 Billion?
On 10/31/18 12:44 AM, Turritopsis Dohrnii Teo En Ming wrote: > Good morning from Singapore, > > This is of paramount importance. Would Red Hat Enterprise Linux (RHEL), CentOS, and Fedora remain open source/free software after IBM buys Red Hat for $34 Billion? yes, because closing the code is the same as burning $34 Billion.
2018 Oct 31
0
Would RHEL, CentOS, and Fedora Remain Open Source/Free Software After IBM Buys Red Hat for $34 Billion?
On 10/30/2018 9:12 PM, Turritopsis Dohrnii Teo En Ming wrote: > Why do you say so? > > On 10/31/18 12:44 AM, Turritopsis Dohrnii Teo En Ming wrote: >> Good morning from Singapore, >> >> This is of paramount importance. Would Red Hat Enterprise Linux (RHEL), CentOS, and Fedora remain open source/free software after IBM buys Red Hat for $34 Billion? > yes, because
2014 Dec 19
2
Replace atoi and atol with strtol strtoul:Need Help
On Thu, Dec 18, 2014 at 11:38:42PM +0000, James Aylett wrote: > Collapsing is done using a valueno (you can tell because collapse_key > is of type Xapian::valueno), which is a 32 bit unsigned integer (see > here: > https://getting-started-with-xapian.readthedocs.org/en/latest/concepts/indexing/values.html). > For 32 bit unsigned you need unsigned long. Xapian::valueno is actually a
2005 Aug 12
4
Billion BRI PCI card
I'm looking at experimenting with asterisk with an ISDN BRI and ISDN phones (since I have these already). I saw that the Billion card was cheap and could be used in either TE or NT modes. I have the following question which I couldn't answer by reading through the manual. Maybe someone has experience of using this card and can help me out. when using in NT mode does the card require