Displaying 20 results from an estimated 5000 matches similar to: "Finding Max Possible Weight of a Document"
2014 Apr 13
2
Adding an external library to Xapian
My code is not on Github. I am using the tarball as of now. The following
it the error that occurred:
http://pastebin.com/cVJrjUZX
On Sun, Apr 13, 2014 at 8:16 PM, James Aylett <james-xapian at tartarus.org>wrote:
> On 13 Apr 2014, at 15:37, Pallavi Gudipati <pallavigudipati at gmail.com>
> wrote:
>
> > A linker error is encountered even after following the above
2009 Jan 27
1
Segmentation fault in MSetIterator get_weight
Hi,
I'm using xapian with c# and mono and i'm having a segfault in get_weight.
When i print the index variable, the value is clearly too high.
I think something write over it. Do you have any idea on how i could
trace the beginning of the segmentation fault ?
Thanks,
--
Yann
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2010 Aug 28
1
Matchspy and faceting
Have been working recently on a site that classifies posts using tags in
taxonomies, so a post about the Oil spill in the Gulf of Mexico might be
tagged 'Subscribers only' (access level), 'Barack Obama' (person), 'Tony
Hayward' (person), 'BP' (company), 'Transocean' (company), 'Gulf of
Mexico' (location).
With some advice from Richard Boulton I
2010 Dec 16
1
Enquire::get_mset() "first" and MatchDecider
Hello,
This is a somewhat idle question about Enquire::get_mset(), possibly based
on wrong assumptions on how Xapian works, but I would be grateful if
someone satisfied my curiosity.
Xapian::MSet Xapian::Enquire::get_mset(Xapian::doccount first,
Xapian::doccount maxitems,
const Xapian::RSet * omrset = 0,
const
2007 Oct 16
1
Matches estimate varies with sorting method
Hi all,
I found that the figure returned by MSet::get_matches_estimated() varies
depending on how results are to be sorted.
For instance, in my index, value 4 contains date and time in the format
"yyyymmddhhmmss". For the same query, the number of results will be
estimated to 20000+ when results are first sorted by date and time
with set_sort_by_value_then_relevance(4) and to only 100
2013 Feb 20
1
Sticky results
Hi there,
I have a xapian index whose results are being sorted by a value, with (PHP bindings):
$enquire->set_sort_by_value($sort_data_value);
This is because I want the results returned in chronological order of publication date. However, I now have a need to have certain results be 'sticky' at the top of the resultset, regardless of their publication date. Obviously there are
2013 Jan 17
1
FASTER Search
I am suffering for slow searching performance on Xapian.
I am using Xapian for indexing about 150,000,000 documents.
It was implemented in C++;
The performance of searching was not that fast.
e.g. Searching a query, which includes about 20 terms, needs 2 secs avg.
For searching, I followed such steps:
1. construct a QueryParser for certain string
2. parse the query to get a Xapian::Query
2013 Oct 23
2
performance on document.get_data()
I got some performance issue for document.get_data() and
enquire.get_mset(). It costs 35 seconds for matches =
enquire.get_mset(0,200), and 3 seconds for iterating all doc in matches to
get_data. Is't normal? My index contains 30millions documents. I use python
binding to operate xapian. Bellow it's my index structure
# value: 0:date, 1:site
# data: json message which contains: author,
2008 Sep 27
3
Query::MatchAll
Why there still been rank when using Query::MatchAll() ?
2018 Mar 30
2
sorting large msets
Hello, is there a way to optimize sorting by certain values
for queries which return a huge amount of results?
For example, I just want a simple query that gives me the 200
most recent emails out of millions. The elapsed time for
get_mset increases as the number of documents ($n * 2000)
increases.
I suppose I could store a pre-sorted set using SQLite or
similar. Thanks in advance for any
2023 Aug 28
1
DatabaseModifiedError while iterating on mset
Olly Betts <olly at survex.com> wrote:
> On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> > I'm already retrying the ->get_mset operations; but now I'm
> > wondering where I'd hit DatabaseModifiedErrors while inside a
> > Xapian::MSetIterator loop.
> >
> > I assume ->get_document is a place where it gets thrown;
> > but
2023 Aug 27
1
DatabaseModifiedError while iterating on mset
On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> I'm already retrying the ->get_mset operations; but now I'm
> wondering where I'd hit DatabaseModifiedErrors while inside a
> Xapian::MSetIterator loop.
>
> I assume ->get_document is a place where it gets thrown;
> but once a document is retrieved, can iterating through
> terms in one document
2007 Sep 20
3
Incorrect get_matches_estimated() of Xapian::Mset
Hello, As I know, get_matches_estimated() return an estimate for the number of documents with matches the query.
But now, I found it get a disparity between the return value and real mathced number. For an example: the real matched number is 58, but the return value is 458; so when the users click the hinder page, get a blank page ... so they often complain to me.
I found that the main reason is
2014 Mar 26
3
about sort_by_value
Hello, I have found that the use of sort_by_value very slow.
16800 result, return to the previous 10, sorting takes about 25ms.
And if you do not sort, returns 10, need only about 0.3ms.
How to make the sort faster?
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
2013 Aug 26
2
Perl interface isn't working in 1.2.x
On 08/25/2013 05:02 PM, Olly Betts wrote:
> So the simple fix is
> probably just to install the perl-Search-Xapian RPM instead.
Thanks, the Centos 6 repos don't have that rpm and the
http://xapian.org/download page seems to only cover the XS bindings, if
I am reading this correctly:
But I was able to remove the rpm packages and compile and install the
core and swig from source.
2014 Apr 13
2
Adding an external library to Xapian
We are using the --enable-maintainer-mode and will move to git soon.
The diff file is attached.
*Siddhant Mutha*
Undergraduate Student
Department of Computer Science and Engineering
IIT Madras
Chennai
http://www.siddhantmutha.com/ <http:/www.siddhantmutha.com/>
On Sun, Apr 13, 2014 at 8:26 PM, James Aylett <james-xapian at tartarus.org>wrote:
> On 13 Apr 2014, at 15:48, Pallavi
2013 Jun 19
2
Compact databases and removing stale records at the same time
I'm trying to compact (or at least merge) multiple databases, while stripping search records which are no longer required.
Backstory:
I've inherited the Cyrus IMAPd xapian-based search code from Greg Banks when he left Opera.
One of the unfinished parts was removing expunged emails from the search database.
We moved from having a single search database to supporting multiple
2011 Feb 11
2
PHP Binding and SWIG Director for XapianMatchDecider
Hi,
I try to use XapianMatchDecider to filtering results, but the code :
$Decider = new MatchDecider_Search($XapianDatabase->_cPtr);
$mset = $XapEnquire->get_mset(0, 3000, null, $Decider);
Always return :
PHP Fatal error: No matching function for overloaded 'Enquire_get_mset'
in xapian.php on line 1082
I saw that SWIG was not supporting Directors for PHP, but now, it seems
to