Displaying 20 results from an estimated 700 matches similar to: "queries for a set of values"
2024 Apr 26
2
queries for a set of values
On Fri, Apr 26, 2024 at 10:37:37PM +0000, Eric Wong wrote:
> Say I have a bunch of values which I want to filter a query against.
> If I had boolean terms, it could just OP_OR against the whole set.
> IOW, this is what notmuch does with terms:
>
> std::set<std::string> terms;
>
> // notmuch populates terms via terms.insert(*i)...
>
> Query(OP_OR,
2017 Dec 15
5
How to get the serialise score returned in Xapian::KeyMaker->operator().
HI, all,
I am a user of Xapian, and now I have a problem in using it.
After using boolean terms to get some candidates of documents (still too much), we want sorted them by self-defined function which is used in Xapian::KeyMaker->operator(). But how can I get the serialise score in Xapian::MSetIterator object.
c++ code likes this:
class SortKeyMaker : public Xapian::KeyMaker {
std::string
2007 Oct 01
3
How to beat Google aka Xapian & Natural Language Processing.
Xapians!
If tomorrow Xapian search engine would achieved the same performance
and result in searches as Google we would not be able to beat Google,
because we would create only a copy of the searches that already
exists from Google search engine. However there is a way to beat
anyone, and there is a way to beat Google successfully as well just do
not give up. Some see it as implementing Ajax, or
2010 Jun 19
2
Xapian 1.0.21 released
I've uploaded Xapian 1.0.21 (including Search::Xapian 1.0.21.0), which
as usual you can download from:
http://xapian.org/download
The most notable changes in this release are:
Xapian-core API:
* Xapian::Stem now recognises "nb" and "nn" as additional codes for the
Norwegian stemmer.
* Xapian::QueryParser now correctly parses a wildcarded term in between two
other
2017 Jan 20
2
NEAR non-leaf subqueries
Olly Betts writes:
> On Thu, Jan 12, 2017 at 07:53:21PM +0100, Jean-Francois Dockes wrote:
>
> > Recoll also supports multi-word synonyms which could potentially
> > generate PHRASE subqueries inside NEAR queries, but this
> > understandably already did not work with 1.2, so the multi-word
> > expansions are only used when proximity is not involved (by the way,
2017 Jan 12
2
NEAR non-leaf subqueries
Olly Betts writes:
> On Wed, Jan 04, 2017 at 07:29:58AM +0100, Jean-Francois Dockes wrote:
> > Olly Betts writes:
> > > The ticket has a patch which attempts to handle the OR case (which seems
> > > to be the part you actually care about) but this suffers from issues with
> > > object lifetimes which get a bit involved in the details. Since there
>
2006 Nov 15
1
Composition of NEAR and OR
The following piece of code triggers an 'unimplemented' exception with the
message:
"Can't use NEAR/PHRASE with a subexpression containing NEAR or PHRASE"
Xapian::Query or1(Xapian::Query::OP_OR,
Xapian::Query("one"),
Xapian::Query("two"));
Xapian::Query or2(Xapian::Query::OP_OR,
Xapian::Query("three"),
2010 Jan 19
1
QueryParser: aliases and OP_AND
Hello,
I'm wondering about how the QueryParser parses a query containing an "alias" when the default operator is OP_AND
(by "alias", I mean a search field mapped to multiple term prefixes).
With the following php code :
<?php
$parser=new XapianQueryParser();
$parser->set_default_op(XapianQuery::OP_AND);
$parser->add_prefix('alias', 'AUT1:');
2011 Aug 09
3
what is the fastest way to fetch results which are sorted by timestamp ?
what is the fastest way to fetch results which are sorted by timestamp ?
i want to use xapian as my search engine , use add_boolean_term(something) and add_value(0,sortable_serialise(get_timestamp())) to a doc.
search through enquire.set_weighting_scheme(xapian.BoolWeight()) and enquire.set_sort_by_value(0,True) to ensure that the results are sorted by the timestamp.
This method is ok , but
2012 Jun 04
1
Search not finding queries with stop words.
I have a search in perl that looks a bit like:
my $qp = new Search::Xapian::QueryParser();
$qp->set_stemmer(new Search::Xapian::Stem("english"));
$qp->set_stemming_strategy(STEM_SOME);
$qp->set_default_op($defaultop);
...
my $par = $qp->parse_query($query);
my $enq = $xDatabase->enquire( $par );
and in the db create script:
my $stopper =
2011 Mar 07
1
Set Term Frequency for a Query
Hello,
I have a problem when trying to define a query and setting for each term its
"term frequency" with the classical constructor
Xapian::Query<http://xapian.org/docs/apidoc/html/classXapian_1_1Query.html#f396e213df0d8bcffa473a75ebf228d6>(const
std::string &tname_,
2018 Apr 07
3
Database corruption after clean rebuild
Javier Garcia <javiertury at gmail.com> writes:
> I've applied the path to notmuch 0.26.1 without success.
>
> $ rm -rf ~/.mail/.notmuch
> $ LD_LIBRARY_PATH=/hidden-path/notmuch-0.26.1/lib/:$LD_LIBRARY_PATH
> ./notmuch new
> Found 20065 total files (that's not much mail).
> Processed 20065 total files in 58s (341 files/sec.).
> Added 19605 new
2014 Oct 30
2
Does Xapian support retrieval optional?
Hi,
I've been using Xapian for a while. But there is a scene I don't know
whether supported already.
Suppose:
1. Raw query: how to make pizza
2. Parsed query: how AND to AND make AND pizza
3. Documents:
d1: how to make pizza at home
d2: 3 ways to make pizza
d3: make pizza in 4 easy steps
Question:
1. During searching process, how to retrieve d2, d3 (although they don't
2020 Apr 07
2
crash after running notmuch new
Matt <mattator at gmail.com> writes:
> thanks didn't know about xapian-check !
> the output
> ===
> docdata:
> blocksize=8K items=70 firstunused=3 revision=421 levels=0 root=2
> B-tree checked okay
> docdata table structure checked OK
>
> termlist:
> blocksize=8K items=186136 firstunused=62058 revision=421 levels=2 root=12260
> B-tree checked okay
>
2017 Dec 29
2
notmuch: Xapian exception during database creation
Running notmuch from git on Debian testing[1] with the mail and database
sitting on a ZFS filesystem, adding mail to a new database:
> agrajag-testing ~/s/notmuch % ./notmuch new
> Found 605510 total files (that's not much mail).
> add_file: A Xapian exception occurred36m 37s remaining).
> A Xapian exception occurred adding message: Unexpected end of posting list for
2018 Sep 10
3
Notmuch DB Problems
Mueen Nawaz <mueen at nawaz.org> writes:
> After a lot of poking around, I figured out the problem, and this may be
> of interest to the developers (although not sure if it is a xapian issue
> or a notmuch issue).
>
> Here's why it would freeze:
>
> I have a post-new hook that runs a Python script. Depending on whether
> the new email it is processing matches a
2020 Apr 20
4
performance problems with notmuch new
Franz Fellner <alpine.art.de at gmail.com> writes:
> I also suffer from bad performance of notmuch new. I used notmuch
> some years ago and notmuch new always felt instantanious. Had to stop
> using it because internet was too slow to sync my mails :/ Now (with
> better internet and a completely new setup using mbsync) indexing one
> mail takes at least 10 seconds,
2019 Jul 09
2
Transitioning notmuch/Xapian from 32-bit to 64-bit system
Hi!
Suppose you have a huge notmuch/Xapian database, built on a 32-bit system
(well, actually on x86_64-pc-linux-gnu, but using a years old 32-bit
notmuch binary; notmuch 0.9, Xapian 1.2.21 -- don't laugh), and suppose
you're finally going to update that years old notmuch installation
(release by release, forward-porting a bunch of patches). Naturally, I'd
now do a native 64-bit
2017 Sep 12
2
perl bindings to Xapian::Query
QueryParser is great, but I would like to make a query myself, so I can
filter results by a specified value (in this case restricting by epoch
time after a certain value)
My code looks like this, and compiles, and appears like it should work
according to the perl source:
my $query = $qp->parse_query($querystr);
if ($datefilter) {
my $filterepoch = time() - ($datefilter
2020 Apr 24
1
performance problems with notmuch new
On Thu Apr 23 00:21:30 2020, Olly Betts <olly at survex.com> wrote:
> First question: what version of Xapian are you using?
On my laptop it's 1.4.15 (arch linux) and the desktop runs 1.4.14 (Gentoo linux)
> And second thing to check, are you committing each message separately?
No, I sync with mbsync which dosnloads a bunch of mails,
then I run notmuch new which indexes all in