Displaying 20 results from an estimated 3000 matches similar to: "Using special characters in query terms"
2017 Feb 08
1
searching for " in phrase and other special chars
Hello,
I'm reading xapian-core/docs/queryparser.rst and haven't been
able to find a way to escape " (double-quote) inside quoted
phrases.
Is this possible?
I'm also wondering if searching for other special characters,
such as a literal '*', is possible without triggering a wildcard
match. It would be helpful for some source code searches.
Thanks!
2008 Mar 27
2
Proper noun stemming
Hi All
I was wondering if anyone had a solution for the following problem.
I use QueryParser to stem my documents before adding them to a
database. During the stemming process I would like to find a way of
keeping proper nouns that span two or more words together as a phrase.
For example "New York" or "Gordon Brown" or "Prime Minister" get split
up. I see
2007 May 30
1
QueryParser prefixing terms when stemming?
I'm new to Xapian and we just recently upgraded to version 1.0.0.0.
However, something seems to have changed during the upgrade and I
need help figuring out how my code should be written.
In version 0.9.9.1 of Search::Xapian, the following code results in
this output "Xapian::Query(pet:(pos=1))".
my $qp = new Search::Xapian::QueryParser;
$qp->set_stemmer(new
2007 Nov 14
1
Problem indexing text with spelling enabled in Perl
Hi All,
I'm using the TermGenerator::index_text() on version 1.0.4 with the
FLAG_SPELLING turned on, because the new spelling suggestion stuff
seems awesome, but I'm getting a segv.
(gdb) bt
#0 0xb7ae153c in Xapian::WritableDatabase::add_spelling
(this=0xa553988, word=@0xbff97724, freqinc=1) at ./include/xapian/
base.h:154
#1 0xb7becf47 in
2007 Jun 12
1
Empty results OMEGA with XAPIAN 1.0.1
Hi,
I configured XAPIAN 1.0.1 and OMEGA 1.0.1. on my development machine
(first removed the old ones). I recreated my databases (both quartz
and flint) and tried to run original queries against the databases
created by the new versions.
I'm getting empty result sets from OMEGA. If I use the delve tool I
actually see that the records are created fine. No log files are
written as far as I
2015 Jul 26
1
Get term from document by position
mple (see attachment).
>
> Attachments get stripped out by the mailing list, so I've made a private gist of the two files here: <https://gist.github.com/jaylett/ce8455b37e2b84422346>.
>
> Actually, when I run it I get 0 matches, which would explain why you're just getting the start of the document. However if I adjust things (match the stemming strategy for TermGenerator to
2011 Sep 14
1
Integrated Chinese tokenizer SCWS in xapian-core
Xapian is an excellent open source search engine library, but there is no native support for Chinese word segmentation in queryparser and termgenerator.
Therefore, I modified a small amount of source code to integrate the SCWS tokenizer, which is also open source and was developed by myself.
Anyone can obtain the patch from below URL. After patching, Xapian::QueryParser::parse_query and
2007 Dec 29
3
Term-Flags
Hi,
Is it necessary to set the flag below on the TermGenerator,
if I want the "Did you mean ..." spelling corrections?
Xapian::TermGenerator::flags::FLAG_SPELLING
Thank you very much
Markus
2010 Nov 02
1
How to make QueryParser select entire word like "H.O.T"
Hi,
I'm using xapian to build my search engine, but met with a problem.
The code snippet is like:
----------------------Code begin-------------------------------------------------------------
Xapian::QueryParser qp;
qp.add_prefix("Singer", "S");
Xapian::Query query = qp.parse_query("Singer:s.h.e",
2016 Dec 29
2
Formulating Advanced Queries with Xapian-Omega
To Olly Betts:
Thank you very much for any feedback.
I apologise for this belated reply and also for the fact that the text
of the previous posting appeared fragmented, due to its fixed
chars/line format.
With reference to:
> Can, or could, one construct a query so that Omega (Xapian) can handle
> this ?
>
> ... perhaps with some type of Regex ?
>
> It would seem
2007 Jun 28
1
TermGenerator and SimpleStopper
Hi,
I'm using SimpleStopper with TermGenerator in a Python indexing
script, in an attempt to keep my index size down (currently 30K per
doc, and I have 200 million docs to index, which I think implies
6TB.) However, unprefixed (positional?) terms are not affected by
the stopper, though Z-prefixed terms are.
I assume this is intentional for phrase queries, but I need to reduce
my
2018 Sep 30
1
xapian parser bug?
On Sun, Sep 30, 2018 at 09:05:25AM -0300, David Bremner wrote:
> if (str.find (' ') != std::string::npos)
> query_str = '"' + str + '"';
> else
> query_str = str;
>
> return parser.parse_query (query_str, NOTMUCH_QUERY_PARSER_FLAGS, term_prefix);
I wouldn't recommend trying to generate strings to feed to
2007 Mar 28
2
Moving indextext.cc into core.
One of the items on the ToDo list for version 1.0 at
http://wiki.xapian.org/TodoFor1_2e0#preview is:
"Rework Omega's indextext.cc as a xapian-core "TextSplitter" class."
I've been wondering about this for a while now. Currently, we have the
Query Parser in Xapian core, but no text processing. Clearly, it makes
sense to have a "text splitter" class in
2015 Jul 26
1
Get term from document by position
> Can you file a bug with some example outputs that are unrelated to the search string?
Here is the example (see attachment).
This example does the following:
1) First, it indexes text from the "text.txt" file (see attachment) (actually, this is the text of the following book: "Abbas, Lichtman. Basic immunology").
2) Next, it searches for the "extracellular
2010 Jun 09
1
TermGenerator incorrectly tokenizes German text which contains special characters
Dear Xapian users,
I try to index some German text with Xapian using the xapian_php bindings. I
run Apache 2.2 on Windows using PHP 5.2.13 with the prebuilt xapian
bindings from Flax:
Xapian Support enabled Xapian
Compiled Version @PACKAGE_VERSION@
Xapian Linked Version 1.2.0
The problem is that after indexing text which contains special characters
like ä, ö, ü and ß, using
2017 Mar 31
3
xapian bug tracker registration email is not working
I tried to register yesterday, twice with 2 email addresses. The website
said it sent me mail, I never got a mail. Someone else, please confirm
this is broken.
A bug I'd like to file is that I'd like an option to search
*word. Currently you can only search word*. The example where I would
find this useful is searching for urls and email addresses. For example,
I want to find emails from
2007 Jan 17
7
removing special/syntax characters
Is there any somewhat standard way to remove or otherwise handle
special or syntax characters from a user's search, such as a colon?
I was thinking maybe there was something akin to
Ferret::Analysis::FULL_ENGLISH_STOP_WORDS, like
Ferret::Analysis::FERRET_SYNTAX_CHARS, but no such luck.
How are other folks dealing with filtering user input?
John
2007 Dec 17
1
Crashes with spelling enabled and perl.
Hi Guys,
Here's a simple test case that causes a segfault with the perl
bindings patched to enable spelling correction:
use strict;
use warnings;
use Search::Xapian;
my $db = Search::Xapian::WritableDatabase->new("test.db",
Search::Xapian::DB_CREATE_OR_OPEN);
if (!defined($db)) {
die("Failed to open xapian_database: $!");
}
my $indexer =
2008 Jan 16
2
Escaping special characters :, (, ), [, ], {, }, !, +, ", ~, ^, -, |, <, >, =, *, ?, \
Hello,
I am trying to escape words for searching i.e., "hello". The key here
is that the two L's on "hello" are actually vertical bars. Is there
a special function in Ferret or anywhere for that matter that will do
the escaping of the Ferret special characters?
Thanks in advance,
Benjamin
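
For readers landing on this thread, here is a minimal sketch of the kind of escaping being asked about. It is written in Python rather than Ruby and does not use Ferret itself. The helper name `escape_query_term` is hypothetical, and the idea that a backslash prefix is the right escape for each character is an assumption to check against the query parser actually in use. The character set is taken from the thread title.

```python
# Characters listed in the thread title. Whether a given query parser
# honors a backslash escape for each of them is an assumption to verify.
SPECIAL_CHARS = set(':()[]{}!+"~^-|<>=*?\\')


def escape_query_term(term: str) -> str:
    """Return `term` with a backslash placed before every special character."""
    return "".join("\\" + ch if ch in SPECIAL_CHARS else ch for ch in term)


if __name__ == "__main__":
    # A word whose two L's are really vertical bars, as in the question.
    print(escape_query_term("he||o"))
    # A colon and a trailing wildcard character.
    print(escape_query_term("a:b*"))
```

The backslash itself is in the set, so text that already contains backslashes is escaped consistently instead of being interpreted as a partial escape sequence.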