Displaying 20 results from an estimated 100 matches similar to: "understanding stemming and synonyms"
2012 Jan 20
2
Perl version of sortable_serialize missing?
I attempted to use the sortable_serialize function from perl, however
doesn't seem to exist. The only occurrence of the string "sortable" in
the /usr/local/perl/5.10.1/Search/ tree is in the pod in Xapian.pm.
What am I doing wrong?
use Search::Xapian;
...
$doc->add_value(4,sortable_serialize($recdate));
Undefined subroutine &main::sortable_serialize called
2011 Jul 28
1
Xapian and Synonyms
Hi guys
I've just had a thought about something we do with our search on
ReportBuyer.com: we cater for both American and British English in our
searches and we have had plans for a while now to implement something
that allows users to find 'colour' and 'color', 'tap' and 'faucet' by
doing some clever programming.
Looking at the Xapian docs, though, it appears
2006 May 17
3
QueryParser lowercase / uppercase and stemming
Hello.
There are several problems I couldn't find a solution.
1. QueryParser does not perform stemming
I am working with PHP5 and use the xapian wrapper written by Daniel M?nard
I build a query using parseQuery. Output of the parsed query shows that
terms are not stemmed, although a stemmer is set ( see code snippet)
# create a XapianDatabase object to search in
$db = new
2012 Oct 04
1
Synonyms of Abbreviations
Hello,
I am looking for a documentation or an example to use the synonym function.
I tried this
db.add_synonym("omega","xapain");
and this works by adding the flag FLAG_AUTO_SYNONYMS.
If i try to use the
db.add_synonym("omega","xapain is search engine ");
it fails why? Can xapian use synonym for Abbreviations like MBA => Master
of business
2012 Jan 04
2
[issue] The difference between QueryParser::FLAG_AUTO_SYNONYMS and QueryParser::FLAG_AUTO_MULTIWORD_SYNONYMS
I don't know whether this is a BUG or for special purpose...
According to the definition of "xapian/queryparser.h", FLAG_AUTO_MULTIWORD_SYNONYMS contains bit of
FLAG_AUTO_SYNONYMS .
Therefore, long as I set the parse flag with FLAG_AUTO_SYNONYMS, the query parser will automatically activate
the function of FLAG_AUTO_MULTIWORD_SYNONYMS. See the below source code part from
2008 Mar 12
1
how can i use stopwords?
Hi,
I do not understand the stopword function...
I've set the termgenerator like this:
$self->{'Stemmer'} = new Search::Xapian::Stem(german2);
$self->{'Stopper'} = new Search::Xapian::SimpleStopper();
$self->{'TermGenerator'} = new Search::Xapian::TermGenerator;
$self->{'TermGenerator'}->set_stemmer( $self->{'Stemmer'} );
2010 May 11
3
indexing words with alternative spellings
Some languages (e.g. German and Danish) have special letters that are
often written using two-letter combinations when the appropriate
keyboard or medium is not available:
? = ae
? = ue
? = oe
? = ae
? = oe
? = aa
? = ss
(there are undoubtedly far more examples than those)
As a user of an index, I would like to be able to search for
e.g. "schaefer" and get matches on both
2011 May 27
1
Does OP_NEAR works with stemming?
Hi All,
I used the OP_NEAR operator for queryparser, and when I searched for "apple store" from my own collection, the query is parsed as "Zappl:(pos=1) NEAR 11 Zstore:(pos=2)" but retrieved nothing. However, if I type in "Apple Store", the query is parsed as Xapian::Query((apple:(pos=1) NEAR 11 store:(pos=2))) and some results are showed. I'm not sure whether
2015 Dec 28
3
synonym expansion for boolean prefixes.
I have an application for synonyms for tags in in notmuch, which means
synonym expansion for a particular boolean prefix. I have a vague memory
of Olly telling me this doesn't work, but I'm not sure about the
details.
My higher level goal is to support a kind of indirection with tags,
where query tag:foo can really generate tag:bar or tag:fub, depending on
some kind of configuration.
2007 Jan 19
3
Are these numbers resonsable?
I have only one box[1] running 3 sub-systems[2] at my system, are these
numbers resonsable[3]??
[1] - From dmesg (FreeBSD 6.1-RELEASE):
AMD Sempron(tm) Processor 3000+ (1808.33-MHz K8-class CPU)
real memory = 2080309248 (1983 MB)
avail memory = 1997869056 (1905 MB)
ad0: 76350MB <SAMSUNG SP0802N TK200-04> at ata0-master UDMA33
[2] The sub-systems are:
1 - A server giving adreesses of
2008 Feb 22
0
Association to indicate synonyms
Which is the better way to indicating synonyms into a table? But I''d
get all those synonyms together to the main table. Althought if is
choosed a synonym I''d that it were directed to the object of the main
table.
I don''t know if a polymorphic association could be helpful in this
case. Any help please?
-------------
create_table :words do |t|
t.string :name
t.string
2006 Aug 22
3
synonyms
Hi,
Using ferret and acts_as_ferret.
Great work.
Is there a way to define some synonyms (searchable words that would not
appear in the texts ?
Like stop words, but instead of being removed from query and index,
they would be added ;-)
Can some synonyms be regexp ? I''d like for instance to have ? (oelig)
be equivalent to oe in French.
Or maybe an utf8 normalization could achieve
2010 Dec 18
1
Xapian index size 475GB = 170 million documents (URLs)
Xapians,
I am maintaining about two indexes for my search engines which
approximately is each the same size. I would like to share this
knowledge with you, since many of you have never seen Xapian index of
this size. And of course you can search the index by yourself at
- http://myhealthcare.com/
- http://find1friend.com/
I need 2 x 100 million more documents into each index, and I hope it
will
2012 Jan 05
1
Enhance synonyms feature of the query parser (patch included)
Very few people seem to be using synonym in Xapian, I recently found some problems in the use of synonyms.
Normally, I think we should not contain any prefix info in synonym table except that 'Z'.
For example, I have the following synonyms and prefix info:
db.add_synonym("search", "find");
db.add_synonym("Zsearch", "Zfind");
2008 Mar 27
2
Proper noun stemming
Hi All
I was wondering if anyone had a solution for the following problem.
I user QueryParser to stem my documents before adding them to a
database. During the stemming process I would like to find a way of
keeping proper nouns that span two or more words together as a phrase.
For example "New York" or "Gordon Brown" or "Prime Minister" get spilt
up. I see
2008 Nov 21
1
Multiple databases vs Single large database
Hi
I've decided to use xapian because my files table in my mysql database is going
to grow very large, and it seems mysql isn't good at full text searching. I'm
doing this with the php wrapper by the way.
The way my system is set out, each user has their own set of files, and when
doing a search it is going to be for a specific user's file (based on file
name, title,
2012 Mar 22
1
Warning from ExtUtils::MakeMaker
Installation was OK and all works well, but this warning is bad :-/ tried
15 min to find the failure (there is no failure) ...
$ perl Makefile.PL XAPIAN_CONFIG=/root/build/xapian-core/bin/xapian-config
PREFIX=/root/build/Search-Xapian
Checking if your kit is complete...
Looks good
'XAPIAN_CONFIG' is not a known MakeMaker parameter name.
Writing Makefile for Search::Xapian
$ perl
2012 Apr 19
1
Xapian::Database->close() for perl missing
I have a xapian-daemon, which can be queried via http. A background-process
generated every hour one new index and then remove and create a new symlink
to the current database.
/path/to/index/20120419010000
/path/to/index/20120419020000
/path/to/index/20120419030000
/path/to/index/default => /path/to/index/20120419030000
So the daemon only check the mtime of /path/to/index/default/iamchert
2013 Aug 26
2
Perl interface isn't working in 1.2.x
On 08/25/2013 05:02 PM, Olly Betts wrote:
> So the simple fix is
> probably just to install the perl-Search-Xapian RPM instead.
Thanks, the Centos 6 repos don't have that rpm and the
http://xapian.org/download page seems to only cover the XS bindings, if
I am reading this correctly:
But I was able to remove the rpm packages and compile and install the
core and swig from source.
2015 Sep 11
0
xapian-core-1.0.23 compiled on Interix / Eric Lindblad
Report by Eric Lindblad 11-09-2015
http://www.ericlindblad.blogspot.com
I compiled xapian-core-1.0.23 today on Interix.
'gmake check' summary
PASS: btreetest, quartztest, stemtest, termgentest
FAIL: apitest, internaltest, queryparsertest
MS XP SP3 HOME
modified SFU Interix 3.5 w/ getopt.h, inttypes.h, & stdint.h copied
over from SUA Interix for Vista (and newer) into