Displaying 20 results from an estimated 400 matches similar to: "Relevance, weighting and searching by specifically weighted text"
2011 Jul 28
1
Xapian and Synonyms
Hi guys
I've just had a thought about something we do with our search on
ReportBuyer.com: we cater for both American and British English in our
searches and we have had plans for a while now to implement something
that allows users to find 'colour' and 'color', 'tap' and 'faucet' by
doing some clever programming.
Looking at the Xapian docs, though, it appears
2011 May 16
2
Xapian.org design
Hi guys
We were talking a little while ago about putting together some content
to provide more examples of operations for first-time users.
One of the other thoughts that's come up is about the design of the
Xapian site; we've a bit of spare designer capacity at work and were
wondering if we could do you a jazz-ed up site design (for free) and
wondered if this'd be something the
2011 Jul 27
3
Searching using prefixes
Hi guys
I'm trying to figure out how I can use probabilistic searching on a
given field within a document; I've written to the list about this
before, but haven't quite figured out what's required and, following a
little research, I think I understand what I need to do but I'd like a
clarification on this.
o We have a database of a number of documents, with fields: title,
2017 Apr 09
3
Omega: Missing support for newer weighting schemes
On Sun, Apr 09, 2017 at 11:34:07PM +0530, Vivek Pal wrote:
> > Each scheme already has a human-readable name, and Xapian::Registry
> > can map that to an "examplar" object of the right type, so we
> > could take a string like "bm25 1 0.8", see the first word is "bm25"
> > and get a BM25Weight object, then call parse_params("1 0.8") on
2017 Apr 12
4
Omega: Missing support for newer weighting schemes
> Each scheme already has a human-readable name, and Xapian::Registry
> can map that to an "examplar" object of the right type, so we
> could take a string like "bm25 1 0.8", see the first word is "bm25"
> and get a BM25Weight object, then call parse_params("1 0.8") on it to
> create the correct Weight object (broadly similar to how
2017 Apr 13
2
Omega: Missing support for newer weighting schemes
On Mon, Apr 10, 2017 at 11:47:36PM +0530, Vivek Pal wrote:
> > No, use Xapian::Registry to find the weighting scheme from the name
> > like how Weight::unserialise() does (otherwise every caller would need
> > code similar to that above).
>
> Okay, I looked into Xapian::Registry and it seems you are referring to using
> the get_weighting_scheme method? (which expects a
2017 Apr 08
2
Omega: Missing support for newer weighting schemes
On Sat, Apr 08, 2017 at 09:11:22PM +0100, James Aylett wrote:
> On 8 Apr 2017, at 19:15, Vivek Pal <vivekpal.dtu at gmail.com> wrote:
>
> >> and the details of which weighting schemes were available in which version
> >> isn't a key part of the $set command itself.
> >
> > Do you suggest dropping that piece of information out? Since the reason behind
2011 Aug 11
3
Fwd: Re: what is the fastest way to fetch results which are sorted by timestamp ?
(Forwarded off-list message)
-------- Original Message --------
Subject: Re: [Xapian-discuss] what is the fastest way to fetch results
which are sorted by timestamp ?
Date: Thu, 11 Aug 2011 01:06:36 +0800
From: ??? <panjunyong at gmail.com>
To: Tim Brody <tdb2 at ecs.soton.ac.uk>
On Wed, Aug 10, 2011 at 6:39 PM, Tim Brody <tdb2 at ecs.soton.ac.uk> wrote:
> Hi,
>
> In
2018 Mar 31
2
sorting large msets
Olly Betts <olly at survex.com> wrote:
> On Fri, Mar 30, 2018 at 05:21:43PM +0000, Eric Wong wrote:
> > Hello, is there a way to optimize sorting by certain values
> > for queries which return a huge amount of results?
> [...]
> > $enquire->set_sort_by_value_then_relevance(0, 1);
>
> If you're just wanting the 200 newest, it'll be faster not to
2011 Aug 09
3
what is the fastest way to fetch results which are sorted by timestamp ?
what is the fastest way to fetch results which are sorted by timestamp ?
i want to use xapian as my search engine , use add_boolean_term(something) and add_value(0,sortable_serialise(get_timestamp())) to a doc.
search through enquire.set_weighting_scheme(xapian.BoolWeight()) and enquire.set_sort_by_value(0,True) to ensure that the results are sorted by the timestamp.
This method is ok , but
2013 Oct 23
2
performance on document.get_data()
I got some performance issue for document.get_data() and
enquire.get_mset(). It costs 35 seconds for matches =
enquire.get_mset(0,200), and 3 seconds for iterating all doc in matches to
get_data. Is't normal? My index contains 30millions documents. I use python
binding to operate xapian. Bellow it's my index structure
# value: 0:date, 1:site
# data: json message which contains: author,
2018 Mar 30
2
sorting large msets
Hello, is there a way to optimize sorting by certain values
for queries which return a huge amount of results?
For example, I just want a simple query that gives me the 200
most recent emails out of millions. The elapsed time for
get_mset increases as the number of documents ($n * 2000)
increases.
I suppose I could store a pre-sorted set using SQLite or
similar. Thanks in advance for any
5.1 - kernel-2.6.18-53.el5.centos.plus.i686.rpm: [Errno -1] Package does not match intended download
2007 Dec 04
1
5.1 - kernel-2.6.18-53.el5.centos.plus.i686.rpm: [Errno -1] Package does not match intended download
Thank you for the great CentOS.
When trying to update from 5.0 to 5.1, I get an error regarding the
centos.plus kernel and headers, "[Errno -1] Package does not match
intended download".
What'd I do?
The end of the yum update results below:
Transaction Summary
=============================================================================
Install 11 Package(s)
Update
2005 Jul 28
3
MySQL authentication
Hi
I am wanting to install : dovecot 1.0-stable
I currently have dovecot 0.99.11-1 which came packaged on Fedora Core 3
It is doing auth off a MySQL db an working well
I have downloaded the latest stable tarball and extracted ... I don't
see how to enable mysql lookups at compile time...
Help please ... I am getting a lot of pressure from the people at work
...
Regards
Andrew
Andrew
2011 Apr 01
0
Xapian-discuss Digest, Vol 83, Issue 1
I think this is a shining example of how well Xapian works with large
document collections. I was just discussing this with my colleagues here
and one of the issues that came up is that we'd love Xapian to become
really lot more popular but have found that the documentation's a bit
difficult to get into, as is the API.
So I was wondering: do you have any thoughts on improving this and
2010 Apr 02
1
Question from a new user of xapian: query term weight
Hi all,
I've been a Lucene user for the past year, but lately, with most of my
project moving to Python, I really love Xapian's clean python binding.
I can't seem to see how to boost a query term using Xapian's query
syntax. In Lucene, there is "hello^4 world^.2" to boost "hello" and
suppress "world". However, digging through Xapian's
2012 Apr 20
1
Implementing the tf-idf weighting scheme
Hi, all:
This is the basic implementation of tf-idf scheme (basic scheme used in
SMART) that can be used in the Xapian. It might still need some futher
revision, but I believe it works anyway.:)
I modified the weight.h to define a subclass Tf_idfWeight and add a new
file tf_idf.cc in ../weight in the repo, to implement Tf_idfWeight.
Here is the git diff patch:
https://gist.github.com/2422049
2005 Oct 14
1
DoveCot 1.0a3 hangs at login
I've just upgraded to 1.0a3 via a freebsd port, and I've begun to
notice odd behaviour from Dovecot. After converting my config file
settings over to the new format, everything runs fine for an hour or
so, then dovecot stops responding:
oot at toejamfootball# telnet localhost 143
Trying ::1...
telnet: connect to address ::1: Connection refused
Trying 127.0.0.1...
Connected to localhost.
2009 Sep 02
6
dnsmasq - I'm a little confused....
I have attempted to update with yum over the last 27 hours, and all I
get is this:
[mhr at mhrichter ~]$ sudo yum update
Password:
Loaded plugins: fastestmirror, priorities
Loading mirror speeds from cached hostfile
* base: linux.mirrors.es.net
* updates: centos.g5selfstorage.com
* extras: centos.mirrors.redwire.net
kbs-CentOS-Extras | 951 B 00:00
2010 Aug 23
1
Sort ordering
Using MultiValueSorter, I can sort by key1, key2, relevance; or relevance, key1, key2.
But AFAIK, I can't sort by key1, relevance, key2. Unless I spool out the entire result set or write some C++.
I wonder if we need a new 'sort by' function that accepts any combination of keys and relevance in any order? The function would make it's own optimisations (ie is relevance first or