Displaying 20 results from an estimated 3000 matches similar to: "relevance sets and stemmed terms"
2007 Aug 23
0
Re: [Swig-devel] license issue
Just re-posting so swig-devel get so see it.
It's an architectural solution whereby swig has the power to banish licensing conflicts that may exist between any imported library and the target system for which swig is generating a module.
-----Original Message-----
From: "Sam Liddicott" <sam@liddicott.com>
To: "Alexander Lind" <malte@webstay.org>; "William
2006 Dec 01
1
writabledatabase_delete_document()
Hi guys
I have implemented xapian on a website, and it currently has about 2M
items in its index.
Its all been working quite nicely so far, until I tried removing some
old items from the index (removing items when the index was smaller was
no problems at all).
When I try to remove them now (using writabledatabase_delete_document()
via php), it halfway freezes up the machine, and the apache
2013 Mar 13
1
patch-Add standard ExpandDecider subclass to restrict to terms with a particular prefix
Hi, guys, I wrote a patch for ticket
#467<http://trac.xapian.org/ticket/467>(Add
standard ExpandDecider subclass to restrict to terms with a particular
prefix).
A new ExpandDecider class is added in expanddecider.cc/h, which delete all
unprefix terms
I am a newbiee to open source contribution, please let me know if this
patch is work.
Patch is here(generate by svn diff):
2007 Mar 04
5
Getting non-stemmed terms from IndexReader
I need to get a set of terms being indexed using Ferret. I used
IndexReader.terms and it returns a list of TermEnum nicely. The only
problem is that my analyzer includes a stemming filter.
So now, the terms I''m getting back are all stemmed. Is there anyway to
get the original unstemmed terms back from the index somehow? Thanks.
--
Posted via http://www.ruby-forum.com/.
2007 Apr 20
1
ExpandDecider and MatchDecider operator() return type
Currently ExpandDecider::operator() and MatchDecider::operator() return
`int' for no very good reason that I can see. It would be more natural to
return `bool', since these classes are making a "yes/no" decision about
whether to include a term in an ESet or a document in an MSet.
The problem is that this can't be done without breaking existing user
code which defines
2006 May 15
1
term / posting question
Hi guys
Sorry to take up your time with this, I have just been stuck on a little
problem with xapian for a few days here and I can't seem to figure it
out for myself.
I have created an xapian index (using the php bindings). I have added
documents to it, with values, terms and postings. I can successfully
search in this index on anything that is in a posting, but if I search
on a word that
2006 Jun 15
1
enquire bug(?)
Hi Guys
I've run into a little problem, which I think may be a bug in Xapian (or
more likely the php bindings).
The functions I think may not be working correctly is
Enquire_set_sort_by_value()
and
Enquire_set_sort_by_value_then_relevance()
Given a result set where each document has a unique number set on the
first value (0), I am trying to order the result by this value.
If I do
2007 Mar 21
1
scoring question
Hi All
I have just realized that if I set a query like
'green jelly bean'
xapian will turn that query into
'green OR jelly OR bean'
This causes documents containing just one of the words to be considered
a 100% hit.
The behavior I would like to see is that each word gives a 33.3% hit, so
that a document containing all 3 words gets placed above a document with
only 1 or 2
2017 Jun 14
2
KMeans Clusterer - Going forward
Hello,
I have finished moving the API to PIMPL classes and will fix issues within
the current code over the next week, based on reviews from mentors.
The next step going forward is to start with forming document vectors that
are reduced and more useful. This majorly helps in saving run time (since
time for distance calculation depends on number of terms). Getting the
useful terms within a
2007 Aug 19
1
xapian 1.0.2 in the ubuntu repo
Hi all
Does anyone know when we can expect to see ubuntu 1.0.2 in the ubuntu
repository?
I am talking about the repo hosted at xapian.org, |
(http://www.xapian.org/debian feisty main)
|I know I can download it manually, but it just feels so good to type
'apt-get upgrade' instead :)
Alec
2006 Dec 02
1
document_id globally incrementing
Hi All
I have made my xapian indexer automatically create new indexes once it
reaches X documents in each, and for each document that I add to each
sub-index, I record its document_id and its index_id (relating to what
index the document ended up in).
writabledatabase_add_document() returns document_id:s beginning from 0
for each new index when you add new documents, like you would expect.
So
2007 Jan 12
1
xapian error
Just got this error when replacing (updating) a document in the xapian
index (using php bindings):
Fatal error: Uncaught exception 'Exception' with message 'DatabaseError:
Error reading block 16908825: got end of file'
Does anyone know what this means exactly?
Alec
2007 Aug 21
1
PHP extension
Hi All
Is there any reason why someone could not build a regular PHP extension
(not using SWIG) for Xapian?
Same as mnoGoSearch has.
The extension itself would be released under the PHP license.
Is there anything on either side of the pond that would prevent this
from being legal?
Alec
2020 Sep 20
3
help improving relevance of snippets displayed by Omega
Olly,
Thanks again very much for helping me improve my understanding of Xapian
and Omega. Thanks especially for pointing out that my idea of trying to
generate a snippet from stemmed text lacking capitalization and punctuation
would probably not produce a user-friendly result.
But I'm still doubtful that expanding the sample size could be the right
way to obtain excerpts from the document
2020 Sep 18
2
help improving relevance of snippets displayed by Omega
Hi,
Thanks for creating Xapian and Omega. I have been amazed by how easy they
make it to get a basic full-text search engine up and running.
I'm wondering if you can help me better understand one aspect of the
results I am getting from the default query template.
Usually the snippet that's displayed in response to a query doesn't contain
the word that the user searched for, even
2006 Apr 06
1
search on subsets
hi all
i am building an experimental php search app on top of xapian (using the
xapian-bindings).
i want to add a feature so that you can search within a search - search
on a subset.
i haven't yet started working on it, but as far as i can tell, there
seems to be no easy way of doing this.
can someone point me in the right direction?
thanks
alec
2006 Jul 06
1
Re: [XapianBug 84] Enquire_set_sort_by_value() and friends have no effect
ah, that explains it ... sorry to waste your time, im going to go sit in
the corner for a while now :p
is there a reason you have no numeric sort function?
cheers
alec
bugzilla-daemon at ixion.tartarus.org wrote:
> http://www.xapian.org/cgi-bin/bugzilla/show_bug.cgi?id=84
>
> olly at survex.com changed:
>
> What |Removed |Added
>
2006 May 15
1
adaptive query scoring
Hi all
Is there a way to do adaptive query scoring (as in popular results
returned by a query should get more weight because they are getting
clicked more often) in xapian? Is this what the rset class should be
used for?
I could write a php app to do adaptive results scoring for separate
words (just recording the clicks and then have a cron:ned script add
weight to the document_id:s for the
2014 Apr 13
2
Adding an external library to Xapian
My code is not on Github. I am using the tarball as of now. The following
it the error that occurred:
http://pastebin.com/cVJrjUZX
On Sun, Apr 13, 2014 at 8:16 PM, James Aylett <james-xapian at tartarus.org>wrote:
> On 13 Apr 2014, at 15:37, Pallavi Gudipati <pallavigudipati at gmail.com>
> wrote:
>
> > A linker error is encountered even after following the above
2020 Sep 19
2
help improving relevance of snippets displayed by Omega
Thanks Olly!
But expanding the sample seems like the wrong solution. Is there a way to
instead pass a hit or hits from the document to snippet generation?
Michael