similar to: relevance sets and stemmed terms

Displaying 20 results from an estimated 3000 matches similar to: "relevance sets and stemmed terms"

2007 Aug 23
0
Re: [Swig-devel] license issue
Just re-posting so swig-devel get so see it. It's an architectural solution whereby swig has the power to banish licensing conflicts that may exist between any imported library and the target system for which swig is generating a module. -----Original Message----- From: "Sam Liddicott" <sam@liddicott.com> To: "Alexander Lind" <malte@webstay.org>; "William
2006 Dec 01
1
writabledatabase_delete_document()
Hi guys I have implemented xapian on a website, and it currently has about 2M items in its index. Its all been working quite nicely so far, until I tried removing some old items from the index (removing items when the index was smaller was no problems at all). When I try to remove them now (using writabledatabase_delete_document() via php), it halfway freezes up the machine, and the apache
2013 Mar 13
1
patch-Add standard ExpandDecider subclass to restrict to terms with a particular prefix
Hi, guys, I wrote a patch for ticket #467<http://trac.xapian.org/ticket/467>(Add standard ExpandDecider subclass to restrict to terms with a particular prefix). A new ExpandDecider class is added in expanddecider.cc/h, which delete all unprefix terms I am a newbiee to open source contribution, please let me know if this patch is work. Patch is here(generate by svn diff):
2007 Mar 04
5
Getting non-stemmed terms from IndexReader
I need to get a set of terms being indexed using Ferret. I used IndexReader.terms and it returns a list of TermEnum nicely. The only problem is that my analyzer includes a stemming filter. So now, the terms I''m getting back are all stemmed. Is there anyway to get the original unstemmed terms back from the index somehow? Thanks. -- Posted via http://www.ruby-forum.com/.
2007 Apr 20
1
ExpandDecider and MatchDecider operator() return type
Currently ExpandDecider::operator() and MatchDecider::operator() return `int' for no very good reason that I can see. It would be more natural to return `bool', since these classes are making a "yes/no" decision about whether to include a term in an ESet or a document in an MSet. The problem is that this can't be done without breaking existing user code which defines
2006 May 15
1
term / posting question
Hi guys Sorry to take up your time with this, I have just been stuck on a little problem with xapian for a few days here and I can't seem to figure it out for myself. I have created an xapian index (using the php bindings). I have added documents to it, with values, terms and postings. I can successfully search in this index on anything that is in a posting, but if I search on a word that
2006 Jun 15
1
enquire bug(?)
Hi Guys I've run into a little problem, which I think may be a bug in Xapian (or more likely the php bindings). The functions I think may not be working correctly is Enquire_set_sort_by_value() and Enquire_set_sort_by_value_then_relevance() Given a result set where each document has a unique number set on the first value (0), I am trying to order the result by this value. If I do
2007 Mar 21
1
scoring question
Hi All I have just realized that if I set a query like 'green jelly bean' xapian will turn that query into 'green OR jelly OR bean' This causes documents containing just one of the words to be considered a 100% hit. The behavior I would like to see is that each word gives a 33.3% hit, so that a document containing all 3 words gets placed above a document with only 1 or 2
2017 Jun 14
2
KMeans Clusterer - Going forward
Hello, I have finished moving the API to PIMPL classes and will fix issues within the current code over the next week, based on reviews from mentors. The next step going forward is to start with forming document vectors that are reduced and more useful. This majorly helps in saving run time (since time for distance calculation depends on number of terms). Getting the useful terms within a
2007 Aug 19
1
xapian 1.0.2 in the ubuntu repo
Hi all Does anyone know when we can expect to see ubuntu 1.0.2 in the ubuntu repository? I am talking about the repo hosted at xapian.org, | (http://www.xapian.org/debian feisty main) |I know I can download it manually, but it just feels so good to type 'apt-get upgrade' instead :) Alec
2006 Dec 02
1
document_id globally incrementing
Hi All I have made my xapian indexer automatically create new indexes once it reaches X documents in each, and for each document that I add to each sub-index, I record its document_id and its index_id (relating to what index the document ended up in). writabledatabase_add_document() returns document_id:s beginning from 0 for each new index when you add new documents, like you would expect. So
2007 Jan 12
1
xapian error
Just got this error when replacing (updating) a document in the xapian index (using php bindings): Fatal error: Uncaught exception 'Exception' with message 'DatabaseError: Error reading block 16908825: got end of file' Does anyone know what this means exactly? Alec
2007 Aug 21
1
PHP extension
Hi All Is there any reason why someone could not build a regular PHP extension (not using SWIG) for Xapian? Same as mnoGoSearch has. The extension itself would be released under the PHP license. Is there anything on either side of the pond that would prevent this from being legal? Alec
2020 Sep 20
3
help improving relevance of snippets displayed by Omega
Olly, Thanks again very much for helping me improve my understanding of Xapian and Omega. Thanks especially for pointing out that my idea of trying to generate a snippet from stemmed text lacking capitalization and punctuation would probably not produce a user-friendly result. But I'm still doubtful that expanding the sample size could be the right way to obtain excerpts from the document
2020 Sep 18
2
help improving relevance of snippets displayed by Omega
Hi, Thanks for creating Xapian and Omega. I have been amazed by how easy they make it to get a basic full-text search engine up and running. I'm wondering if you can help me better understand one aspect of the results I am getting from the default query template. Usually the snippet that's displayed in response to a query doesn't contain the word that the user searched for, even
2006 Apr 06
1
search on subsets
hi all i am building an experimental php search app on top of xapian (using the xapian-bindings). i want to add a feature so that you can search within a search - search on a subset. i haven't yet started working on it, but as far as i can tell, there seems to be no easy way of doing this. can someone point me in the right direction? thanks alec
2006 Jul 06
1
Re: [XapianBug 84] Enquire_set_sort_by_value() and friends have no effect
ah, that explains it ... sorry to waste your time, im going to go sit in the corner for a while now :p is there a reason you have no numeric sort function? cheers alec bugzilla-daemon at ixion.tartarus.org wrote: > http://www.xapian.org/cgi-bin/bugzilla/show_bug.cgi?id=84 > > olly at survex.com changed: > > What |Removed |Added >
2006 May 15
1
adaptive query scoring
Hi all Is there a way to do adaptive query scoring (as in popular results returned by a query should get more weight because they are getting clicked more often) in xapian? Is this what the rset class should be used for? I could write a php app to do adaptive results scoring for separate words (just recording the clicks and then have a cron:ned script add weight to the document_id:s for the
2014 Apr 13
2
Adding an external library to Xapian
My code is not on Github. I am using the tarball as of now. The following it the error that occurred: http://pastebin.com/cVJrjUZX On Sun, Apr 13, 2014 at 8:16 PM, James Aylett <james-xapian at tartarus.org>wrote: > On 13 Apr 2014, at 15:37, Pallavi Gudipati <pallavigudipati at gmail.com> > wrote: > > > A linker error is encountered even after following the above
2020 Sep 19
2
help improving relevance of snippets displayed by Omega
Thanks Olly! But expanding the sample seems like the wrong solution. Is there a way to instead pass a hit or hits from the document to snippet generation? Michael