similar to: Thanks for the detailed feedback on the stemmer

Displaying 20 results from an estimated 6000 matches similar to: "Thanks for the detailed feedback on the stemmer"

2013 Jan 24
1
Integrating a PaiceHusk stemmer into the library
Hey guys Hi :) I've implemented a PaiceHusk stemmer externally So what I am doing right now is passing a pointer to my StemPaiceHusk class(which in turn has been subclassed from Stemimplementation) to the Stem::Stem(StemImplementation *p) constructor .So basically,I have to include "paicehusk.h" in my indexer .However,I now want to make it a part of the Xapian library so that I
2013 Jan 10
1
Add an example to the community page and contribute more code
Hi guys.I've finished an example indexer which acts like a grep replacement for a file.It indexes each line containing a proper noun in a given text file.The line containing the proper noun will be displayed upon searching for that noun.I would like to add it to the community code examples.I'm planning to write more examples which demonstrate some advanced features of Xapian along similar
2014 Sep 05
0
stemmer or no-stemmer..
Hi, In the documentation of FTS lucene, I need little more clarification about this section: -------------------------------------------------- --with-lucene --with-stemmer The second switch is only required if you have compiled libstemmer yourself or if it's included in the CLucene you are using. -------------------------------------------------- I am totally unsure but as I understand,
2013 Feb 19
2
Implementing tf-idf weighting scheme in Xapian
Hello guys.I just read up about tf-idf schemes and want to implement it in Xapian (with some frequently used normalizations) as it will also give me a good hang of implementing a weighting scheme before I start working on implementing DFR schemes. I read the following as references and I think Ive understood it well and can write the hack :- 1.)
2014 Dec 01
2
Adding Support for Krovetz Stemmer Algo in Xapian
> On 30 Nov 2014, at 17:51, Abhishek Singh Kushwah <abhishek18kushwah at gmail.com> wrote: > > > Two of the implementation of algorithms has already been rejected > > previously due to licenses both being the implementation of porter > > but our xapian use implementation in snowball which i assume is > > under GPL. The only cases I can think you might be
2013 Mar 26
1
Merging of the TfIdf patch
Hello Guys. I have updated the code,tests,documentation,makefile entries and the registry entry of the* *TfIdf patch as per the feedback.Please do let me know if any additional changes are required before the patch can be merged, -Regards -Aarsh On Sun, Mar 3, 2013 at 2:50 PM, aarsh shah <aarshkshah1992 at gmail.com> wrote: > Hello guys.I have sent a pull request for the code and
2010 Mar 13
1
Stemmer in Xapian, some questions from student.
Hello, I am student from University of Science and Technology in Cracow. I would like to ask you some questions about Xapian, because it is one of few search engines I am testing. 1. What type of stemmer is used in Xapian? Are You able to write some important features applied to Xapian? 2. We must use stemming for polish language. I read on Your webpage what languages include Xapian. There is no
2005 Oct 12
2
Stemmer Modifications
I'm using Xapian as a search back-end on a website. My client has certain search terms that the stemmer does not stem in a way they would like. For example "continuity" stems to "continu", which produces undesirable results in their application. Is there a way to override the stemming of certain words in a way that is compatible with the indexing stemmer and the query
2015 Feb 15
3
Bitsize project: Krovetz Stemmer
Hello xapian devs, I had shown interest in writing a krovetz stemmer for xapian and spoke to James Aylett about it. Since it was hard to code the stemmer in snowball, I came up with a C++ implementation of the stemmer. But since it is a dictionary based stemmer, im having problems on deciding how to create the dictionary. I did check out some of the implementations of the Krovetz stemmer online
2014 May 14
2
Starting work on Perf Test Module
Hello, I am beginning work on the perf test module. The initial steps that I aim to accomplish are :- -> Download the wikipedia dumps for multiple languages . -> Write python scripts to tokenize the dump (will probably use something like nltk which has powerful inbuilt tokenizers) -> Discuss and finalize the design of the search and query expansion perf tests as I want to complete them
2013 Mar 27
1
Need help as Pl2 tests not performing as expected
Hello guys. I just ran the updated tests for PL2 and they are not giving the mset order I expect.Now,the thing is, dfr's behavior is a bit hard to predict and so even if I expect a particular order ,it may give another order and still be correct.So,the only way to write correct tests for PL2 is to manually calculate the weight of the documents to decide the expected order.For that,I need to
2010 Apr 11
1
A Hebrew stemmer based on libhspell
Hello. I'm interested in creating a Hebrew stemmer to use with Xapian. Hebrew is a complicated language to stem, as it uses the semitic "root" system, rather than prefixes and suffixes, and has many irregularities in accidence (morphology). Fortunately, two bright fellows from the Technion University in Israel have already created a Hebrew morphological analyzer as part of their
2013 Mar 03
0
Added code and tests for the tf-idf weighting scheme.
Hello guys.I have sent a pull request for the code and tests of the Tf-Idf weighting scheme. Please do let me know if any changes are required.Meanwhile,Ill begin working on implementing normalizations which require additional statistics and on the DFR schemes. https://github.com/xapian/xapian/pull/6 On Tue, Feb 26, 2013 at 5:30 PM, <xapian-devel-request at lists.xapian.org>wrote: >
2014 Nov 29
4
Adding Support for Krovetz Stemmer Algo in Xapian
Hello, As mentioned on the project ideas page, Adding more support for stemmer algorithm, i found an implementation of Krovetz Stemmer Algo in C++ but before working on it to merge it into xapian, i needed help in recognizing the license information associated with the source code. To avoid further licensing issues kindly someone check the link
2008 Nov 05
1
Polish stemmer?
Hi, is there a xapian stemmer suitable for polish or czech languages? Thanks, Torsten
2010 Feb 02
1
How to use a custom stemmer from Python bindings?
Hi, I'm using Xapian bindings for Python in my project. How could I use a custom stemmer instead of the included one (Snowball)? The one I'm looking at right now is Hunspell (http://hunspell.sourceforge.net/) which has Python bindings (http://code.google.com/p/pyhunspell/). Thanks in advance, Eugene
2014 Mar 31
2
Paice-Husk Stemmer
Hi everyone, I was working on the Paice-Husk Stemmer, which is a Bite Size Project for Xapian, and I have created a C++ as well as Snowball version of it. I read the algorithm, and picked the rules from here: http://www.comp.lancs.ac.uk/computing/research/stemming/paice/descript.htm The C++ code takes rules as input from a file and generates the stem of given word, whereas the Snowball version
2006 Apr 20
1
Odd stemmer behavior
I've noticed some strange results from the stemmer in the Ruby port: irb(main):003:0> @stem.stem_word("anybody") => "anybodi" irb(main):004:0> @stem.stem_word("swimmingly") => "swim" irb(main):005:0> @stem.stem_word("fiercely") => "fierc" irb(main):006:0> @stem.stem_word("fraudulently") =>
2007 Jan 09
2
non-snowball stemmer
Hi! I am going to use non-snowball russian stemmer with Xapian. There is a good one at http://www.aot.ru. I've found that current implementation of Xapian::Stem does not allow it (there is no public interface for Xapian::Stem::Internal). Do you apply patches? Are there any recommendations for writing patches? Regards, Oleg Obolenskiy highpower at mail.ru
2013 Mar 03
0
Sent a pull request for testing TradWeight using an Rset.
Hello guys.As discussed on IRC,I have sent a pull request for a test for testing TradWeight with an Rset. On Fri, Mar 1, 2013 at 5:30 PM, <xapian-devel-request at lists.xapian.org>wrote: > Send Xapian-devel mailing list submissions to > xapian-devel at lists.xapian.org > > To subscribe or unsubscribe via the World Wide Web, visit >