Displaying 20 results from an estimated 1000 matches similar to: "Bitsize project: Krovetz Stemmer"
2015 Feb 10
3
Bitsize project - Krovetz stemmer
Hello Xapian devs,
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20150210/c848e9b7/attachment-0002.html>
2014 Nov 29
4
Adding Support for Krovetz Stemmer Algo in Xapian
Hello,
As mentioned on the project ideas page, Adding more support for stemmer
algorithm,
i found an implementation of Krovetz Stemmer Algo in C++ but before
working on it to merge it into xapian, i needed help in recognizing the
license information associated with the source code.
To avoid further licensing issues kindly someone check the link
2014 Dec 01
2
Adding Support for Krovetz Stemmer Algo in Xapian
> On 30 Nov 2014, at 17:51, Abhishek Singh Kushwah <abhishek18kushwah at gmail.com> wrote:
>
> > Two of the implementation of algorithms has already been rejected
> > previously due to licenses both being the implementation of porter
> > but our xapian use implementation in snowball which i assume is
> > under GPL.
The only cases I can think you might be
2005 Oct 12
2
Stemmer Modifications
I'm using Xapian as a search back-end on a website. My client has
certain search terms that the stemmer does not stem in a way they would
like. For example "continuity" stems to "continu", which produces
undesirable results in their application. Is there a way to override the
stemming of certain words in a way that is compatible with the indexing
stemmer and the query
2013 Jan 24
1
Integrating a PaiceHusk stemmer into the library
Hey guys Hi :) I've implemented a PaiceHusk stemmer externally So what I
am doing right now is passing a pointer to my StemPaiceHusk class(which in
turn has been subclassed from Stemimplementation) to the
Stem::Stem(StemImplementation *p) constructor .So basically,I have to
include "paicehusk.h" in my indexer .However,I now want to make it a part
of the Xapian library so that I
2007 Jan 09
2
non-snowball stemmer
Hi!
I am going to use non-snowball russian stemmer with Xapian. There is a
good one at http://www.aot.ru. I've found that current implementation of
Xapian::Stem does not allow it (there is no public interface for
Xapian::Stem::Internal). Do you apply patches? Are there any
recommendations for writing patches?
Regards,
Oleg Obolenskiy
highpower at mail.ru
2008 Nov 05
1
Polish stemmer?
Hi,
is there a xapian stemmer suitable for polish or czech languages?
Thanks,
Torsten
2010 Feb 02
1
How to use a custom stemmer from Python bindings?
Hi,
I'm using Xapian bindings for Python in my project. How could I use a
custom stemmer instead of the included one (Snowball)? The one I'm
looking at right now is Hunspell (http://hunspell.sourceforge.net/)
which has Python bindings (http://code.google.com/p/pyhunspell/).
Thanks in advance,
Eugene
2014 Mar 31
2
Paice-Husk Stemmer
Hi everyone,
I was working on the Paice-Husk Stemmer, which is a Bite Size Project for
Xapian, and I have created a C++ as well as Snowball version of it.
I read the algorithm, and picked the rules from here:
http://www.comp.lancs.ac.uk/computing/research/stemming/paice/descript.htm
The C++ code takes rules as input from a file and generates the stem of
given word, whereas the Snowball version
2006 Apr 20
1
Odd stemmer behavior
I've noticed some strange results from the stemmer in the Ruby port:
irb(main):003:0> @stem.stem_word("anybody")
=> "anybodi"
irb(main):004:0> @stem.stem_word("swimmingly")
=> "swim"
irb(main):005:0> @stem.stem_word("fiercely")
=> "fierc"
irb(main):006:0> @stem.stem_word("fraudulently")
=>
2014 Sep 05
0
stemmer or no-stemmer..
Hi,
In the documentation of FTS lucene, I need little more clarification
about this section:
--------------------------------------------------
--with-lucene --with-stemmer
The second switch is only required if you have compiled libstemmer
yourself or if it's included in the CLucene you are using.
--------------------------------------------------
I am totally unsure but as I understand,
2008 Aug 16
1
python how do i stem words in python?
hi,
i am newbie to xapian and am trying to get started with it in python.
there is no stemmer.stem_word method in the latest python library. how
do i stem words before doing doc.add_posting?
is there any sample hello world code in python that i can use?
thanks a lot!
>>> stemmer = xapian.Stem('english')
>>> stemmer.
stemmer.__call__
2010 Mar 13
1
Stemmer in Xapian, some questions from student.
Hello,
I am student from University of Science and Technology in Cracow. I would
like to ask you some questions about Xapian, because it is one of few
search engines I am testing.
1. What type of stemmer is used in Xapian? Are You able to write some
important features applied to Xapian?
2. We must use stemming for polish language. I read on Your webpage what
languages include Xapian. There is no
2017 Jun 14
2
KMeans Clusterer - Going forward
Hello,
I have finished moving the API to PIMPL classes and will fix issues within
the current code over the next week, based on reviews from mentors.
The next step going forward is to start with forming document vectors that
are reduced and more useful. This majorly helps in saving run time (since
time for distance calculation depends on number of terms). Getting the
useful terms within a
2010 Apr 11
1
A Hebrew stemmer based on libhspell
Hello.
I'm interested in creating a Hebrew stemmer to use with Xapian. Hebrew is a
complicated language to stem, as it uses the semitic "root" system, rather
than prefixes and suffixes, and has many irregularities in accidence
(morphology).
Fortunately, two bright fellows from the Technion University in Israel have
already created a Hebrew morphological analyzer as part of their
2013 Jan 10
1
Add an example to the community page and contribute more code
Hi guys.I've finished an example indexer which acts like a grep replacement
for a file.It indexes each line containing a proper noun in a given text
file.The line containing the proper noun will be displayed upon searching
for that noun.I would like to add it to the community code examples.I'm
planning to write more examples which demonstrate some advanced features of
Xapian along similar
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all,
I have some troubles using the stemming algorithm provided by the tm
(text mining) + Snowball packages.
Here is my config:
MacOS 10.5
R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions)
I have installed all the needed packages (tm, rJava, rWeka, Snowball)
+ dependencies. I have desactivated AWT (like written in
2009 Sep 09
2
InvalidArgumentError throw using Turkish stemmer and posting text "'leri"
Hi all,
I've come across a very strange bug with Xapian 1.0.9.0 and the Turkish
query parser when trying to index a string (as posting) that looks like
this: "...bir araya getiren CD'leri son teknolojiyle piyasaya...". The
actual offending bit of the string is: 'leri
It throws the message I have shown below. The real annoyance is that I can't
seem to catch it because it
2013 Feb 14
0
Thanks for the detailed feedback on the stemmer
Hey Hi Olly :) Thank you so much for your detailed help and feedback on the
stemmer patch . Ive begun working on all the points that you mentioned.Will
send a new pull request in a couple of days.:) Sorry for the shabby
patch,I'm just new to this stuff,Ill do better next time. :)
-Regards
-Aarsh
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
2006 Mar 29
1
Using boolean terms in PHP bindings
OK, I'm indexing my data with the scriptindex. I want to be able to
restrict the search by the category field. Do I need to do anything to
the data itself? Like, literally prefix it with the characters "XC"?
Below is my indexor for scriptindex and the my php code...
document_id : field=ref unique=Q boolean=Q
search_id : field=document_id index=S
document_title : field=title