Displaying 20 results from an estimated 22 matches for "stemfilt".
Did you mean:
stemfilter
2008 May 12
1
Using StemFilter with PhraseQuery
Hi,
I''m having difficulty getting the StemFilter and PhraseQuery to work
properly together. When I use a StemFilter with a PhraseQuery, searches only
work if the phrase consists of stems. For example, the search phrase
"reduces health care" will not work but the phrase "reduce health care" will
work even though the exact tex...
2007 Jan 21
2
A few questions: Tweaking StemFilter, indexes, ...
Hello all,
I am new to the list, but I have been using ferret for a little bit
already. I would first like to thank Dave for all his work on ferret.
I had a few questions that I haven''t been able to figure out after
messing around with ferret and going through the documentation.
StemFilter ------
I am trying to improve the quality of my searches in context of the
content of my application. I have created an analyzer using the
following:
StemFilter.new StopFilter.new(
LowerCaseFilter.new(StandardTokenizer.new(text)), @stop_words )
This has been pretty good so far, however, I real...
2007 Jun 07
0
Ferret::Analysis::StemFilter documentation bug
"nl" selects the dutch stemming algorithm, "no" selects the norwegian
(as one would expect).
there''s no inconsistency, which the documentation would suggest (using
"dut" and "nld" for dutch stemming, "nl" or "no" for norwegian).
this is on ferret 0.11.4 at least, i didn''t check earlier versions.
phillip
--
Posted
2006 Sep 05
15
ferret finds ''tests'' but not ''test''
Hello all,
Quick question (possibly!) - I''ve got a few records indexed and doing a
search for ''test'' reports in no hits even though I know the word ''tests''
exists in the indexed field. Doing a search for ''tests'' produces a
result. I would have thought that ''test'' would match ''tests'' but no such
2007 Sep 07
5
Custom Analyser .. where to put it ??
...cedUsage
My problem is that i ve no idea where to put my custom Analyser class
like :
class GermanStemmingAnalyzer < Ferret::Analysis::Analyzer
include Ferret::Analysis
def initialize(stop_words = FULL_GERMAN_STOP_WORDS)
@stop_words = stop_words
end
def token_stream(field, str)
StemFilter.new(StopFilter.new(LowerCaseFilter.new(StandardTokenizer.new(str)),
@stop_words), ''de'')
end
end
Any clue ?
Thanks a lot
Guillaume.
--
Posted via http://www.ruby-forum.com/.
2007 Mar 06
1
case-sensitivity of analyzer
Is there anything about this analyzer that says "case-sensitive" to you?
module Ferret::Analysis
class StemmingAnalyzer
def token_stream(field, text)
StemFilter.new(StandardTokenizer.new(text))
end
end
end
Just wondering how I can force my index to be case-insensitive.
Thanks,
-Adam
--
Posted via http://www.ruby-forum.com/.
2006 Nov 02
3
Indexing and searching across multiple locales
Hi -
I''m currently investigating support for Ferret and content that spans
multiple locales. I am particularly interested in using stemming and fuzzy
searches (e.g. with slop factor) across multiple locales.
So far I''ve followed the online docs for implementing a Stemming Analyzer,
and it is working for English terms just fine. I''ve also written a method to
import data
2007 Jan 11
5
stop words in query
Hello all,
Quick question, I''m using AAF and the following custom analyzer:
class StemmedAnalyzer < Ferret::Analysis::Analyzer
include Ferret::Analysis
def initialize(stop_words = ENGLISH_STOP_WORDS)
@stop_words = stop_words
end
def token_stream(field, str)
StemFilter.new(StopFilter.new(LowerCaseFilter.new(StandardTokenizer.new(str)),
@stop_words))
end
However when my search term includes a stop word I never get any results
back. Once I remove the stop word I get the normal results back. Do I
need to do a search of my query for stop words and remove them...
2009 Apr 09
4
Weird analyzer issue with the word ''fly''
...ysis::StemmingAnalyzer.new,
:fields => {:name => { :boost => 2.0 },
...
}})
And this analyzer is defined in a module thus:
module Ferret::Analysis
class StemmingAnalyzer
def token_stream(field, text)
StemFilter.new(StandardTokenizer.new(text))
end
end
end
Now, here''s a search without using the analyzer:
>> TeachingObject.find_with_ferret("flea fly", :per_page => 2000).size
=> 14
And with the analyzer:
>> TeachingObject.find_with_ferret("flea fly"...
2007 Nov 09
2
Problem with stemming and AAF
...b directory,
as follows:
require ''rubygems''
require ''ferret''
class StemmedAnalyzer < Ferret::Analysis::Analyzer
include Ferret::Analysis
def initialize(stop_words = ENGLISH_STOP_WORDS)
@stop_words = stop_words
end
def token_stream(field, str)
StemFilter.new(StopFilter.new(LowerCaseFilter.new(StandardTokenizer.new(str)),
@stop_words))
end
end
And added the call to the analyzer in my model file:
acts_as_ferret( :fields => { :name => { :boost => 1,
:store => :yes },
:product_number =...
2006 Dec 08
4
Using custom stem analyzer giving mongrel errors
...9;m using the custom stem analyzer:
require ''rubygems''
require ''ferret''
include Ferret
module Ferret::Analysis
class FerretAnalyzer
def initialize(stop_words = FULL_ENGLISH_STOP_WORDS)
@stop_words = stop_words
end
def token_stream(field, text)
StemFilter.new(StopFilter.new(LowerCaseFilter.new(StandardTokenizer.new(text)),
@stop_words))
end
end
end
and I''m simply setting the :analyzer option in AAF.
However, I get odd behavior. The first search that I do will go through
and display the proper results, but any subsequent request sta...
2006 Dec 06
10
Stem Analyzer
Hi all,
I am trying to implement a search that will use the Stem Analyzer. I
added the Stem Anaylzer from the examples shown in another post
http://ruby-forum.com/topic/80178#147014
module Ferret::Analysis
class StemmingAnalyzer
def token_stream(field, text)
StemFilter.new(StandardTokenizer.new(text))
end
end
end
The problem with the Stem analyzer is that when I search for a term such
as ''engineering'', it only matches whole words that fit the stem so the
only results I get back are documents where ''engin'' is a whole wo...
2006 Nov 25
5
Metaphone analysis
...nil if t.nil?
t.text = @version.eql?(:double) ?
Text::Metaphone.double_metaphone(t.text) :
Text::Metaphone.metaphone(t.text)
end
end
end
end
Second I created a MetaphoneAnalyzer class that would use the MetaphoneFilter
created above. The MetaphoneAnalyzer also makes use of the StemFilter so that
words like "eat" and "eating" both equal to "eat".
require ''ferret''
# TODO write tests
module Curtis
module Analysis
class MetaphoneAnalyzer < Ferret::Analysis::Analyzer
include Ferret::Analysis
def initialize(version = :double,...
2007 Jul 19
1
partial term query by default
My customers seem to expect that the world of search is dictated by
Google. As such they expect all queries except phrase queries to
include partial term hits. In other words, searching on "test" also
returns "testing".
Is there a quick option to enable this or anyway more elegant than
gsub''ing the query to add asterisks around each term (except those in
phrases of
2007 Jan 17
1
Tokenizers?
Hi everyone. First a quick word - I am relatively new to Ruby and Ruby
on Rails, but I love learning about it and using it. Currently I am
working on extending Boxroom (file repository RoR app) for the CARE
Indonsia intranet, where I work as an intern. I am using ferret, and
it''s working great.
I noticed that if a file contains something like this
"applications/entries", this
2006 Dec 06
1
AAF - Stem Analyzer
I''m not on AAF. Can someone else help Raymond with an example?
On 12/6/06, Raymond O''connor <nappin713 at yahoo.com> wrote:
>
> Matt Schnitz wrote:
> > You also need to stem-analyze the incoming query.
> >
> > I had this same problem. :^>
> >
> >
> > Schnitz
>
> Do you have an example of how to do this? I''m using
2006 Oct 14
2
customer analyzer?
I''d like to make my own analyzer for stemming, but where do I put it or
how do I reference it?
--
Posted via http://www.ruby-forum.com/.
2006 Aug 18
1
Portuguese Stemming
Today while compiling ferret I noticed there was a Portuguese stemmer
being compiled. How do I enable it''s use for my index?
Pedro.
2007 Nov 13
8
acts_as_ferret : cannot use a customized Analyzer (as indicated in the AdvancedUsageNotes)
Hi all,
I cannot make aaf (rev. 220) use my custom analyzer, despite following the
indications @
http://projects.jkraemer.net/acts_as_ferret/wiki/AdvancedUsage
To pinpoint the problem, I created a model + a simple analyzer with 2 stop
words : "fax" and "gsm".
test 1 : model.rebuild_index + model.find_by_contents("fax") # fax is a
stop word.
=> I get a
2007 Mar 20
2
Strange Results For Term Frequencies
...e ''ferret''
$KCODE=''u''
text = <<END_OF_TEXT
Der Begriff Entgelt (n.; Plural "Entgelte") bezeichnet die in einem
Vertrag...
END_OF_TEXT
class StemAnalyzer < Ferret::Analysis::Analyzer
def token_stream(field, str)
return
Ferret::Analysis::StemFilter.new(Ferret::Analysis::StandardTokenizer.new(str),"german")
end
end
puts "Using Ferret v#{Ferret::VERSION}..."
puts "Using Ruby v#{VERSION}..."
@index = Ferret::I.new(:analyzer => StemAnalyzer.new())
@index << {:title => "Entgelt", :content...