Displaying 20 results from an estimated 500 matches similar to: "StandardTokenizer Doesn't Support token_stream method"
2007 Jul 07
2
Extending/Modifying QueryParser
Hi,
I've implemented synonym searching in my Rails application, but I have
an idea I'd like to implement and can't figure out how to do it. The
idea is to give the end user the choice of whether to
search for the synonyms of a word or not, preferably by extending the
query language to parse a construct similar to '%word1' and
2006 Nov 25
5
Metaphone analysis
Not sure how much this will interest people, but I don't have a blog, so I'm
posting something I threw together today because I think it might be useful.
In what little free time I have, I've been wanting to put together a
Rails/Ferret based RESTful dictionary. I finally got a chance to get
started today, and the first thing I wanted to do was implement a metaphone
2006 Oct 20
0
Ferret 0.10.13 released
Hi Folks,
I've just released Ferret 0.10.13 (skip 0.10.12, it was a bad build).
There are two interesting additions to this release. You can now
access the Filter#bits method of the built-in filters, so you can
use them in your own code, possibly within your own custom filters.
For example, you could implement a custom filter like so:
class MultiFilter < Hash
def
2006 Sep 15
1
Custom analyzer not invoked?
Hello,
I'm trying to define my own analyzer by doing something like:
#-----------------------------------------------------
require 'ferret'
include Ferret
class MyAnalyzer < Analysis::Analyzer
  def token_stream(field, str)
    # Display results of analysis
    puts 'Analyzing: field:%s str:%s' % [field, str]
    t =
2007 Nov 09
2
Problem with stemming and AAF
I'm sure I'm missing something completely obvious here, so I hope
someone can point me in the right direction!
I've implemented a basic search with AAF, which works as expected; I'm
running a Ferret DRb server, and using will_paginate to page results.
The code in my search_controller.rb:
search_text = params[:query] || " "
@products =
2007 Mar 06
1
case-sensitivity of analyzer
Is there anything about this analyzer that says "case-sensitive" to you?
module Ferret::Analysis
  class StemmingAnalyzer
    def token_stream(field, text)
      StemFilter.new(StandardTokenizer.new(text))
    end
  end
end
Just wondering how I can force my index to be case-insensitive.
Thanks,
-Adam
--
Posted via http://www.ruby-forum.com/.
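A possible reading of the analyzer quoted above: its chain goes straight from the tokenizer to the stem filter with no lowercasing step, whereas the StemmedAnalyzer quoted elsewhere in this listing wraps the tokenizer in LowerCaseFilter first. The sketch below only illustrates that idea with toy stand-in classes written for this example (ToyTokenizer, ToyLowerCaseFilter and drain are hypothetical names, not Ferret classes), and it assumes the tokenizer itself does not fold case.

```ruby
# Toy stand-ins for a tokenizer/filter chain. These are NOT Ferret classes;
# they only mimic the "wrap one stream in another" pattern.
class ToyTokenizer
  def initialize(text)
    @words = text.scan(/\w+/)
  end

  # Returns the next token, or nil when the input is exhausted.
  def next
    @words.shift
  end
end

class ToyLowerCaseFilter
  def initialize(stream)
    @stream = stream
  end

  # Passes each token through, folded to lower case.
  def next
    token = @stream.next
    token && token.downcase
  end
end

# Collects every token a stream produces into an array.
def drain(stream)
  tokens = []
  while (token = stream.next)
    tokens << token
  end
  tokens
end

TEXT = "Ferret FERRET ferret"

# Without a lowercasing step, the three spellings stay distinct terms.
CASE_SENSITIVE = drain(ToyTokenizer.new(TEXT))

# With the filter in the chain, they collapse to a single term.
CASE_FOLDED = drain(ToyLowerCaseFilter.new(ToyTokenizer.new(TEXT)))
```

If the same chain is used both when indexing and when parsing queries, the folded form is what ends up being compared on both sides, which is the usual way to get case-insensitive matching.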
2007 Sep 07
5
Custom Analyser .. where to put it ??
Hi,
I'm trying to use a custom analyser to add my French stop words. I'm
reading the tutorial at:
http://projects.jkraemer.net/acts_as_ferret/wiki/AdvancedUsage
My problem is that I have no idea where to put my custom Analyser class,
like:
class GermanStemmingAnalyzer < Ferret::Analysis::Analyzer
include Ferret::Analysis
def initialize(stop_words = FULL_GERMAN_STOP_WORDS)
2007 Jan 11
5
stop words in query
Hello all,
Quick question: I'm using AAF and the following custom analyzer:
class StemmedAnalyzer < Ferret::Analysis::Analyzer
  include Ferret::Analysis
  def initialize(stop_words = ENGLISH_STOP_WORDS)
    @stop_words = stop_words
  end
  def token_stream(field, str)
    StemFilter.new(StopFilter.new(LowerCaseFilter.new(StandardTokenizer.new(str)),
                                  @stop_words))
  end
However when
2007 Sep 20
5
Ferret DRB, UTF-8, Mongrel
I have spent days trying to figure out how to get UTF-8 working with my
site.
Here's my environment:
Linux version 2.6.16.29-xen_3.0.3.0
Ruby 1.8.4 (2005-12-24) [i386-linux]
Rails 1.2.3
mongrel (1.0.1)
mongrel_cluster (1.0.2, 0.2.1)
ferret (0.11.4)
acts_as_ferret stable plugin
Ferret DRB server
When I don't use an analyzer with my acts_as_ferret declaration,
everything works
2007 May 23
6
Accented characters
Hello,
I want to clean up accented characters in my index, using acts_as_ferret
in a Rails project. I searched this forum, and found the best solution
is to use an analyser.
I created something like this:
class PortugueseAnalyzer
include Ferret::Analysis
MAPPING = {
2009 Apr 09
4
Weird analyzer issue with the word 'fly'
Hi all
I'm using a_a_f in Rails with a StemmingAnalyzer, in the index and in my
search. I got the idea from this topic:
http://www.ruby-forum.com/topic/80178
I'm having a problem with some search terms. I narrowed one of them
down to the inclusion of the word 'fly'. Can anyone give me any clues
as to what might be happening, or even how I can investigate?
2006 Dec 08
4
Using custom stem analyzer giving mongrel errors
I'm using the custom stem analyzer:
require 'rubygems'
require 'ferret'
include Ferret
module Ferret::Analysis
  class FerretAnalyzer
    def initialize(stop_words = FULL_ENGLISH_STOP_WORDS)
      @stop_words = stop_words
    end
    def token_stream(field, text)
      StemFilter.new(StopFilter.new(LowerCaseFilter.new(StandardTokenizer.new(text)),
2007 Nov 13
8
acts_as_ferret : cannot use a customized Analyzer (as indicated in the AdvancedUsageNotes)
Hi all,
I cannot make aaf (rev. 220) use my custom analyzer, despite following the
indications @
http://projects.jkraemer.net/acts_as_ferret/wiki/AdvancedUsage
To pinpoint the problem, I created a model + a simple analyzer with 2 stop
words : "fax" and "gsm".
test 1 : model.rebuild_index + model.find_by_contents("fax") # fax is a
stop word.
=> I get a
2006 Dec 06
10
Stem Analyzer
Hi all,
I am trying to implement a search that will use the Stem Analyzer. I
added the Stem Analyzer from the examples shown in another post
http://ruby-forum.com/topic/80178#147014
module Ferret::Analysis
  class StemmingAnalyzer
    def token_stream(field, text)
      StemFilter.new(StandardTokenizer.new(text))
    end
  end
end
The problem with the Stem analyzer is that when I search for a
2008 Nov 06
1
How to get the length of an UTF-8 string
Hi there,
I want to get the length of a UTF-8 string which contains
some Japanese characters (let's say, rstr) in the R language.
I tried using nchar(rstr) to get its length; however, it returns
NA because the string contains some multi-byte characters.
Are there any alternatives that return the length of this rstr?
Any suggestions are appreciated.
Long
2007 Mar 20
2
Strange Results For Term Frequencies
I would like to thank all the people who have contributed to this very
fine project. Great work!
I've encountered some strange results while examining the term frequency
of one of my indexed documents. The indexed terms seem to vary for the
very same document depending on the presence or absence of completely
unrelated operations in the code, so the resulting term frequency
changes, too.
2007 Mar 23
5
Any chance to get 0.11.3 on windows soon ?
Hi,
I'm working on a Ferret-based application which indexes content in all
European languages. Thus, I have to deal with those funny European
characters.
After googling a bit, I decided to move on with a custom European
analyzer based on MappingFilter, as suggested in the Ferret rdoc.
Everything works fine with Ferret 0.11.3 on Mac OS X.
But this application needs to run on both
2006 Oct 19
2
How to deal with accentuated chars in 0.10.8?
I'm starting to use Ferret and acts_as_ferret.
I need to use something like EuropeanAnalyzer
(http://olivier.liquid-concept.com/fr/pages/2006_acts_as_ferret_accentuated_chars).
For example, if the user searches for "gonzalez", the search should find documents that
contain the term "gonzález".
The EuropeanAnalyzer is based on Ferret::Analysis::TokenFilter,
2004 Feb 17
0
Programming a VFS module
############################# (originally in German):
Hello,
is there anyone who could perhaps explain the following text to me a bit
better?
I took this from the Samba Developers Guide.
Why am I doing this?
I have a problem:
Every time a user tries to move or rename a folder in the share
that is not empty, I want
an error message on the client
2005 Dec 02
1
Compile error on FreeBSD 4.10 gcc 2.95.4
FYI, I tried installing Ferret on my FreeBSD virtual server and got this:
retango# gem install ferret --include-dependencies
Attempting local installation of 'ferret'
Local gem file not found: ferret*.gem
Attempting remote installation of 'ferret'
Updating Gem source index for: http://gems.rubyforge.org
Building native extensions. This could take a while...