Displaying 20 results from an estimated 3000 matches similar to: "tweaking minimum word length?"
2006 Sep 09
3
Per field analyzer
Is there a way to add a per-field analyzer? I can't seem to find a way to do that.
Thanks
--
Kent
---
http://www.datanoise.com
2007 Apr 06
16
[VOTE] Should stop-words be filtered by default?
Hey folks,
A lot of confusion has been caused by having stop-words filtered by
the default analyzer. There have been a few suggestions to remove this
feature so I thought I'd put it to a vote. Making this change would
not be backwards compatible and would require users to either rebuild
their indexes or change their code to keep the same stop-words
settings. However, it would save a lot
2007 Jan 19
9
Double-quoted query with "and" fails.
Hi,
We're using Ferret 0.9.4 and we've observed the following behavior.
Searching for 'fieldname: foo and bar' works fine while 'fieldname:
"foo and bar"' doesn't return any results. Is there a way to make
ferret recognize the 'and' inside the query as a search term and not
an operator? (I hope I got the
2006 Jul 07
4
How to add Asia token analyzer to ferret simply?
Hi, David
Can you give me an example of how to add an analyzer to Ferret for Asian
languages?
My web application will have to support multi-language search, which
means, for example, that both Chinese and English will be searched through the
form.
Currently, I have decided to use the simple token principle, which means
that every Chinese character will be a token, although this is not so
well in some
2007 Mar 22
3
Noice words...
Hi
I use acts_as_ferret on an app that is in Danish and English. In
Danish, English words like "and" and "under" have meaning. Is it
possible to make Ferret search for these words? As it is now, a search
for "under" returns nothing even though I know the word is present in
the index.
Cheers
Mattias
2006 Sep 12
3
Querying against numeric fields? e.g. price:( >= min_price)
Using acts_as_ferret I'm trying to do a query like:
active:(true) title|body:(#{params[:s]}) product_price:( >=
#{params[:min]})
Where I want to return only the active products that contain the search
term in the title or body and have a minimum price >= params[:min]
I'm finding that even though I'm indexing the product price as an
integer (so no .00 to cause
2007 Mar 28
6
trouble with PerFieldAnalyzer
I'm having trouble with PerFieldAnalyzer (ferret version 0.10.14).
Script:
require 'rubygems'
require 'ferret'
require 'pp'
include Ferret::Analysis
include Ferret::Index
class TestAnalyzer
  def token_stream(field, input)
    pp field
    pp input
    LetterTokenizer.new(input)
  end
end
pfa =
2007 May 05
4
Stop words, fields, StandardAnalyzer quagmire
Hello,
I'm using: Ruby 1.8.6, Rails 1.2.3, ferret 0.11.4, acts_as_ferret from
svn stable.
I've had quite a day wrestling with trying to remove the use of
stopwords. The problem was that when searching for words like "no" or
"the", no results were found. I found a confusing behavior that
has taken me some time to figure out, and I hope sharing it
2007 Apr 08
10
Ferret and non latin characters support
I've successfully installed ferret and acts_as_ferret and have no
problem with utf-8 for accented characters. It returns correct results
for e.g. français. My problem is with non-Latin characters (Persian,
in fact). I have tested different locales with no success on both Debian
and Mac. Any ideas?
(ferret 0.11.4, acts_as_ferret 0.4.0, rails 1.1.6)
--
Posted via http://www.ruby-forum.com/.
2007 Jul 14
1
performance bottleneck
I have my database in MySQL. I used Ferret to index a table with 10
million rows. When I limit the selection to an initial 1000 rows,
indexing takes 200 seconds, but for the whole table it ran for more than
four hours, after which I had to close my indexing application. I used the
StandardAnalyzer for it. There is no problem on the database side, as
retrieval of all the data in the table
2006 Jul 07
9
Search on data accross many tables, linked by belongs_to
I am using Ferret and acts_as_ferret as the search back-end for my Rails
project. I have a question about using acts_as_ferret on a main table
that is linked to other tables by foreign keys. Is there a way to
include the information linked by the belongs_to keyword in the search
results?
As an example, let's say I have a main table 'posts':
2006 Jul 05
3
Is there any schema of full-text search that support utf-8?
Is there any full-text search scheme that supports utf-8, especially
for Asian languages such as Chinese, Japanese, etc.?
Ferret/acts_as_ferret does not work when keywords in these languages are
searched, and it is also difficult to implement pagination, which needs
both the count of search results and an offset.
Very grateful!
2006 Aug 16
1
StandardAnalyzer not indexing "some"
Hi everybody,
In the basic setup acts_as_ferret uses a StandardAnalyzer. How come
it won't index the headline "some headline" with "some" and "headline"?
It only uses LetterTokenizer and LowerCaseFilter.
Thanks for your help.
Michael
2006 Apr 13
3
QueryParser doesn't use StandardAnalyzer correctly?
I am having a bit of a problem with my search queries not being parsed
correctly, it seems, and I wonder if anyone else has experienced this.
I have written an index using StandardAnalyzer for analysis. I want to
search that index by passing my user query through a QueryParser
instance which is also using a StandardAnalyzer. However the resultant
query does not seem to be a valid term query and
2007 Mar 13
2
index returns all results for specific queries
Hey all,
I'm getting some really weird results when searching documents. It
*seems* to be somehow related to the document format I'm using.
I wrote a small script to replicate it:
################
#!/usr/bin/ruby
require 'rubygems'
require 'ferret'
include Ferret
index = Index::Index.new(:path => '/tmp/fooindex', :key
2006 Oct 30
1
PerFieldAnalyzer and AAF
Hi All,
Does anyone know if you can use PerFieldAnalyzer with the
acts_as_ferret method? My goal is to index fields with different
analyzers for a class. Thanks in advance!
Miguel
2007 Jul 04
6
Inconsistent results when using wild card queries
We get some unexpected results when using wild card queries. We're using
aaf and Ferret 0.11.4.
For example, when searching on part of a colleague's name (kristofer)
and limiting it to a specific source_id:
Query: source_id:25 AND kri*
Result: 2 documents. Neither of them contains the word kristofer, but they
contain other matching words, such as "kring" and "kring?" (Swedish)
Query:
2006 Jul 18
10
searching with chinese chars
Hi all,
maybe not a Ferret question, but I assume someone here might have come across
this already.
I wrote a simple CGI app that adds docs into a Ferret index. The idea
is to test Asian-language input and searching.
The script that does the input seems to be OK. As David mentioned in reply to a
question I asked a little while ago, Ferret's index is agnostic, in the
sense that you can store anything in
2007 May 09
3
bug when assigning new analyzer?
require 'rubygems'
require 'ferret'
include Ferret
PATH = '/tmp/ferret_stopwords_test'
index = Index::IndexWriter.new(:path => PATH, :create => true)
index.analyzer = Analysis::StandardAnalyzer.new([])
index << {:title => 'a few good men', :language => 'en'}
index.analyzer =
2007 Nov 13
8
acts_as_ferret : cannot use a customized Analyzer (as indicated in the AdvancedUsageNotes)
Hi all,
I cannot make aaf (rev. 220) use my custom analyzer, despite following the
instructions at
http://projects.jkraemer.net/acts_as_ferret/wiki/AdvancedUsage
To pinpoint the problem, I created a model + a simple analyzer with 2 stop
words: "fax" and "gsm".
test 1: model.rebuild_index + model.find_by_contents("fax") # fax is a
stop word.
=> I get a