Displaying 20 results from an estimated 1000 matches similar to: "StandardAnalyzer not indexing "some""
2007 Jan 19
9
Double-quoted query with "and" fails.
Hi,
We're using Ferret 0.9.4 and we've observed the following behavior.
Searching for 'fieldname: foo and bar' works fine while 'fieldname:
"foo and bar"' doesn't return any results. Is there a way to make
ferret recognize the 'and' inside the query as a search term and not
an operator? (I hope I got the
2006 Apr 13
3
QueryParser doesn't use StandardAnalyzer correctly?
I am having a bit of a problem with my search queries being parsed
correctly it seems, and I wonder if anyone else has experienced this.
I have written an index using StandardAnalyzer for analysis. I want to
search that index by passing my user query through a QueryParser
instance which is also using a StandardAnalyzer. However the resultant
query does not seem to be a valid term query and
2006 Jul 18
4
Some basic questions
Hi, David and everyone,
I've had Ferret running fine in a production Rails application for a
while now. I haven't updated Ferret or really looked at the
Ferret-related code since probably January, but I recently started
thinking about trying out the latest version (we were using 0.3.2, I
think). I got the latest (0.9.4) and have noticed things break. In
particular, I used to
2007 May 05
4
Stop words, fields, StandardAnalyzer quagmire
Hello,
I'm using: Ruby 1.8.6, Rails 1.2.3, ferret 0.11.4, acts_as_ferret from
svn stable.
I've had quite a day wrestling with trying to remove the use of
stopwords. The problem was that when searching for words like "no" or
"the", no results were found. I found a confusing behavior that
has taken me some time to figure out, and I hope sharing it
2006 Jul 26
13
tweaking minimum word length?
Hi,
Can Ferret be configured to change the minimum word length of what it
indexes? Right now it seems to drop words 3 characters or less, but
I'd like to include words going down to 2 characters. How would I do
that?
Francis
2005 Nov 17
1
indexing source code
Hi again,
I'm using ferret to index source code - DamageControl will allow users
to search for text in source code.
Currently I'm using the default index with no custom analyzer (I'm
using the StandardAnalyzer). Do you have any recommendations about how
to write an analyzer that will index source code in a more 'optimal'
way? I.e. disregard common
2006 Sep 09
3
Per field analyzer
Is there a way to add a per-field analyzer? I can't seem to find a way to do that.
Thanks
--
Kent
---
http://www.datanoise.com
2007 Mar 28
6
trouble with PerFieldAnalyzer
I'm having trouble with PerFieldAnalyzer (ferret version 0.10.14).
Script:
require 'rubygems'
require 'ferret'
require 'pp'
include Ferret::Analysis
include Ferret::Index
class TestAnalyzer
  def token_stream field, input
    pp field
    pp input
    LetterTokenizer.new(input)
  end
end
pfa =
2006 Sep 23
8
svn problems
I can consistently segfault the 0.10.4 gem, so I'm trying to get the
subversion version working with hopes towards tracking the problem down.
I have a fresh SVN checkout but:
a) the version (in ferret.rb) claims to be 0.9.6; and
b) Ferret::Index::FieldInfos and a couple other classes are missing at
run time. It looks like this is because they're not exported in the C
2006 Aug 17
3
Ferret locks up when adding items to an index
I'm running Ferret 0.9.5 on a MacBook Pro (OS X 10.4.7) under
Locomotive 2.0.7.
I have a problem where Ferret is hanging when I try to add items to
the index. It doesn't happen with every object that's being indexed,
and I'm not sure what the objects in question have in common (they
are not all instances of the same ActiveRecord object). The process
2006 Aug 20
7
missing terms in index causing search errors
I am unable to find results for models when one or more of the terms are
not being indexed.
Let's suppose I index a User on the phrase "Ruby on Rails." If I then
search using User.find_by_contents("Ruby on Rails") I get no results,
since "on" is a common term and does not get indexed. Of course,
User.find_by_contents("Ruby Rails") works just fine.
I
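
The effect described in this snippet can be reproduced with a toy sketch in plain Ruby. This is not Ferret's code or API: the stop-word list, `analyze`, `build_index`, and both matcher functions are hypothetical names invented for illustration. It only shows why a phrase containing a dropped term can miss when the query terms are compared raw, and why running the query through the same analyzer as the index avoids the mismatch.

```ruby
require 'set'

# Hypothetical stop-word list for illustration only; not Ferret's actual list.
STOP_WORDS = Set.new(%w[a an and of on or the]).freeze

# Toy analyzer: lowercase, split on non-letters, drop stop words.
def analyze(text)
  text.downcase.scan(/[a-z]+/).reject { |t| STOP_WORDS.include?(t) }
end

# Toy index: document id => ordered array of indexed tokens.
def build_index(docs)
  docs.each_with_object({}) { |(id, text), idx| idx[id] = analyze(text) }
end

# Phrase match using the raw query terms (stop words are NOT removed).
def raw_phrase_match?(tokens, phrase)
  terms = phrase.downcase.scan(/[a-z]+/)
  return false if terms.empty?
  tokens.each_cons(terms.size).any? { |window| window == terms }
end

# Phrase match after passing the query through the same toy analyzer.
def analyzed_phrase_match?(tokens, phrase)
  terms = analyze(phrase)
  return false if terms.empty?
  tokens.each_cons(terms.size).any? { |window| window == terms }
end

INDEX = build_index(1 => 'Ruby on Rails')
```

In this sketch, `raw_phrase_match?(INDEX[1], 'Ruby on Rails')` fails because "on" was never indexed, while `analyzed_phrase_match?` succeeds because the query loses the same term the index did. Whether a real engine behaves this way depends on how its query parser and analyzer are configured.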
2007 May 09
3
bug when assigning new analyzer?
require 'rubygems'
require 'ferret'
include Ferret
PATH = '/tmp/ferret_stopwords_test'
index = Index::IndexWriter.new(:path => PATH, :create => true)
index.analyzer = Analysis::StandardAnalyzer.new([])
index << {:title => 'a few good men', :language => 'en'}
index.analyzer =
2007 Jun 25
4
Ignore apostrophes in words
Hi, I just started using ferret and the aaf plugin and it seems to work
quite nicely. However, my fields are very short (titles of music) and I
don't think many users will be typing in apostrophes when they are
looking for something. Right now, for a simple document such as "what
i've done" I'd like it to be indexed as "what ive done" instead. Right
2007 Apr 03
3
[Repost] Problem with url searching..
Hi all,
I posted this a few weeks ago but no one answered, and this feature is
REALLY important for us.
I have many objects with a url field, of course containing standard
URLs...
I'm trying to match them but I'm having problems with that.
Here's a little code showing what I would like to achieve:
require 'rubygems'
require
2007 Apr 06
16
[VOTE] Should stop-words be filtered by default?
Hey folks,
A lot of confusion has been caused by having stop-words filtered by
the default analyzer. There have been a few suggestions to remove this
feature so I thought I'd put it to a vote. Making this change would
not be backwards compatible and would require users to either rebuild
their indexes or change their code to keep the same stop-words
settings. However, it would save a lot
2007 Jul 14
1
performance bottleneck
I have my database in MySQL. I used ferret to index a table with 10
million rows. When I limit the selection to the first 1000 rows,
it takes 200 seconds, but for the whole table it ran for more than four hours,
after which I had to close my indexing application. I used the
StandardAnalyzer for it. There is no problem from the database side as
retrieval of all the data in the table
2006 Oct 24
2
Problem with stop words
I am seeing trouble with searches for 'you' not returning anything. It
appears that 'you' is a stop word to the standard analyzer:
require 'rubygems'
require 'ferret'
index = Ferret::I.new(:or_default => false)
index << 'you'
puts index.search('you')
returns
2006 Jun 16
2
indexing large tokens
Hi,
I'm using the StandardAnalyzer to build an index, and passing in Documents
that have Fields that contain large tokens (22+ characters) interspersed with
normal English words. This seems to cause the IndexWriter to slow to a
crawl. Is this a known issue, or am I doing something wrong?
If this is a known issue I don't have any problem just not indexing tokens
longer than a
2007 Nov 13
8
acts_as_ferret : cannot use a customized Analyzer (as indicated in the AdvancedUsageNotes)
Hi all,
I cannot make aaf (rev. 220) use my custom analyzer, despite following the
indications @
http://projects.jkraemer.net/acts_as_ferret/wiki/AdvancedUsage
To pinpoint the problem, I created a model + a simple analyzer with 2 stop
words : "fax" and "gsm".
test 1 : model.rebuild_index + model.find_by_contents("fax") # fax is a
stop word.
=> I get a
2007 Mar 13
2
index returns all results for specific queries
Hey all,
I'm getting some really weird results when searching documents. It
*seems* to be somehow related to the document format I'm using.
I wrote a small script to replicate it:
################
#!/usr/bin/ruby
require 'rubygems'
require 'ferret'
include Ferret
index = Index::Index.new(:path => '/tmp/fooindex', :key