Displaying 20 results from an estimated 500 matches similar to: "Possible Bug when Creating Indexes"
2006 Sep 05
4
No matches
The following script creates a search index and then searches it. I get
no results? Where am I going wrong?
Thanks.
-----------BEGIN SCRIPT----------------
require ''rubygems''
require ''ferret''
include Ferret
path = ''/tmp/myindex''
field_infos = Ferret::Index::FieldInfos.new()
field_infos.add_field(:name, :store => :yes, :index => :yes)
2006 Sep 25
0
Odd indexing issue
Hey Dave,
I just contributed $100 to the ferret donation box. My project is
earning no money yet (but hopefully will), for now I hope this helps you
out and covers me for asking stupid questions ;).
To get a distance sorted output, I am passing an array of the id field
from a ferret search through to mysql in a custom select statement.
SELECT ... id IN (#{ids.join(",")})
This has
2007 Jul 11
4
Query scoring - WTF?
Hi!
I thought I understood Ferret''s query scoring and how to tweak
results using boost values. What I currently experience however,
leaves me completely baffled.
Perhaps someone can shed some light on the scoring algorithm, because
asking Ferret to "explain" the score for a particular document isn''t
as informative as I thought. Actually, it confuses me even
2007 Feb 16
8
term vector blues
I have a lot of crashes when I try to use term vectors. Here''s an
example, which crashes pretty consistently. This problem seems to be
somewhat sensitive to platform... people on other OS''s and ruby versions
have reported no error. I have seen this with ferret 0.10.13 and 0.10.14
on debian stable using ruby 1.8.2, but I have observed the same problem
on various other systems as
2006 Nov 23
0
Two repeatable crash bugs in Ferret proper
Hi guys! Been reading this list for a while.
I have two repeatable Ferret crash bugs, both seg faults.
1. The first bug appears to seg fault Ferret when you use quotes in a search
argument (eg ''file_name:"file name"'')
2. The second bug appears to seg fault Ferret when you attempt to index text
with very long tokens (above 256 chars). It may have something to do with
2006 Dec 07
8
crash on repeated search
I have found another crash in ferret; this one just uses a regular
search. It''s similar to an issue reported by Matt Schnitz a while ago,
but unlike his, mine does not go away if I turn off omit_norms. It does
go away if I turn on the garbage collector more often, but I''m not sure
that''s a stable workaround under the circumstances.
This one isn''t a
2006 Sep 23
0
TermQuery problem
Hi,
Using the 0.10.4 gem under ruby 1.8.5 (2006-08-25) [i686-linux], I
get different results with a TermQuery and a search string. Namely,
using a search string seems to always work whereas using a TermQuery
often doesn''t return any entries.
For example:
> x=@i[450][:message_id]
=> "9e7db9110509070759732b21c4 at mail.gmail.com"
>
2006 Sep 18
16
Dynamic fields and AAF
Hi,
I have a model which has properties, these are your standard name/value
pairs, but also have attributes that affect how I want to store them in
ferret. I was using 0.9.5 with 0.2 of aaf, which seemed fine, I just
copied and pasted (yes, I know, ick) the to_doc method and added code to
iterate though the properties that that model had, and add relavent
fields to the document.
It seems
2006 Sep 22
2
Searching untokenized fields
Hi ..
I tried to exclude certain objects from my search, by adding appropriate
term queries ..
i = Ferret::Index::Index.new
i.field_infos.add_field(:type, :index => :untokenized, :term_vector => :no)
i << {:type => "Movie", :name => "Indiana" }
i << {:type => "Movie", :name => "Forrest" }
i << {:type =>
2006 Aug 23
4
Ferret 0.10 and Fields
Hey ...
I just tried to convert my code to 0.10 .. But i''m currently not sure
how to use fields..
i really like some of the new api.. its leaner and i like the fact that
these strange consts are gone (like
Ferret::Search::BooleanClause::Occur::MUST) ..
I see that you''re now having Ferret::Index::FieldInfo to describe the
fields of the index.. thats good.. and i now see that
2007 Jun 12
5
index browser inconsistent with IndexReader
Hi,
We have an index of around 1M web pages as part of our web app. The
app uses ferret by way of RDig to perform searches. We have noticed
anecdotally that some searches don''t work the way we thought they
should, as if documents were missing from the index. Yesterday we
came upon a concrete instance of this.
Our documents have several fields, one of which is called :keywords
and
2007 Mar 01
2
FerretHash
Dave, thank you so much for the 0.11 release(s). You have solved many
problems for me. As part of my appreciation for your good works, I am
offering up for public consideration a silly little class that I wrote.
(Code is below.) This class offers a simplified Hash-like interface to
(a very restricted subset of) Ferret. Hence I call it FerretHash.
FerretHash comes with its very own pet Ferret
2007 Apr 10
8
ferret-0.11.4-mswin32 not compatible with Ruby1.8.4
Just a quick note for future reference - at least for me, ferret won''t
work on Ruby 1.8.4.
gem install ferret
Successfully installed ferret-0.11.4-mswin32
ruby -v
ruby 1.8.4 (2005-12-24) [i386-mswin32]
irb
irb(main):001:0> require ''ferret''
A windows error message box appears -
ruby.exe - Entry Point Not Found
The procedure entry point rb_w32_write could not be
2006 Oct 07
5
How to proceed with incorporating Ferret?
Hi,
I''ve listened in to this mail list for quite a while now but not
doing anything with Ferret until I was ready to incorporate it. I''ve
used Lucene for years, but not Ferret.
I downloaded and installed the ''bleeding edge'' version (lets call it
0.10.9.1). There appears to be a significant re-working of the API
happening. It all looks good. But there
2007 Mar 09
5
memory leak in index build?
I have a script (below) which attempts to make an index out of all the
man pages on my system. It takes a while, mostly because it runs man
over and over... but anyway, as time goes on the memory usage goes up
and up and never down. Eventually, it runs out of ram and just starts
thrashing up the swap space, pretty much grinding to a halt.
The workaround would seem to be to index documents in
2006 Sep 26
4
Some documents not found
I''m a ferret newbie, so hopefully I''m missing something simple :)
I am using ferret to index data about 36,000 products from a MySQL
database. The index has one document for each product, with these
important fields:
id: the id (unique) of the product record in the database
content: a concatenation of several bits of information from the product
and associated records
I
2007 Feb 26
4
Ferret 0.11.0 tests segfault
I have an important segfault when I create the index (via
Ferret::Index::FieldInfos#create_index).
I decided to run the tests, this is what I have :
$> ruby test_all.rb
Loading once
Loaded suite test_all
Started
....................EEEEEEEE./unit/../unit/index/../../unit/store/../../unit/analysis/../../unit/utils/../../unit/query_parser/../../unit/search/tc_filter.rb:11:
[BUG] Segmentation
2007 Aug 03
2
can''t search for OR (as in the state)
I''m trying to search a Model by the state field using Acts As Ferret.
The query for this is ''+state:NY'' (substitute state abbreviation for
NY). This works find however ''+state:OR'' returns nothing, though just
''portland'' will pull up matches within that state.
I''m pretty sure it''s reading OR as an or conditional
2006 Sep 09
2
search_each segmentation fault and parser anomoly
The included test script turned up the following anomolies (run
against Ferret 0.10.3, but had same problems with 0.10.2):
1. When the content word is not in the index the inclusion of a
wildcard file term causes search_each to throw a segmentation
fault.
$ ./test.rb zzz file:*.txt
query: +content:zzz +file:*.txt
./test.rb:28: [BUG] Segmentation fault
ruby 1.8.4 (2005-12-24)
2006 Nov 22
2
crash while retrieving term vectors
This program reliably crashes for me (usually a segfault):
require ''rubygems''
require ''ferret''
reader=Ferret::Index::IndexReader.new ARGV
fields=reader.field_infos.fields
reader.max_doc.times{|n|
fields.each{|field|
reader.term_vector(n,field)
} unless reader.deleted?(n)
print "."; STDOUT.flush
}
As you can see, it just goes through