Displaying 20 results from an estimated 1000 matches similar to: "crash while retrieving term vectors"
2007 Feb 16
8
term vector blues
I have a lot of crashes when I try to use term vectors. Here''s an
example, which crashes pretty consistently. This problem seems to be
somewhat sensitive to platform... people on other OS''s and ruby versions
have reported no error. I have seen this with ferret 0.10.13 and 0.10.14
on debian stable using ruby 1.8.2, but I have observed the same problem
on various other systems as
2007 Jun 12
5
index browser inconsistent with IndexReader
Hi,
We have an index of around 1M web pages as part of our web app. The
app uses ferret by way of RDig to perform searches. We have noticed
anecdotally that some searches don''t work the way we thought they
should, as if documents were missing from the index. Yesterday we
came upon a concrete instance of this.
Our documents have several fields, one of which is called :keywords
and
2007 Mar 09
5
memory leak in index build?
I have a script (below) which attempts to make an index out of all the
man pages on my system. It takes a while, mostly because it runs man
over and over... but anyway, as time goes on the memory usage goes up
and up and never down. Eventually, it runs out of ram and just starts
thrashing up the swap space, pretty much grinding to a halt.
The workaround would seem to be to index documents in
2007 Mar 01
2
FerretHash
Dave, thank you so much for the 0.11 release(s). You have solved many
problems for me. As part of my appreciation for your good works, I am
offering up for public consideration a silly little class that I wrote.
(Code is below.) This class offers a simplified Hash-like interface to
(a very restricted subset of) Ferret. Hence I call it FerretHash.
FerretHash comes with its very own pet Ferret
2007 Apr 10
8
ferret-0.11.4-mswin32 not compatible with Ruby1.8.4
Just a quick note for future reference - at least for me, ferret won''t
work on Ruby 1.8.4.
gem install ferret
Successfully installed ferret-0.11.4-mswin32
ruby -v
ruby 1.8.4 (2005-12-24) [i386-mswin32]
irb
irb(main):001:0> require ''ferret''
A windows error message box appears -
ruby.exe - Entry Point Not Found
The procedure entry point rb_w32_write could not be
2006 Dec 07
8
crash on repeated search
I have found another crash in ferret; this one just uses a regular
search. It''s similar to an issue reported by Matt Schnitz a while ago,
but unlike his, mine does not go away if I turn off omit_norms. It does
go away if I turn on the garbage collector more often, but I''m not sure
that''s a stable workaround under the circumstances.
This one isn''t a
2007 Jan 01
2
Possible Bug when Creating Indexes
I''m running:
ferret (0.10.9)
ruby 1.8.5 (2006-08-25) [i386-mswin32]
on Windows XP(SP2)
When I create an index as follows:
field_infos = FieldInfos.new(:store => :yes, :term_vector => :no, :index
=> :yes)
field_infos.add_field(:id, :index => :untokenized)
field_infos.add_field(:subject)
field_infos.add_field(:author)
field_infos.add_field(:tags, :store => :no)
index =
2007 Apr 03
2
How can I count frequency of terms in a document?
Hi, there.
I need some help.
Is there a way to count frequencies of terms in a document on Ferret?
I know that Ferret has IndexReader#terms_docs_for method which counts
all documents.
I need to count frequencies of terms in a specific document.
Some way??
--
Posted via http://www.ruby-forum.com/.
2006 Sep 18
16
Dynamic fields and AAF
Hi,
I have a model which has properties, these are your standard name/value
pairs, but also have attributes that affect how I want to store them in
ferret. I was using 0.9.5 with 0.2 of aaf, which seemed fine, I just
copied and pasted (yes, I know, ick) the to_doc method and added code to
iterate though the properties that that model had, and add relavent
fields to the document.
It seems
2006 Oct 16
3
seg faults and problems with new version
Hi all,
first off, it''s 2AM and I''m not thinking properly, so please
forgive me if this one''s easy, but I just need to get this going.
First problem, using 0.9.6 on all of our development machines,
works great, then we move it to a server running x86_64 linux and it
segfaults as soon as it tries to create an Index. I''ve tried
rebuilding with different
2006 Oct 03
4
newbie question
Hi,
I''m new to using ferret (and fairly new to ruby/rails) and I''m having a
problem I can''t fathom. Sorry for the long post ...
I have a test which passes
require ''rubygems''
require ''ferret''
include Ferret
require ''test/unit''
class CompanyTest < Test::Unit::TestCase
def test_index
puts ''running
2006 Oct 03
4
newbie question
Hi,
I''m new to using ferret (and fairly new to ruby/rails) and I''m having a
problem I can''t fathom. Sorry for the long post ...
I have a test which passes
require ''rubygems''
require ''ferret''
include Ferret
require ''test/unit''
class CompanyTest < Test::Unit::TestCase
def test_index
puts ''running
2007 Apr 09
5
IndexReader#terms for all fields?
Is it possible to query the index for a TermEnum for all fields in
the index instead of just ?
Thanks,
John
2006 Sep 22
2
Searching untokenized fields
Hi ..
I tried to exclude certain objects from my search, by adding appropriate
term queries ..
i = Ferret::Index::Index.new
i.field_infos.add_field(:type, :index => :untokenized, :term_vector => :no)
i << {:type => "Movie", :name => "Indiana" }
i << {:type => "Movie", :name => "Forrest" }
i << {:type =>
2007 Mar 28
4
retrieving search result positions
Hi
I''m considering using Ferret in v2 of Weft QDA, a wxruby desktop
application for textual analysis in social science.
Ferret seems a very impressive package that meets and exceeds my
requirements, but I can''t find how to retrieve specific details about
the results.
I''d like to be able to run fairly simple queries. I then need to look at
each term match, and get
2006 Sep 14
2
Possiible Bug ? indexWriter#doc_count countsdeleted docs after #commit
Hi David,
> Deleted documents don''t get deleted until commit is called
Ok, but FYI, my experiments show that #commit doesn''t affect #doc_count,
even across ruby sessions.
On a different note, I''d like to request a variation of #add_document
which returns the doc_id of the document added, as opposed to self.
I''m trying to track down an issue with a large
2006 Sep 22
3
Error with :create => true and existing index
I implemented a "reindex" command which simply creates an IndexWriter
with :create => true for a prexisting index.
The "reindexing" seems to start out ok, with several thousand docs
added, then Ferret throws an exception:
IO Error occured: couldn''t rename file "index\_0.tmp" to "index\_0.cfs":
<File exists>
I guess that _0.cfs is held
2006 Sep 14
1
Possiible Bug ? indexWriter#doc_count counts deleted docs after #commit
I''m playing with "updating" docs in my index, and I think I''ve found bug
with IndexWriter counting deleted docs. Script and output follow:
=====
require ''rubygems''
require ''ferret''
p Ferret::VERSION
@doc = {:id => ''44'', :name => ''fred'', :email => ''abc at
2006 Aug 23
4
Ferret 0.10 and Fields
Hey ...
I just tried to convert my code to 0.10 .. But i''m currently not sure
how to use fields..
i really like some of the new api.. its leaner and i like the fact that
these strange consts are gone (like
Ferret::Search::BooleanClause::Occur::MUST) ..
I see that you''re now having Ferret::Index::FieldInfo to describe the
fields of the index.. thats good.. and i now see that
2006 Jun 22
3
Partition results based on field
Hello all
I''m using Ferret for a site wide search where I have several kinds of
(similar) objects in a central index (using a "type" field containing
the class name). This works great, and I can search all objects with one
query.
What I''d like to do now is to limit the results so that there will be a
maximum of 10 (or 5 or whatever) results for each type.. I