Displaying 11 results from an estimated 11 matches for "doc_freq".
2007 Mar 04
5
Getting non-stemmed terms from IndexReader
I need to get a set of terms being indexed using Ferret. I used
IndexReader.terms and it returns a list of TermEnum nicely. The only
problem is that my analyzer includes a stemming filter.
So now, the terms I''m getting back are all stemmed. Is there anyway to
get the original unstemmed terms back from the index somehow? Thanks.
--
Posted via http://www.ruby-forum.com/.
2013 Oct 30
2
Lucene 3.6.2 backend for xapian (#25)
[Replying to xapian-devel, as I think a wider audience would be useful]
On Mon, Oct 21, 2013 at 11:24:51PM +0800, jiangwen jiang wrote:
> yes, it's less efficient. Lucene database has multiple segments, each
> segment can treat as a independent database. The same term may exists in >=
> 1 segments.
Sorry for taking a while to respond - I've been both busy and mulling
this
2006 Mar 31
3
undefined method `<=>'' for :id:Symbol
....9.0/lib/ferret/index/term_infos_io.rb:263:in
`get_index_offset''
from
c:/ruby/lib/ruby/gems/1.8/gems/ferret-0.9.0/lib/ferret/index/term_infos_io.rb:162:in
`get_term_info''
from
c:/ruby/lib/ruby/gems/1.8/gems/ferret-0.9.0/lib/ferret/index/segment_reader.rb:176:in
`doc_freq''
from
c:/ruby/lib/ruby/gems/1.8/gems/ferret-0.9.0/lib/ferret/search/index_searcher.rb:47:in
`doc_freq''
from
c:/ruby/lib/ruby/gems/1.8/gems/ferret-0.9.0/lib/ferret/search/term_query.rb:13:in
`initialize''
from
c:/ruby/lib/ruby/gems/1.8/gems/ferret-...
2007 Feb 03
2
Boost Sorting with Acts_as_ferret?
...change the
rounding of my score? I dunno.
Basque chicken scored 1.0
3291.866 = product of:
6583.732 = sum of:
674.6201 = weight(ingredients_without_brackets:chicken in 12670),
product of:
0.2805449 = query_weight(ingredients_without_brackets:chicken),
product of:
3.13109 = idf(doc_freq=2145)
0.08959977 = query_norm
2404.677 = field_weight(ingredients_without_brackets:chicken in
12670), product of:
1.0 = tf(term_freq(ingredients_without_brackets:chicken)=1)
3.13109 = idf(doc_freq=2145)
768.0 = field_norm(field=ingredients_without_brackets,
doc...
2006 Jan 05
0
Java Lucene compatibility?
...from /usr/lib/ruby/site_ruby/1.8/ferret/index/
term_infos_io.rb:285:in `scan_for_term_info''
from /usr/lib/ruby/site_ruby/1.8/ferret/index/
term_infos_io.rb:163:in `get_term_info''
from /usr/lib/ruby/site_ruby/1.8/ferret/index/
segment_reader.rb:176:in `doc_freq''
from /usr/lib/ruby/site_ruby/1.8/ferret/search/
index_searcher.rb:47:in `doc_freq''
from /usr/lib/ruby/site_ruby/1.8/ferret/search/term_query.rb:
13:in `initialize''
from /usr/lib/ruby/site_ruby/1.8/ferret/search/term_query.rb:
99:in `new'...
2006 Oct 15
12
Very small scores for search results
...he following explanation (via code a added to
acts_as_ferret for debugging):
QUERY: id:building name:building
EXPLANATION of building: 8.438619e-42 = product of:
1.687724e-41 = weight(name:building in 3), product of:
0.6125279 = query_weight(name:building), product of:
2.386294 = idf(doc_freq=1)
0.2566858 = query_norm
2.755373e-41 = field_weight(name:building in 3), product of:
1.0 = tf(term_freq(name:building)=1)
2.386294 = idf(doc_freq=1)
1.15467e-41 = field_norm(field=name, doc=3)
0.5 = coord(1/2)
Note the tiny score of field_norm which is throwing the...
2008 Mar 05
0
Index Searcher Causes GC Memory Error: "irb: double free or corruption"
...ield_value}
fvh = field_index.highlight(query, 0, options.merge({ :field => :keywords }) )
return fvh
rescue Exception => e
puts "** IndexSearch.highlight(''#{query}''): #{e.message}" # << e.backtrace.join("\n")
return field_value
end
def doc_freq(field, term)
self.searcher.doc_freq(field, term)
end
#############################
# Self Util methods
def reader
self.searcher.reader
end
def size
if reader
total_size = reader.num_docs
else
total_size = 0
self.sub_searchers.each do |sub_searcher|
total_size += sub_se...
2006 Sep 07
7
counting occurences of words in the result set
Hello, I need to be able to count the occurences of certain terms in the
reults.
Currently my setup is Ferret 0.10.1 aaf bleeding edge.
results = VoObject.find_by_contents(query,:offset=>page, :limit=>
20,:sort => sort_fields)
I use results.total_hits for pagination. This all works really nicely.
However i need to be able to know how many occurences of certain
predefined terms occur
2007 Nov 05
1
Segmentation Fault in more_like_this.rb
I''ve been seeing some core dumps coming from ferret_server:
acts_as_ferret/lib/more_like_this.rb:170: [BUG] Segmentation fault
ruby 1.8.6 (2007-03-13) [i386-freebsd6]
I''m running the latest build of ferret (0.11.4-rc5). Line 170 in
more_like_this.rb is:
freq = reader.doc_freq(field_name, word)
which is calling into the ferret C code (if I''m reading this correctly).
Is there anything I can do to get you more information, or help track
down this problem?
Thanks.
--
Peter Jones
pmade inc. - http://pmade.com
2006 May 15
16
Ferret not able to read a Lucene Index?
Hi all,
Having problems trying to get Ferret to read an index generated by
Lucene.
Am I right in thinking Ferret should be able to read a Lucene generated
index no problem?
Using the code snippets detailed in
http://www.ruby-forum.com/topic/64099#new
Any advice gratefully received.
Many Thanks,
Steven
--
Posted via http://www.ruby-forum.com/.
2007 Apr 28
6
Determine how many documents a term occurs in
Is there a fast way to determine how many documents a term occurs in,
besides iterating through every document with TermDocEnum?
--
Best regards,
Stian Gryt?yr