Displaying 10 results from an estimated 10 matches for "termvector".
Did you mean:
term_vector
2006 May 26
8
Comparing two documents in the index
I want to compare two documents in the index (i.e. retrieve the cosine
similarity/score between two documents term-vector''s). Is this possible
using the standard Ferret functionality?
Thanks in advance,
Jeroen Bulters
--
Posted via http://www.ruby-forum.com/.
2007 Mar 28
4
retrieving search result positions
Hi
I''m considering using Ferret in v2 of Weft QDA, a wxruby desktop
application for textual analysis in social science.
Ferret seems a very impressive package that meets and exceeds my
requirements, but I can''t find how to retrieve specific details about
the results.
I''d like to be able to run fairly simple queries. I then need to look at
each term match, and get
2006 Aug 01
5
Per field boost values - possible? working?
...in the same index.
Here is some console code to demonstrate what I am actually doing
>> include Ferret::Document
=> Object
>> doc = Document.new
=> Document {
}
>> doc << Field.new(:name, "Business Search", Field::Store::YES, Field::Index::TOKENIZED, Field::TermVector::NO, false, 2.0)
=> nil
>> doc << Field.new("physical_address", "New Zealand", Field::Store::YES, Field::Index::TOKENIZED, Field::TermVector::NO, false, 1.0)
=> nil
>> doc
=> Document {
stored/uncompressed,indexed,tokenized,<name:Business Search&g...
2005 Nov 26
3
Several questions about Ferret.
...ndex.optimize
end
def to_document
doc = Document.new
doc << Field.new(''id'', self.id.to_s, Field::Store::YES,
Field::Index::UNTOKENIZED)
doc << Field.new(''body_en'', self.body_en, Field::Store::YES,
Field::Index::TOKENIZED, Field::TermVector::NO, false, 1.0)
doc << Field.new(''title_en'', self.title_en, Field::Store::YES,
Field::Index::TOKENIZED, Field::TermVector::NO, false, 3.0)
--
anatol (http://pomozov.info)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://rubyforg...
2006 Jul 31
16
Sorting performance
I''m using acts_as_ferret to index one of my rails models. Right after I
start the app the first request that orders by some ferret field will
take very long. Subsequent ones seem to be fast. I guess some caching is
going on. Any tips on solving this?
Pedro.
2007 Dec 20
1
indexing tuples (example: "frog" => 123) as opposed to words
Hi,
I need to map words in a document back to there original word id''s in my
database. For example, if I had the sentence "I eat food" and I was
searching for "food" I would obviously get the document back as a
result. For my particular problem I need to not only get the document id
but also the id of the match.
Suppose my original sentence was actually
2007 Jun 04
5
Sorting and getting occurrences of search in hit
Is there any way you could get the number of occurrences of the search
in one hit?
In a result I get the ferret_rank and ferret_score but not how many hits
the search generated in the current record.
I would also like to be able to sort after this when I search.
/mattias
--
Posted via http://www.ruby-forum.com/.
2007 Jun 12
5
index browser inconsistent with IndexReader
Hi,
We have an index of around 1M web pages as part of our web app. The
app uses ferret by way of RDig to perform searches. We have noticed
anecdotally that some searches don''t work the way we thought they
should, as if documents were missing from the index. Yesterday we
came upon a concrete instance of this.
Our documents have several fields, one of which is called :keywords
and
2006 Jun 04
20
Proposal of some radical changes to API
...ve Ferret''s API. The first part that needs to be
improved is the Document API. It''s annoying having to type all the
attributes to initialize a field just to change the boost. So;
field = Field.new(:name, "data...", Field::Store::YES,
Field::Index::TOKENIZED, Field::TermVector::NO, false, 5.0)
would become;
field = Field.new(:name, "data...", :index =>
Field::Index::TOKENIZED, :boost => 5.0)
It''d also be nice to replace the Parameter objects with symbols;
field = Field.new(:name, "data...", :index => :tokenized, :boost =>...
2006 Aug 20
7
missing terms in index causing search errors
I am unable to find results for models when one or more of the terms are
not being indexed.
Lets suppose I index a User on the phrase "Ruby on Rails." If I then
search using User.find_by_contents("Ruby on Rails") I get no results,
since "or" is a common term and does not get indexed. Of course,
User.find_by_contents("Ruby Rails") works just fine.
I