search for: termvector

Displaying 10 results from an estimated 10 matches for "termvector".

2006 May 26
8
Comparing two documents in the index
I want to compare two documents in the index (i.e. retrieve the cosine similarity/score between the two documents' term vectors). Is this possible using the standard Ferret functionality? Thanks in advance, Jeroen Bulters -- Posted via http://www.ruby-forum.com/.
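For reference, the cosine similarity the question asks about can be sketched in plain Ruby. This is a minimal sketch that makes no use of Ferret's API: `term_frequencies` and `cosine_similarity` are hypothetical helper names, and the term vectors are rebuilt from raw text with a naive tokenizer rather than read from an index.

```ruby
# Build a term-frequency vector (term => count) from plain text.
# Hypothetical helper: a real index would supply stored term vectors instead.
def term_frequencies(text)
  text.downcase.scan(/[a-z0-9]+/).each_with_object(Hash.new(0)) do |term, tf|
    tf[term] += 1
  end
end

# Cosine similarity between two term => count hashes.
# Returns 0.0 when either vector is empty.
def cosine_similarity(a, b)
  dot    = a.sum { |term, count| count * b.fetch(term, 0) }
  norm_a = Math.sqrt(a.values.sum { |c| c * c })
  norm_b = Math.sqrt(b.values.sum { |c| c * c })
  return 0.0 if norm_a.zero? || norm_b.zero?

  dot / (norm_a * norm_b)
end

doc_a = term_frequencies("Ferret is a search library for Ruby")
doc_b = term_frequencies("Ruby has a search library called Ferret")
puts cosine_similarity(doc_a, doc_b)
```

The score ranges from 0.0 (no shared terms) to 1.0 (identical term distributions). Weighting the counts, for example by inverse document frequency, is left out of this sketch.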
2007 Mar 28
4
retrieving search result positions
Hi, I'm considering using Ferret in v2 of Weft QDA, a wxruby desktop application for textual analysis in social science. Ferret seems a very impressive package that meets and exceeds my requirements, but I can't find how to retrieve specific details about the results. I'd like to be able to run fairly simple queries. I then need to look at each term match, and get
2006 Aug 01
5
Per field boost values - possible? working?
...in the same index. Here is some console code to demonstrate what I am actually doing >> include Ferret::Document => Object >> doc = Document.new => Document { } >> doc << Field.new(:name, "Business Search", Field::Store::YES, Field::Index::TOKENIZED, Field::TermVector::NO, false, 2.0) => nil >> doc << Field.new("physical_address", "New Zealand", Field::Store::YES, Field::Index::TOKENIZED, Field::TermVector::NO, false, 1.0) => nil >> doc => Document { stored/uncompressed,indexed,tokenized,<name:Business Search&g...
2005 Nov 26
3
Several questions about Ferret.
...ndex.optimize end def to_document doc = Document.new doc << Field.new('id', self.id.to_s, Field::Store::YES, Field::Index::UNTOKENIZED) doc << Field.new('body_en', self.body_en, Field::Store::YES, Field::Index::TOKENIZED, Field::TermVector::NO, false, 1.0) doc << Field.new('title_en', self.title_en, Field::Store::YES, Field::Index::TOKENIZED, Field::TermVector::NO, false, 3.0) -- anatol (http://pomozov.info) -------------- next part -------------- An HTML attachment was scrubbed... URL: http://rubyforg...
2006 Jul 31
16
Sorting performance
I'm using acts_as_ferret to index one of my Rails models. Right after I start the app, the first request that orders by some Ferret field takes a very long time. Subsequent ones seem to be fast. I guess some caching is going on. Any tips on solving this? Pedro.
2007 Dec 20
1
indexing tuples (example: "frog" => 123) as opposed to words
Hi, I need to map words in a document back to their original word IDs in my database. For example, if I had the sentence "I eat food" and I was searching for "food" I would obviously get the document back as a result. For my particular problem I need to not only get the document id but also the id of the match. Suppose my original sentence was actually
2007 Jun 04
5
Sorting and getting occurrences of search in hit
Is there any way to get the number of occurrences of the search terms in one hit? In a result I get the ferret_rank and ferret_score but not how many matches the search generated in the current record. I would also like to be able to sort by this when I search. /mattias -- Posted via http://www.ruby-forum.com/.
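One way to picture what is being asked is to count the matches after the search, on the text of each hit. The following is a minimal sketch in plain Ruby and not a Ferret feature: `occurrence_count` and `sort_by_occurrences` are hypothetical helpers, the tokenizer is naive, and the hits are assumed to be available as plain strings.

```ruby
# Count how many tokens in +text+ match any term of +query+.
# Hypothetical helper using a naive lowercase alphanumeric tokenizer.
def occurrence_count(text, query)
  terms  = query.downcase.scan(/[a-z0-9]+/)
  tokens = text.downcase.scan(/[a-z0-9]+/)
  tokens.count { |token| terms.include?(token) }
end

# Order hit texts so the ones with the most matches come first.
def sort_by_occurrences(texts, query)
  texts.sort_by { |text| -occurrence_count(text, query) }
end

hits = ["Ferret is fast", "Ferret indexes; Ferret searches", "No match here"]
sort_by_occurrences(hits, "ferret").each do |text|
  puts "#{occurrence_count(text, 'ferret')}  #{text}"
end
```

Counting after the fact only works when the field contents are stored or otherwise retrievable, and it re-tokenizes every hit, so it suits small result sets.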
2007 Jun 12
5
index browser inconsistent with IndexReader
Hi, we have an index of around 1M web pages as part of our web app. The app uses Ferret by way of RDig to perform searches. We have noticed anecdotally that some searches don't work the way we thought they should, as if documents were missing from the index. Yesterday we came upon a concrete instance of this. Our documents have several fields, one of which is called :keywords and
2006 Jun 04
20
Proposal of some radical changes to API
...ve Ferret's API. The first part that needs to be improved is the Document API. It's annoying having to type all the attributes to initialize a field just to change the boost. So; field = Field.new(:name, "data...", Field::Store::YES, Field::Index::TOKENIZED, Field::TermVector::NO, false, 5.0) would become; field = Field.new(:name, "data...", :index => Field::Index::TOKENIZED, :boost => 5.0) It'd also be nice to replace the Parameter objects with symbols; field = Field.new(:name, "data...", :index => :tokenized, :boost =>...
2006 Aug 20
7
missing terms in index causing search errors
I am unable to find results for models when one or more of the terms are not being indexed. Let's suppose I index a User on the phrase "Ruby on Rails." If I then search using User.find_by_contents("Ruby on Rails") I get no results, since "on" is a common term and does not get indexed. Of course, User.find_by_contents("Ruby Rails") works just fine. I
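The behaviour described in this last post can be illustrated in plain Ruby. This is a minimal sketch and not Ferret's analyzer: `STOP_WORDS` is a hypothetical list and `index_terms` a hypothetical helper. It only shows that when common words are dropped at index time, the query has to go through the same filter for the remaining terms to line up.

```ruby
# Hypothetical stop-word list; a real analyzer ships its own.
STOP_WORDS = %w[a an and on or the].freeze

# Tokenize text and drop stop words, as an indexing analyzer might.
def index_terms(text)
  text.downcase.scan(/[a-z0-9]+/).reject { |term| STOP_WORDS.include?(term) }
end

indexed = index_terms("Ruby on Rails.")
puts indexed.inspect

# A raw query keeps the stop word and no longer lines up with the index;
# the same query passed through the same filter does.
raw_query      = "Ruby on Rails".downcase.split
filtered_query = index_terms("Ruby on Rails")
puts raw_query == indexed
puts filtered_query == indexed
```

The sketch ignores term positions, which also matter for phrase queries.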