similar to: Ferret questions

Displaying 20 results from an estimated 8000 matches similar to: "Ferret questions"

2006 May 26
8
Comparing two documents in the index
I want to compare two documents in the index (i.e. retrieve the cosine similarity/score between two documents' term vectors). Is this possible using the standard Ferret functionality? Thanks in advance, Jeroen Bulters -- Posted via http://www.ruby-forum.com/.
2006 Jan 19
2
Simple Ferret Questions
I am trying to use Ferret for searching for users based on first and last name. In my index, I am adding the first_name, last_name, and a full_name which is basically "#{first_name} #{last_name}". I am searching the index using something like the following query: INDEX.search_each(%Q/first_name:#{query}* OR last_name:#{query}* OR full_name:#{query}*/) do |doc, score| The problem I am
2006 Feb 09
2
Finding related items (like latent semantic indexing)
I've been trying to use Classifier::LSI to provide a means of finding 'related items', where each item is a one line description of a product. Although on small samples the Classifier works great, it completely baulks on my current dataset of 3000 items. I've started to look at ferret this morning, following a post on the ruby mailing list. I'd
2006 Jan 27
2
Grouping results
I have a general question about using a Ferret/Lucene index for grouping results. I am not sure how much of the heavy lifting the index can do for me, so I would appreciate any input. I am using ferret to index some objects that have the following properties: url, image_url, price, tags (space separated tags), created_at. I would like to search the index for any documents that match a specific
2006 Aug 04
5
A couple of ferret 0.9.4 exceptions
Hi Dave, I am using ferret at my site http://gifthat.com and I just had a few exceptions pop up. I don't have a way to reproduce them, but my site was just listed on lifehacker.com and these issues have popped up under multiple concurrent users (only twice though, which I think isn't too bad). I am using two lighttpd instances both with read/write access to the index: 1) Error
2006 Sep 12
5
ferret / acts_as_ferret multiple server deployment
Has anyone deployed ferret & acts_as_ferret to a load balanced multiple server environment? If so, did you simply use a shared network index? I have a couple of ideas on how to deploy, but each has shortcomings and I'm hoping to find out if anyone else has deployed ferret in this manner. The application is simply load balanced between multiple servers running the same app for
2006 Feb 28
2
Most Popular Searches
Hi, I have an index where each document contains an untokenized 'url' field. I would like to query the index for the most popular urls. In SQL I would do this via a Group By clause. Is there anything in Ferret that will do something similar? I found this discussion that proposed a solution involving TermEnums:
2006 Apr 03
6
Installing Ferret locally on TextDrive
I would like to give the 0.9.0 version of Ferret a try on my application hosted on TextDrive. I am currently running on the 0.3.2 version there. Does anyone have any tips on installing it locally there? I know just enough about Ruby gems to get by... but I am thinking it could be as easy as passing a -i flag to specify the install location for ferret. Then, the only thing I am not sure about
2008 Jan 25
5
Ferret+Lucene Index
We use Nutch and Lucene for our heavy duty text analysis jobs but I'm trying to use Ferret to do some experiments. I understood that Ferret used the same index format as Lucene, but I cannot look into a Lucene index with Ferret and cannot read a Ferret index with Luke (the Lucene index browser). Am I doing something wrong or have the formats diverged?
2006 May 04
5
How to install Ferret to get the best performance
Hey all, After dabbling with ActiveSearch, we're coming back around to take another look at Ferret. ActiveSearch slowed to a crawl after indexing about 20k documents of 20 lines each. This time we may attempt to create multiple Ferret indexes (isolating each organization's data individually), since we eventually could have upwards of 20k documents for some
2007 Sep 01
2
Can lucene use index generated by ferret?
I found it quite fun to generate an index with Ferret. However, I have to use Lucene now. Can I use the index that I've already built with Ferret? P.S. Lucene 2.2.0 vs. Ferret 0.11.4. Thanks! -- Posted via http://www.ruby-forum.com/.
2007 Jul 18
3
Ferret doesn't work with Luke
Hi, Does anyone know why the indexes created by Ferret can't be opened by Luke (http://www.getopt.org/luke/)? When I do development with CLucene I use Luke all the time to understand what is going on in the index. It is especially useful when trying to diagnose analyzer issues. When I try to open a Ferret index with Luke I get the message "Invalid or corrupted index". I
2006 Jan 27
3
Using ID as Key
Hi, I followed the howto to use keys for documents: http://ferret.davebalmain.com/trac/wiki/HowTos#Howtousekeysfordocument If I add two documents with the same id, only one gets added to the index as expected. However, I have found the key and id do not match. So, attempting to access the index with the id does not work. For instance, when I run this search: INDEX.search_each(query) do
2007 Jan 05
7
Hitting Files per Directory Limits with Ferret?
Hey all! We've been using Ferret to great success these past six months. But recently we've tried adding many new ContentItems (the only thing being indexed by Ferret at the moment), and things came crashing to a halt. ferret gem: 0.10.9 acts_as_ferret plugin (not sure which version) How we're using the plugin: class ContentItem < ActiveRecord::Base acts_as_ferret
2006 May 15
16
Ferret not able to read a Lucene Index?
Hi all, Having problems trying to get Ferret to read an index generated by Lucene. Am I right in thinking Ferret should be able to read a Lucene generated index no problem? Using the code snippets detailed in http://www.ruby-forum.com/topic/64099#new Any advice gratefully received. Many Thanks, Steven -- Posted via http://www.ruby-forum.com/.
2007 Mar 01
13
ferret or not ferret?
Hi, I have to choose a search engine for a medium-to-large site with a lot of searches and inserts happening at the same time. Do you have any suggestions? I'm thinking about Ferret, but I read that it has some problems with this kind of "work" :( -- Posted via http://www.ruby-forum.com/.
2006 Jul 14
3
Scaling Ferret Beyond One Server
Hi Everyone, I was wondering if folks here have had experience scaling Ferret beyond a single server? Currently, we are running Ferret on the same physical server as its Rails front end (via acts_as_ferret), but it is already evident that we need a more scalable solution. How would you split up the tasks (via DRb perhaps?) between two or three servers? Shared disk, replicated Ferret index (?),
2006 Jan 02
11
aligning Ferret's IndexSearcher.search API with Lucene's
Recently I've been revisiting some of my search code. With a greater understanding of how Java Lucene implements its search methods, I realized that one level of abstraction is not present in the Ferret classes/methods. Here are the relevant method signatures: Ferret's search methods: in Ferret::Index::Index: search(query, options = {}) -> returns a TopDocs
2007 Dec 19
1
Payload support in ferret
Hello All: This is potentially a Ferret and Ruby newbie question. I RTFM-ed (the O'Reilly shortcut anyway) and didn't find anything obvious. Are there plans for Ferret to support the notion of "payloads" introduced in Lucene 2.2? Payloads are basically arbitrary metadata that can be stored for each term. For example, for each term I can store, say, font information for the term. (
2006 Mar 19
3
Ferret 0.9.0-alpha (port of Apache Lucene to pure ruby)
Hi Folks, I've just released version 0.9.0. This latest version of Ferret is an alpha release. I have removed the old C extension and Ferret is now running on a fully ported C library. This has allowed some huge performance improvements both with regard to memory and CPU usage. There will probably be a few portability issues to start with. It has been developed on Linux so it should