similar to: Comparing two documents in the index

Displaying 20 results from an estimated 200 matches similar to: "Comparing two documents in the index"

2006 Jul 31
16
Sorting performance
I'm using acts_as_ferret to index one of my Rails models. Right after I start the app, the first request that orders by a Ferret field takes very long. Subsequent ones seem to be fast, so I guess some caching is going on. Any tips on solving this? Pedro.
2007 Dec 20
1
indexing tuples (example: "frog" => 123) as opposed to words
Hi, I need to map words in a document back to their original word IDs in my database. For example, if I had the sentence "I eat food" and I was searching for "food", I would obviously get the document back as a result. For my particular problem I need to get not only the document ID but also the ID of the match. Suppose my original sentence was actually
2007 Mar 28
4
retrieving search result positions
Hi, I'm considering using Ferret in v2 of Weft QDA, a wxruby desktop application for textual analysis in social science. Ferret seems a very impressive package that meets and exceeds my requirements, but I can't find how to retrieve specific details about the results. I'd like to be able to run fairly simple queries. I then need to look at each term match, and get
2006 Jun 04
20
Proposal of some radical changes to API
Hey guys, Now that the Lucy[1] project has Apache approval and is about to begin, the onus is no longer on Ferret to strive for Lucene compatibility. (We'll be doing that in Lucy). So I'm starting to think about ways to improve Ferret's API. The first part that needs to be improved is the Document API. It's annoying having to type all the attributes to
2005 Dec 16
2
Ferret questions
Hi, I am just getting started with Ferret, but I have a couple of questions. Any help is appreciated. 1) Will the current Ferret implementation only work on one server due to the index file? If it will work on multiple servers, could you point me towards some documentation? If not, are there plans to add support in the future? 2) This may not be Ferret-specific, but if I were implementing
2010 Jul 26
2
related documents
Hi All, I would like to take a doc in the Xapian DB and find all related documents by relevance, e.g. so that when you view one document it says "Related entries X Y Z". I'm aware of the "MoreLikeThis" Lucene plugin that is supposed to do something like this, by generating a query from a document based on term frequency. Has anyone developed a tool to generate a query from a
2005 Nov 26
3
Several questions about Ferret.
Hi. First of all I would like to say "thank you" to David for his really valuable work. Ferret is a great project and it has a great future. Now here are my questions as a beginner in Ferret. How do I remove ALL documents from an index? Removing the files is not a solution. I am interested in something like index.remove_index. What is the usual way of doing it? What is the
2006 Aug 01
5
Per field boost values - possible? working?
I'm making a simple business directory search and I want to boost the relevance of the 'name' field over the 'address' field - both stored in the same document in the same index. Here is some console code to demonstrate what I am actually doing >> include Ferret::Document => Object >> doc = Document.new => Document { } >> doc
2006 Feb 17
1
IndexReader NotImplemented
Hi there, Sorry if this has come up before, but I couldn't see it obviously addressed anywhere. There are a few methods in IndexReader that raise NotImplementedErrors. I'm specifically interested in get_term_vector, but there are a number of others. Is there anything specific holding these back, or would patches to implement them be accepted? Thanks, -- Alex
2007 Jun 04
5
Sorting and getting occurrences of search in hit
Is there any way to get the number of occurrences of the search term in one hit? In a result I get the ferret_rank and ferret_score, but not how many hits the search generated in the current record. I would also like to be able to sort by this when I search. /mattias
2007 Jun 12
5
index browser inconsistent with IndexReader
Hi, We have an index of around 1M web pages as part of our web app. The app uses ferret by way of RDig to perform searches. We have noticed anecdotally that some searches don't work the way we thought they should, as if documents were missing from the index. Yesterday we came upon a concrete instance of this. Our documents have several fields, one of which is called :keywords and
2006 Aug 20
7
missing terms in index causing search errors
I am unable to find results for models when one or more of the terms are not being indexed. Let's suppose I index a User on the phrase "Ruby on Rails." If I then search using User.find_by_contents("Ruby on Rails") I get no results, since "on" is a common term and does not get indexed. Of course, User.find_by_contents("Ruby Rails") works just fine. I
2008 Apr 18
1
Overall p-value from a factor in a coxph fit
Hi all. If I run the simple regression when x is a categorical variable ( x <- factor(x) ): > MyFit <- coxph( Surv(start, stop, event) ~ x ) How can I get the overall p-value on x, rather than one for each dummy variable? > anova(MyFit) does NOT provide that information as previously suggested on the list. All the best, Kare
2006 Feb 09
2
Finding related items (like latent semantic indexing)
I've been trying to use Classifier::LSI to provide a means of finding 'related items', where each item is a one line description of a product. Although on small samples the Classifier works great, it completely baulks on my current dataset of 3000 items. I've started to look at ferret this morning, following a post on the ruby mailing list. I'd
2011 Mar 30
1
Package XML: Parse Garmin *.tcx file problems
I'm struggling with package XML to parse a Garmin file (named *.tcx). I wonder if its form is incomplete, but I am understandably reluctant to paste even a shortened version. The output below shows I can get nodes, but an attempt at the value of a single node comes up empty (even though there is data there). One question: Has anybody succeeded in parsing Garmin .tcx (xml) files? Thanks! Michael
2013 Feb 15
10
reading data
Hi, #working directory data1 #changed name data to data1. Added some files in each of sub directories a1, a2, etc. indx1 <- indx[indx != ""] lapply(indx1, function(x) list.files(x)) #[[1]] #[1] "a1.txt" "mmmmm11kk.txt" #[[2]] #[1] "a2.txt" "mmmmm11kk.txt" #[[3]] #[1] "a3.txt" "mmmmm11kk.txt" #[[4]] #[1]
2009 Aug 19
2
Problem with predict.coxph
We occasionally utilize the coxph function in the survival library to fit multinomial logit models. (The breslow method produces the same likelihood function as the multinomial logit). We then utilize the predict function to create summary results for various combinations of covariates. For example:
2016 May 05
2
GSoC 2016 - Introduction
Hello, Thanks James for the reply. That cleared a few things up. Apologies for replying late; I have exams going on. I was going through the previous clustering API to understand how it worked, and it seems that the approach for constructing the termlists used for distance metrics is TF-IDF weighting with cosine similarity, which is very similar to the approach I would need
2008 Jul 02
2
[LLVMdev] gcc in c++
On Wed, Jul 2, 2008 at 11:13 AM, Hendrik Boom <hendrik at topoi.pooq.com> wrote: > The sad thing is that they seem to be replacing one unsafe language with > another, presumably with enormous effort. > > The only hopeful sign in that thread is the proposal for using Cyclone > instead, starting here: http://gcc.gnu.org/ml/gcc/2008-06/msg00502.html > > But it is summarily
2013 Apr 02
0
[LLVMdev] LNT ClamAV - Sorting output
On 2 April 2013 21:20, Török Edwin <edwin+ml-debian at etorok.net> wrote: > You can pass all the filenames from the inputs/ directory directly on the > command-line, instead of specifying -r inputs/. > That way the order of scanning will be exactly the one specified on the > command-line. > Hum, I think I can fix that with Make... --renato