thr3ads.net - similar to: "Someone getting RDig work for Linux?"

Displaying 20 results from an estimated 700 matches similar to: "Someone getting RDig work for Linux?"

2007 Sep 27

Problem getting "extract" from RDig

Hi All, I have to have a site wide search for my current application. By search I mean I have to search the static and the dynamic contents from the database. I have been searching on this for a while on the net and RDig seems to be an apt solution. While using it I have encountered a few problems. I know these might be very basic issues but I have not been able to figure out what is wrong with

2007 Sep 18

basic rdig setup

I'm developing locally on Windows and I have a remote dev box that runs Linux. I'm trying to use RDig just to index using urls, no files. Both use acts_as_ferret for an administrative search that works fine. On the Windows machine, I get no errors, but get no results. On the Linux machine, I get: File Not Found Error occured at <except.c>:93 in xraise Error occured in

2006 Jul 25

RDig document processing error

Hi all, Am having problems using RDig: With this rdig config... cfg.crawler.start_urls = ['http://www.defensetech.org'] cfg.crawler.include_hosts = ['www.defensetech.org'] cfg.index.path = '/my/path/to/index' cfg.verbose = true ...I get this output: $ rdig -c config/rdig_config.rb /usr/local/lib/site_ruby/1.8/ferret/index/term.rb:45:

2006 Jul 14

RDig config file problem

Hi All, Hope it is ok to post RDig queries on this forum. Just trying to get RDig working (Ubuntu 6.06, RDig 0.3.0, ferret 0.9.4, rubyful_soup 1.0.4) Here is my output: sh:~/rdigtry$ rdig -c config/rdig_config.rb discovered content extractor class: RDig::ContentExtractors::PdfContentExtractor discovered content extractor class: RDig::ContentExtractors::WordContentExtractor discovered

2006 Mar 25

RDig - ferret-based website crawler/indexer

Hi! RDig is a small tool to build a Ferret index for the contents of a website or intranet. It contains a simple HTTP crawler and some support for extracting textual content from the fetched pages. I built this to implement a site-wide search for a recent project that combined a Rails application with lots of static html files generated by a CMS. Any feedback is very welcome! Rubyforge

2007 Jan 21

could not install in WinXP

Directory of C:\search_app 01/21/2007 19:37 <DIR> . 01/21/2007 19:37 <DIR> .. 01/21/2007 19:36 427 008 ferret-0.10.13.gem 01/21/2007 19:07 148 992 rdig-0.3.4.gem 2 File(s) 576 000 bytes 2 Dir(s) 45 135 982 592 bytes free C:\search_app>gem install ferret Building native extensions. This could

2007 Jan 05

adding one url to rdig index?

Hey there, I'm building a rails site using RDig as a site-wide search. I would like to be able to add just one URL (or possibly a list) to an existing index, so that when certain pages change I can update the index without reindexing the entire site. I looked through the documentation and didn't see an example on how to do this so I am looking for some guidance here :). Is

2007 Jul 29

RDig and AAF playing together

I have a site with two indexes. Index A is created offline by RDig and queried from the web via RDig (specifically, RDig.searcher.search). Index B is managed by AAF with :remote => true. Simple enough. However, I need to query both indexes from RDig. Usually this is ok, as I modified RDig to accept an array of search_paths with an element for index A and index B. However, when Index

2007 Jun 23

End of File Error on index optmize

I was optimizing a 650MB index using ferret (0.11.3) and I received the following error. I've seen some people have similar issues but I haven't seen any resolutions. The contents of the index directory follow the error. Has anyone seen anything like this and found a resolution? Many thanks. /mnt/apps/search/releases/20070622175637/script/../config/../vendor/

2007 Feb 10

Adding extra fields to an index (using RDig?)

Hello everyone, I am writing an application which collects a set of web sites and caches them locally for offline viewing. I want to do searches on this collection and associate extra data with each result (e.g. date collected, reason for collection, perhaps a sequence number). Now all this data exists when the harvesting is done and could be stored in a database. I want to use RDig to index my

2007 Feb 15

Proximity searching in rdig ferret

Lucene has a syntax "foo bar"~10 for finding foo within 10 words of bar. Does ferret support this feature? (the ~ is used for fuzzy queries) Does rdig? This could be a deal breaker for me 'cos I really need proximity searches -- Posted via http://www.ruby-forum.com/.

2007 Feb 15

rdig wildcard searches

Lucene has simple wildcard syntax supporting ? and * thus ruby could be matched by rub? r*by etc. This doesn't work using rdig on the command line e.g. rdig -c config.rb -q 'data:"ru?y"' gives RDig version 0.3.4 using Ferret 0.10.14 executing query >data:"ru?y"< Query: data:"ru y"~1 which is something entirely different. The

2007 Apr 14

Error on optimize leads to corrupt index?

The following exception occurred while trying to optimize a large index: vendor/gems/rdig-0.3.4/lib/rdig/index.rb:46:in `optimize': End-of-File Error occured at <except.c>:93 in xraise (EOFError) Error occured in store.c:216 - is_refill current pos = 0, file length = 0 Now, I get the following error any time I try to create a new index on the directory that I was trying

2008 Mar 25

Questions about backgroundrb

Cc'ing to the list for archival purposes: On Tue, Mar 25, 2008 at 7:55 PM, Brian Noguchi <brian.noguchi at gmail.com> wrote: > Hi Hemant, > > I'm Brian Noguchi, a developer in the Bay Area. I have some questions about > backgroundrb, and I found your contact info on a forum. I figured it's > probably best to get answers straight from the source. > >

2010 Nov 28

converting xml to a OpenStruct

Hi, just wondering if there is an easy way to turn an xml entity into an object of type OpenStruct where I can access everything like a property? It's kind of like parsing xml, I already know about nokogiri, but I want to work with an OpenStruct and not a DOM object, does that make sense? =P -- Kind Regards, Rajinder Yadav | DevMentor.org | Do Good! ~ Share Freely GNU/Linux:

2009 Sep 13

regrex_crawler -- a crawler which uses regular expression to catch data from website

RegexpCrawler is a crawler which uses regular expressions to catch data from a website. It is easy to use and needs less code if you are familiar with regular expressions. The project site is: http://github.com/flyerhzm/regexp_crawler/tree I give an example: a script to synchronize your github projects except fork projects; please check example/github_projects.rb require 'rubygems'

2007 Jun 24

Example for using ferret search engine

Hi, Is there any application where I can see the usage of Ferret engine(like example implementation). I have some difficulties in using it, sending query and getting the results. Thank you, Raj. -- Posted via http://www.ruby-forum.com/.

2007 Sep 25

putting away HashWithIndifferentAccess

Hey, campineros. And many good handshakes to zimbatm for getting some patches applied. So, yeah, I'd really like to get rid of any serious dependencies with this 1.6 release. Anything that's not in stdlib has to go. Of course, camping-omnibus will still assume the whole ActiveRecord, Markaby, Mongrel setup that's in the history books. Metaid can be removed and

2006 Nov 17

acts_as_ferret and searching word docs

I was wondering if it is possible to search word documents using ferret. The actual text in a word document isn't in a binary format - only the formatting. Surely it would be possible to parse that? -- Posted via http://www.ruby-forum.com/.

2013 Sep 12

Importing packages in Depend

Hi, I am currently preparing a new version of my package papeR. When I run R CMD check using the development version of R I get the following note: Package in Depends field not imported from: 'nlme', 'lme4', 'survival' These packages need to be imported from for the case when this namespace is loaded but not attached. I now have problems fixing this issue. It is easy to get rid of two of the
