Displaying 2 results from an estimated 2 matches for "htmlcontentextractor".

2006 Jul 25
1
RDig document processing error
...ib/rubyful_soup.rb:230: warning: method redefined; discarding old attrs
discovered content extractor class: RDig::ContentExtractors::PdfContentExtractor
discovered content extractor class: RDig::ContentExtractors::WordContentExtractor
discovered content extractor class: RDig::ContentExtractors::HtmlContentExtractor
using Ferret 0.9.0
/usr/local/lib/site_ruby/1.8/rdig/url_filters.rb:116: warning: instance variable @patterns not initialized
/usr/local/lib/site_ruby/1.8/rdig/url_filters.rb:105: warning: instance variable @patterns not initialized
added url http://www.defensetech.org
fetching http://www.defense...
2006 Jul 14
2
RDig config file problem
...1.0.4) Here is my output:
sh:~/rdigtry$ rdig -c config/rdig_config.rb
discovered content extractor class: RDig::ContentExtractors::PdfContentExtractor
discovered content extractor class: RDig::ContentExtractors::WordContentExtractor
discovered content extractor class: RDig::ContentExtractors::HtmlContentExtractor
/home/steven/rdigtry/config/rdig_config.rb:4
/usr/lib/ruby/gems/1.8/gems/rdig-0.3.0/lib/rdig.rb:113:in `configuration'
/home/steven/rdigtry/config/rdig_config.rb:1
/usr/lib/ruby/gems/1.8/gems/rdig-0.3.0/lib/rdig.rb:226:in `load_configfile'
/usr/lib/ruby/gems/1.8/gems/rdig-0.3.0/...