similar to: problem scraping using nokogiri - getting wrong characters

Displaying 20 results from an estimated 1000 matches similar to: "problem scraping using nokogiri - getting wrong characters"

2011 Dec 05
12
Using nokogiri
HI, I want to grab some information about university names, and I found this term called "web scraping" I search about it in google, and there are tools in ruby. One of them is nokogiri but I''m a bit confused because it seems that it only gets information that its already in an html or xml I found a webpage that have a list of university names as a <select>
2011 Dec 15
4
Scrapping Content from a website
Hey all, I want to scrap some content from a website to rails application and wants to authenticate the url of the website which I want to scarp so please help me out.... Thank You!!! -- You received this message because you are subscribed to the Google Groups "Ruby on Rails: Talk" group. To post to this group, send email to
2012 Nov 13
1
Unofficial Lustre BOF at SC12
Are you in Salt Lake City? We''ll be having an ''unofficial'' Birds of a Feather (BOF) session Wednesday evening the 14th, from 5:30pm to 7:30pm, at the Salt Lake Marriott Downtown City Creek 2nd floor Snowbird Rm. We''ve had a really busy year with some great progress around Lustre development in particular! Also some new important participants have joined. Lots
2010 Jan 25
4
Does Amazon.com blocks scraping?
Hi there Does anyone know if Amazon.com has any sort of server side script that tries to block scraping activities? I first noticed that if I didn?t change the agent alias, it would fetch a page exactly like the normal one, but without the intial search field(maybe a silly way to prevent scraping). Then after it, I changed to some other alias, and submit a search. I got the result page as
2011 Sep 24
1
How to transfer HTML data to a XML file
Hello friends, Can anyone please help me with transfering HTML data to a XML file. What I need to do is I need to extract the data from the text fields of the HTML form and then transfer this data to the proper tags of the XML file. Will appreciate your help!Thanks Regards Adi -- Posted via http://www.ruby-forum.com/. -- You received this message because you are subscribed to the Google
2011 Sep 21
5
bundle install rmagick error in unix
I have Rails 3 and Ruby 1.9.2 Installing rmagick (2.13.1) with native extensions /opt/local/lib/ruby/site_ruby/1.9/rubygems/installer.rb:551:in `rescue in block in build_extensions'': ERROR: Failed to build gem native extension. (Gem::Installer::ExtensionBuildError) /opt/local/bin/ruby19 extconf.rb checking for Ruby version >= 1.8.5... yes checking for gcc... yes checking for
2011 Oct 16
2
current_path?
What''s the best way to retrieve the current path? (i.e. /posts/1) -- You received this message because you are subscribed to the Google Groups "Ruby on Rails: Talk" group. To post to this group, send email to rubyonrails-talk-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To unsubscribe from this group, send email to
2012 Feb 07
10
Encoding error
Hello, I have in my db a register what have special characters, and when I try to put on my form to edit this values, this happens: incompatible character encodings: UTF-8 and ASCII-8BIT Extracted source (around line #4): 1: <%= form_for :group, :url => { :action => "update" } do |f| %> 2: <%= utf8_enforcer_tag %> 3: <label>Nome do grupo</label>
2009 Oct 21
4
XML file using Nokogiri gem
Hello friends, Can you guys give me some idea about how to Create XML file using Nokogiri gem. -- Posted via http://www.ruby-forum.com/.
2011 Jul 11
39
Help filling out a registration (I think it's called multi tenant)
Do you know when you are filling out a registration for a new account, and you keep pressing next to fill the rest of the information in separate pages? I''d like to do that, but I have no idea how. Basically I have a company model with its information, and a contacts model, with belongs to my company (also each company has only one contact), and I''d like for the user to fill in
2009 Nov 11
2
nokogiri: parsing tags
Hello, I''m trying to use nokogiri gem to parse individual xhtml tags (not the whole html document) and preform some processing on them. Heres an example of a string that I want to process: str = ''<img src="http://www.canadascapital.gc.ca/data/2/rec_imgs/5005_Pepsi_H1NB.gif"/> some text<a href="#">test</a> ....'' The following code
2010 Jun 10
0
strange problem:no such file to load -- nokogiri on rails3
I am test the script named test.rb by command "ruby test.rb" and it pass. Then i converte it as rake task. The rake task pass on rails 2.3.4 but fail on rails 3 beta4 with error msg: "no such file to load -- nokogiri" ============================ env :rails 3 beta4, ruby 1.8.7, ubuntu ============================ test.rb ============================ require
2013 Sep 02
2
Why the string interpolation is not working inside the Nokogiri method `#search` ?
Why the string interpolation is not working inside the Nokogiri method `#search` ? require ''nokogiri'' doc = Nokogiri::HTML::Document.parse <<-eotl <div> <p>foo</p> <p>foo</p> <p>bar</p> </div> eotl doc.class # => Nokogiri::HTML::Document class Person attr_accessor :name end ram = Person.new
2010 Oct 31
0
Loading xml document using Nokogiri and retrieving CDATA element
This is an extension of my last post (problems with REXML) which has me looking to Nokogiri again. The reason I am not using Nokogiri is I can not seem to find a way to get CDATA out of a Nokogiri document. First, can you tell me if I am loading my document correctly, because when I call my_document.to_xml, I only get one line back: (rdb:1) test_file = Nokogiri::XML(mismo_xml_file)
2013 Jun 29
2
how to remove nokogiri warning message
hi I always get this message today WARNING: Nokogiri was built against LibXML version 2.8.0, but has dynamically loaded 2.9.0 I googled by all solution is for mac, I am use linux(openSUSE 12.3) any one can help? thanks in advance -- You received this message because you are subscribed to the Google Groups "Ruby on Rails: Talk" group. To unsubscribe from this group and stop
2011 Oct 02
5
tutorial: gravatar_image_tag problem
Hi all, Still working through the Hartl tutorials. Making progress but am hitting (another) wall. The repository is on github at: git-9UaJU3cA/F/QT0dZR+AlfA@public.gmane.org:aamax/sample_app.git if you want to see the code. I''ve pretty much followed the tutorial exactly except I am using rails 3.1.0 rather than 3.0.1 and some of the other gem versions are different. I
2009 Jan 24
4
nokogiri selector help
hiya, i want the selector that would return a <tr> of a table if any td contains some text, so i can use it in click_link_within e.g. When I click the "show" link within the row containing "user at example.com" When /^I click the "(.+)" link within the row containing "(.+)"$/ do | link, text| selector = ?? click_link_within selector, link
2009 Oct 13
9
Nokogiri: to_s WITHOUT html surrounding's tags?
Hi all n = Nokogiri::HTML("<h1>H1</h1>") n.to_s # => <!DOCTYPE html PUBLIC \"-//W3C//DTD HTML 4.0 Transitional//EN\" \"http://www.w3.org/TR/REC-html40/loose.dtd\">\n<html><body><h1>H1</h1></body></html> Is there a method that only outputs the stuff I''ve read, and not the whole valid XHTML stuff?
2013 Jun 04
3
Nokogiri::XML methods example
Hi, I am not able to find out a simple example to find out the how the method works: > Nokogiri::XML::EntityReference::new (http://nokogiri.org/Nokogiri/XML/EntityReference.html#method-c-new) > Nokogiri::XML::Document#canonicalize (http://nokogiri.org/Nokogiri/XML/Document.html#method-i-canonicalize) Thanks -- Posted via http://www.ruby-forum.com/. -- You received this message
2014 May 18
0
Problem on reading a web page (no matter using Net:HTTP or nokogiri)
Currently I am trying to get some reference data from bet.hkjc.com/football/index.aspx?lang=ch&pageno=1 However, I could not read the content (page source) properly even though I used force_encoding or String.encode! after getting the response.body Same coding is working fine for other site, but just this site has some tricky things made ruby cannot read the page content correctly. Do