Hi,

I want to parse an HTML document using Ruby. I tried REXML, but it failed to load the document because the HTML is not well-formed. Can anyone suggest a good parser for HTML pages in Ruby?

Thanks,
Karika.

-- Posted via http://www.ruby-forum.com/.
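For anyone who wants to reproduce the problem, here is a minimal sketch of why REXML rejects typical HTML. It assumes the failure comes from REXML being a strict XML parser, so an unclosed tag such as `<br>` raises `REXML::ParseException`. The helper name `well_formed?` and the two sample strings are made up for illustration and are not from the original post.

```ruby
require 'rexml/document'

# Hypothetical helper: returns true if REXML can parse the markup as XML,
# false if it raises a parse error.
def well_formed?(markup)
  REXML::Document.new(markup)
  true
rescue REXML::ParseException
  false
end

# Sample inputs (assumptions for illustration only).
strict = '<p>line one<br/>line two</p>' # XML-style, self-closed <br/>
sloppy = '<p>line one<br>line two</p>'  # common HTML, <br> never closed

puts well_formed?(strict)
puts well_formed?(sloppy)
```

A check like this only tells you whether a page can go through REXML at all. Real-world HTML usually fails it, which is why a tolerant HTML parser or a cleanup step is needed first.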
http://rubyforge.org/projects/tidy/

On 13-Feb-06, at 2:01 PM, Karika wrote:
> Hi,
>
> I want to parse the html doc using ruby.
> I tried using reXML but failed to load html doc as it is not in well
> formed structure.
> Can you please suggest me a good parser which I can use to parse HTML
> page using Ruby?
>
> Thanks,
> Karika.
> _______________________________________________
> Rails mailing list
> Rails@lists.rubyonrails.org
> http://lists.rubyonrails.org/mailman/listinfo/rails
Karika wrote:
> Hi,
>
> I want to parse the html doc using ruby.
> I tried using reXML but failed to load html doc as it is not in well
> formed structure.
> Can you please suggest me a good parser which I can use to parse HTML
> page using Ruby?
>
> Thanks,
> Karika.

I've had good luck with Rubyful Soup: http://www.crummy.com/software/RubyfulSoup/