Displaying 8 results from an estimated 8 matches for "htmltree".
Did you mean:
htmlthe
2006 Jan 25
0
screenscraping using htmltools and rexml
Hi,
I need to do some screen scraping and I''ve spent a couple hour getting
htmltools and rexml do the right thing. Here''s the code:
parser = HTMLTree::Parser.new(false, false)
parser.feed(res.body)
tree = parser.tree.html_node.as_rexml_document
I works for one page, but for another I get "undefined method `add'' for
#<HTMLTree::Element:0x37f9cc8>" in as_rexml_document
It seems like a library mismatch, but I just downloa...
2009 Jun 06
1
Rpad - avoid removal of manually created html-tags in R chunk
...</pre>
</body>
</html>
If I run Rpad("index.html") in the local server mode and when I turn to
my Rgui, then the object "x" looks like "ABab", i.e. all my html-tags
are removed within the R-chunk.
I know that I can avoid this with nested "HtmlTree"s, e.g.
H("table", H("tr", H("th", "A"), H("th", "B")), H("tr", H("td", "a"),
H("td", "b")))
but as I have to produce html-output of a large dataframe (100 rows and
6 colums), the...
2004 Aug 06
5
Ices2 compile error - streaming ogg!
...oot 2435 Feb 28 13:22 debugXML.h
-rw-r--r-- 1 root root 4146 Feb 28 13:22 encoding.h
-rw-r--r-- 1 root root 3123 Feb 28 13:22 entities.h
-rw-r--r-- 1 root root 2936 Feb 28 13:22 HTMLparser.h
-rw-r--r-- 1 root root 800 Feb 28 13:22 HTMLtree.h
lrwxrwxrwx 1 root root 1 May 18 18:02 libxml -> .
-rw-r--r-- 1 root root 2551 Feb 28 13:22 nanoftp.h
-rw-r--r-- 1 root root 920 Feb 28 13:22 nanohttp.h
-rw-r--r-- 1 root root 17325 Feb 28 13:22 parser.h
-rw-r--r-- 1 root roo...
2004 Aug 06
1
Ices2 compile error - streaming ogg!
...> > -rw-r--r-- 1 root root 4146 Feb 28 13:22 encoding.h
> > -rw-r--r-- 1 root root 3123 Feb 28 13:22 entities.h
> > -rw-r--r-- 1 root root 2936 Feb 28 13:22 HTMLparser.h
> > -rw-r--r-- 1 root root 800 Feb 28 13:22 HTMLtree.h
> > lrwxrwxrwx 1 root root 1 May 18 18:02 libxml -> .
> > -rw-r--r-- 1 root root 2551 Feb 28 13:22 nanoftp.h
> > -rw-r--r-- 1 root root 920 Feb 28 13:22 nanohttp.h
> > -rw-r--r-- 1 root root 17325 Feb 28 13:...
2006 Jan 10
1
OT: Scraper library recommendation
...t, as it wouldn''t recognize
XHTML-style "empty" tags (for instance, it dislikes <link ... />).
What''s even worse, I can''t seem to get it to dump back the HTML it read.
Something as simple as:
#!/usr/bin/env ruby
require ''html/tree''
p = HTMLTree::Parser.new(false, false)
p.feed("<a href=''about:blank''><img src=''blah'' /></a>")
p.tree.dump
Results in:
<a href="about:blank">
<img src="blah">
Rubyful Soup is not perfect either, quite often...
2004 Aug 06
0
Ices2 compile error - streaming ogg!
...28 13:22 debugXML.h
> -rw-r--r-- 1 root root 4146 Feb 28 13:22 encoding.h
> -rw-r--r-- 1 root root 3123 Feb 28 13:22 entities.h
> -rw-r--r-- 1 root root 2936 Feb 28 13:22 HTMLparser.h
> -rw-r--r-- 1 root root 800 Feb 28 13:22 HTMLtree.h
> lrwxrwxrwx 1 root root 1 May 18 18:02 libxml -> .
> -rw-r--r-- 1 root root 2551 Feb 28 13:22 nanoftp.h
> -rw-r--r-- 1 root root 920 Feb 28 13:22 nanohttp.h
> -rw-r--r-- 1 root root 17325 Feb 28 13:22 parser.h
> -rw...
2006 Jun 05
6
HTML Parsing libraries
Hi,
What is the best way to parse HTML?
Or is there a simple way to convert a table to an array?
I tried beautiful_soup and the built-in htmltools, but have trouble
getting them to run.
Any pointers?
Thanks, Hari
--
Posted via http://www.ruby-forum.com/.
2004 Aug 06
2
Is there a guide?
At 07:59 AM 7/29/02 +1000, you wrote:
>I noticed another post where it was mentioned which modules were needed etc,
>then followed the instructions at xiph.org for CVS. It's working now,
>although darkice (which is none of xiph's concern AFAIK) seg-faults.....
>
>I haven't been able to test ices2 in live mode, well, it seems to work, but I
>get silence.... ices2 in