The Ecfun package includes functions written to scrape data from
web pages. See, e.g., readUShouse, readUSsenate,
readUSstateAbbreviations. They use getURL{RCurl} and readHTMLTable{XML}.
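
A minimal sketch of that approach follows. It is not the Ecfun code itself. The HTML string is a hypothetical stand-in for a downloaded page, and its "Field"/"Value" layout and the "XYZ" and "1.5B" entries are made up for illustration, so the real page will likely need a different table index or lookup.

```r
library(XML)    # htmlParse(), readHTMLTable()

# Hypothetical page content. For a live page one would instead do
# something like:  page <- RCurl::getURL(url)
page <- '<html><body>
<table>
  <tr><th>Field</th><th>Value</th></tr>
  <tr><td>Ticker</td><td>XYZ</td></tr>
  <tr><td>Mkt Cap</td><td>1.5B</td></tr>
</table>
</body></html>'

# Parse the text into an HTML document, then pull every <table>
# into a list of data frames.
doc  <- htmlParse(page, asText = TRUE)
tabs <- readHTMLTable(doc, header = TRUE, stringsAsFactors = FALSE)

# Take the first table and look up the row labelled "Mkt Cap".
tab    <- tabs[[1]]
mktCap <- tab[tab[[1]] == "Mkt Cap", 2]
print(mktCap)
```

If the values of interest are not in a <table> element, readHTMLTable will not find them; in that case xpathSApply{XML} on the parsed document is probably the tool to try.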
Hope this helps.
Spencer Graves
On 11/20/2014 5:42 PM, Matt Considine wrote:
> Hi,
> I'm wondering if anyone can point me to code to parse data on Google
> Finance pages, i.e. parse the results of a URL request such as this
> http://www.google.com/finance?q=apple
>
> I know how to return the contents of the page; it's figuring out the
> best tools to parse it that I'm interested in and hopefully someone
> has already done this.
>
> (For what it is worth, the only info I am looking for is the ticker,
> exchange, currency and "Mkt Cap" datapoint)
>
> Thanks in advance for any help - scraping is not my strong suit.
> Matt