thr3ads.net - R help - [R] Downloading data from from internet [Sep 2009]

If this information is useful, please help other people find it:
Share via:

Bogaso

2009-Sep-24 13:34 UTC

[R] Downloading data from from internet

Hi all,

I want to download data from those two different sources, directly into R :

http://www.rateinflation.com/consumer-price-index/usa-cpi.php
http://eaindustry.nic.in/asp2/list_d.asp

First one is CPI of US and 2nd one is WPI of India. Can anyone please give
any clue how to download them directly into R. I want to make them zoo
object for further analysis.

Thanks,
-- 
View this message in context:
http://www.nabble.com/Downloading-data-from-from-internet-tp25568930p25568930.html
Sent from the R help mailing list archive at Nabble.com.

cls59

2009-Sep-24 14:27 UTC

head link

[R] Downloading data from from internet

Bogaso wrote:> 
> Hi all,
> 
> I want to download data from those two different sources, directly into R
> :
> 
> http://www.rateinflation.com/consumer-price-index/usa-cpi.php
> http://eaindustry.nic.in/asp2/list_d.asp
> 
> First one is CPI of US and 2nd one is WPI of India. Can anyone please give
> any clue how to download them directly into R. I want to make them zoo
> object for further analysis.
> 
> Thanks,
> 
The following site did not load for me:

http://eaindustry.nic.in/asp2/list_d.asp

But I was able to extract the table from the US CPI site using Duncan Temple
Lang's XML package:

  library(XML)


First, download the website into R:

  html.raw <- readLines(
'http://www.rateinflation.com/consumer-price-index/usa-cpi.php' )

Then, convert to an HTML object using the XML package:

  html.data <- htmlTreeParse( html.raw, asText = T, useInternalNodes = T )

A quick scan of the page source in the browser reveals that the table you
want is encased in a div with a class of "dynamicContent"-- we will
use a
xpath specification[1] to retrieve all rows in that table:

  table.html <- getNodeSet( html.data,
'//div[@class="dynamicContent"]/table/tr' )

Now, the data values can be extracted from the cells in the rows using a
little sapply and xpathXpply voodoo:

  table.data <- t( sapply( table.html, function( row ){

    row.data <-  xpathSApply( row, './td', xmlValue )
    return( row.data)

  }))


Good luck!

-Charlie
 
  [1]:  http://www.w3schools.com/XPath/xpath_syntax.asp

-----
Charlie Sharpsteen
Undergraduate
Environmental Resources Engineering
Humboldt State University
-- 
View this message in context:
http://www.nabble.com/Downloading-data-from-from-internet-tp25568930p25572316.html
Sent from the R help mailing list archive at Nabble.com.

Gabor Grothendieck

2009-Sep-26 10:17 UTC

head link

[R] Downloading data from from internet

Here are three different approaches:

1. Using the first link as an example, on Windows you can copy the
data and headers from IE (won't work in Firefox) to Excel and from
there to clipboard again and then in R:

library(zoo)
DF <- read.delim("clipboard")
z <- zooreg(c(t(DF[5:1, 2:13])), start = as.yearmon("2005-01"),
freq = 12)

2. on any platform you can read it straight into R:

L <-
readLines("http://www.rateinflation.com/consumer-price-index/usa-cpi.php")

and then use the character manipulation functions (grep, sub, gsub,
substr) and as.numeric to parse out the data or

3. on any platform, use the XML package adapting the code in this post:

    https://stat.ethz.ch/pipermail/r-help/2009-July/203063.html

On Thu, Sep 24, 2009 at 9:34 AM, Bogaso <bogaso.christofer at gmail.com>
wrote:>
> Hi all,
>
> I want to download data from those two different sources, directly into R :
>
> http://www.rateinflation.com/consumer-price-index/usa-cpi.php
> http://eaindustry.nic.in/asp2/list_d.asp
>
> First one is CPI of US and 2nd one is WPI of India. Can anyone please give
> any clue how to download them directly into R. I want to make them zoo
> object for further analysis.
>
> Thanks,
> --
> View this message in context:
http://www.nabble.com/Downloading-data-from-from-internet-tp25568930p25568930.html
> Sent from the R help mailing list archive at Nabble.com.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>

Seemingly Similar Threads

Search for more maybe matching threads

R help - Sep 2009 - Downloading data from from internet

[R] Downloading data from from internet

[R] Downloading data from from internet

[R] Downloading data from from internet

Seemingly Similar Threads