Displaying 20 results from an estimated 52 matches for "readhtmltable".
2012 Jun 14
1
readHTMLTable function - unable to find an inherited method ~ for signature "NULL"
Hi R experts,
I have been playing with library(XML) recently and found out that
readHTMLTable works flawlessly for some websites, but it does give me an
error like below
... Error in function (classes, fdef, mtable) :
unable to find an inherited method for function "readHTMLTable", for
signature "NULL"
let's say..for example, this code works fine
a <-"h...
2009 Nov 26
1
How to suppress errors generated by readHTMLTable?
library(XML)
download.file('http://polya.umdnj.edu/polya_db2/gene.php?llid=109079&unigene=&submit=Submit','index.html')
tables=readHTMLTable("index.html",error=function(...){})
tables
readHTMLTable gives me the following errors. Could somebody let me
know how to suppress them?
Opening and ending tag mismatch: center and table
htmlParseEntityRef: expecting ';'
htmlParseEntityRef: expecting ';'
htmlParseEntit...
2012 May 26
3
Problem with readHTMLTable
Hello All,
I was trying to simply run readHTMLTable on the example published in the
package, and on a page I was working on. So running:
u = "http://en.wikipedia.org/wiki/List_of_countries_by_population"
tables = readHTMLTable(u)
returns the following error:
Error in tb[["thead"]] : subscript out of bounds
looking up this er...
2010 Mar 18
1
Do colClasses in readHTMLTable (XML Package) work?
Hi,
I can't get the colClasses option to work in the readHTMLTable function
of the XML package. Here's a code fragment:
require("XML")
doc <- "http://www.nber.org/cycles/cyclesmain.html"
table <- getNodeSet(htmlParse(doc),"//table") [[2]] # The
main table is the second one because it's embedded i...
2013 Jan 15
1
readHTMLTable (XML package)
Hi,
I am using XML::readHTMLTable and getting the below error. Does anyone know why? Does this function not work with https? I didn't see anything in help about that.
> library(XML)
> wampage<-readHTMLTable('https://hr-workforce-analytics.llnl.gov/wf_pi_pop.html',1)
Error in htmlParse(doc) :
File https://hr-...
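A common workaround for this kind of failure is to fetch the page text with an HTTPS-capable client and hand the text to the parser instead of the URL. The sketch below is a minimal illustration under assumptions: the inline `page_text` string is a made-up stand-in for whatever such a client returns (RCurl's `getURL()` is one candidate, but that choice is not from the post), and the table contents are invented.

```r
library(XML)

# Hypothetical stand-in for page text already downloaded by an
# HTTPS-capable client; the real page content is not reproduced here.
page_text <- "<html><body><table>
  <tr><th>name</th><th>count</th></tr>
  <tr><td>a</td><td>1</td></tr>
  <tr><td>b</td><td>2</td></tr>
</table></body></html>"

# asText = TRUE tells htmlParse() to treat the argument as HTML content,
# not as a file name or URL, so no network access happens in the parser.
doc <- htmlParse(page_text, asText = TRUE)

# which = 1 returns only the first table, as in the original call.
first_table <- readHTMLTable(doc, which = 1, stringsAsFactors = FALSE)
first_table
```

Separating the download from the parse also makes it easier to see which of the two steps is failing.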
2013 Mar 20
1
htmlParse (from XML library) working sporadically in the same code
...website. Sometimes the code fails, sometimes it works; most of the time it doesn't, and I cannot see why. The file I am trying to parse is
http://www.londonstockexchange.com/exchange/prices-and-markets/international-markets/indices/home/sp-500.html?page=0
Sometimes the following code works
n<-readHTMLTable(htmlParse(url))
But most of the time it would return the following error coming from htmlParse:
Error: failed to load HTTP resource
Error is coming from the following line in htmlParse code:
ans <- .Call("RS_XML_ParseTree", as.character(file), handlers, as.logical(ignoreBlanks...
2012 Sep 17
1
memory leak using XML readHTMLTable
...experiencing. In the
searches I've done, it sounds like the existence of the leak is fairly
well known. What isn't as clear is exactly how to solve it. The
general process I'm using is this:
require(XML)
myFunction <- function(URL) {
html <- readLines(URL)
tables <- readHTMLTable(html, stringsAsFactors = FALSE)
myData <- data.frame(Value = tables[[1]][, 2],
row.names = make.unique(tables[[1]][, 1]),
stringsAsFactors = FALSE)
rm(list = c("html", "tables")) # here, and
free(tables)...
2017 Jul 10
2
Problems with time formats when importing data using readHTMLTable
...te
and time. Older records, as e.g. "2017-07-09 17:02 (UTC)" appear as e.g.
"1499619726149961972621 hours, 59 minutes ago".
I don't know how to convert these data to the time formats used in R
(POSIXct).
The script is very simple and worked before:
library(XML)
x <- readHTMLTable('url')
where the 'url' is the link to the website with the specification of the
vessel.
I appreciate any help.
Cristina
--
Cristina Silva
Divisão de Modelação e Gestão de Recursos Pesqueiros
Av. Dr. Alfredo Magalhães Ramalho
1495-165 Lisboa
@: csilva at ipma.pt <mailto:csil...
2017 Jul 10
0
Problems with time formats when importing data using readHTMLTable
...and the best-case scenario when you let your email program send HTML is that what you saw is not what we see (worst case is your email is scrambled on our end).
Have you read the documentation for the function you are using? In particular, what about the colClasses argument? If you don't let readHTMLTable guess what the format is (have it read in as character data) then you have a fighting chance to get it right yourself, e.g.
as.POSIXct( "2017-07-10 14:04 (UTC)", format="%Y-%m-%d %H:%M (UTC)", tz="UTC" )
-----
[1] http://stackoverflow.com/questions/5963269/how-to-ma...
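The advice in this reply (keep the column as character data, then convert it yourself) can be sketched with base R alone. The `stamps` vector is a hypothetical stand-in for a timestamp column read as character; its two values are the example strings quoted in this thread.

```r
# Hypothetical character column, as it might look when readHTMLTable is
# told not to guess the type (values taken from the thread's examples).
stamps <- c("2017-07-09 17:02 (UTC)", "2017-07-10 14:04 (UTC)")

# The literal " (UTC)" suffix is part of the format string, so strptime
# matches it verbatim; tz = "UTC" fixes the time zone of the result.
parsed <- as.POSIXct(stamps, format = "%Y-%m-%d %H:%M (UTC)", tz = "UTC")

# Any string that does not match the format becomes NA, which makes
# unexpected values (such as relative "hours ago" text) easy to spot.
bad_rows <- which(is.na(parsed))

parsed
```

Checking `bad_rows` before using the result is a cheap way to catch rows where the website switched to a different timestamp style.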
2011 May 05
1
issue with "strange" characters (readHTMLTable)
...gly little credit, just
the brickbats for things for which we are not responsible. (We even work
hard to port XML to Windows for you, again with almost zero credit.)
That URL is a page in UTF-8, as its header says. We have provided many ways
to work with UTF-8 on Windows, but it seems readHTMLTable() is not making
use of them.
You need to run iconv() on the strings in your object (which as it has
factors, are the levels). When you do so, you will discover that page
contains characters not in your native charset (I presume, not having your
locale).
What you can do, in Rgui o...
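The iconv() step described in this reply can be sketched as follows. This is an illustration under assumptions: `island_names` is a hypothetical factor standing in for a column returned by readHTMLTable(), and "latin1" stands in for a Windows native charset, since the poster's locale is not known.

```r
# Hypothetical factor; the \u02BB escape is the 'okina used in Hawaiian names.
island_names <- factor(c("Hawai\u02BBi", "Maui", "O\u02BBahu"))

# For a factor, the strings live in the levels, so convert those.
# With the default sub = NA, a level that cannot be represented in the
# target charset is typically returned as NA.
converted <- iconv(levels(island_names), from = "UTF-8", to = "latin1")
unconvertible <- levels(island_names)[is.na(converted)]

# One possible fallback: substitute a placeholder for each byte that
# cannot be converted, so that no level is lost.
with_sub <- iconv(levels(island_names), from = "UTF-8", to = "latin1",
                  sub = "?")
levels(island_names) <- with_sub
```

Whether substitution is acceptable depends on the use; the `unconvertible` vector shows which names would be altered.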
2012 Jun 07
1
How to set cookies in RCurl
...is a
restricted access website that I access through a proxy server (which
therefore requires me to enable cookies). I have problems in allowing Rcurl
to receive and send cookies.
The following lines give me:
library(RCurl)
library(XML)
url <- "http://www.theurl.com"
content <- readHTMLTable(url)
content
$`NULL`
V1
1...
2011 May 04
1
issue with "strange" characters (locale settings)
...32, R-21.13.0
Dear list,
I have a problem that (I think) relates to the interaction between Windows
and R.
I am trying to scrape a table with data on the Hawai'ian Islands. This is my
code:
library(XML)
u <- "http://en.wikipedia.org/wiki/Hawaii"
tables <- readHTMLTable(u)
Islands <- tables[[5]]
The output is (first set of columns):
Island Nickname
> Islands
Island Nickname
Locatio...
2017 Jul 11
1
Problems with time formats when importing data using readHTMLTable
...anged some
>> permissions.
>>
>> Here is the script used to get the data directly from the webpage
>> into R, for a sample of 20 records (10 per page):
>>
>> library(XML)
>> x <- list()
>> for (i in 1:2)
>> {
>> x[i]<-
>> readHTMLTable(paste('http://www.marinetraffic.com/en/ais/index/positions/all/shipid:318358/mmsi:263601000/shipname:NORUEGA/per_page:10/page:',
>> i, sep=''))
>> }
>>
>> ais <- do.call('rbind', x)
>> ais <- ais[,-7]
>>
>> and I got the follo...
2017 Jul 11
0
Problems with time formats when importing data using readHTMLTable
...ne this before with no problems, but probably the webpage changed some permissions.
>
> Here is the script used to get the data directly from the webpage into R, for a sample of 20 records (10 per page):
>
> library(XML)
> x <- list()
> for (i in 1:2)
> {
> x[i]<- readHTMLTable(paste('http://www.marinetraffic.com/en/ais/index/positions/all/shipid:318358/mmsi:263601000/shipname:NORUEGA/per_page:10/page:', i, sep=''))
> }
>
> ais <- do.call('rbind', x)
> ais <- ais[,-7]
>
> and I got the following table:
>
>> ais
&...
2017 Jul 11
2
Problems with time formats when importing data using readHTMLTable
...ated and quick procedure. I have done this before with no
problems, but probably the webpage changed some permissions.
Here is the script used to get the data directly from the webpage into
R, for a sample of 20 records (10 per page):
library(XML)
x <- list()
for (i in 1:2)
{
x[i]<-
readHTMLTable(paste('http://www.marinetraffic.com/en/ais/index/positions/all/shipid:318358/mmsi:263601000/shipname:NORUEGA/per_page:10/page:',
i, sep=''))
}
ais <- do.call('rbind', x)
ais <- ais[,-7]
and I got the following table:
> ais
T...
2012 Aug 09
2
read htm table error
...and I haven't been able to read the HTML table. Following is my code and error message.
Error in htmlParse(doc) :
error in creating parser for http://en.wikipedia.org/wiki/Brazil_national_football_team
theurl <- "http://en.wikipedia.org/wiki/Brazil_national_football_team"
tables <- readHTMLTable(theurl)
Regards,
Kiung
[[alternative HTML version deleted]]
2012 Sep 19
1
scraping with session cookies
...<- "cookies.txt"
no_cookie <- function() {
curlHandle <- getCurlHandle(cookiefile=cf, cookiejar=cf)
getURL(site, curl=curlHandle)
rm(curlHandle)
gc()
}
if ( file.exists(cf) == TRUE ) {
file.create(cf)
no_cookie()
}
allTables <- readHTMLTable(site)
allTables
[[alternative HTML version deleted]]
2013 Feb 28
0
Scraping data from website---Error in htmlParse: error in creating parser
...or the football projections (
http://accuscore.com/fantasy-sports/nfl-fantasy-sports/), it displays the
QB projections. When I click on another position (e.g., RB) it displays a
new URL (
http://accuscore.com/fantasy-sports/nfl-fantasy-sports/Rest-of-Season-RB).
When I enter this new URL into the readHTMLTable function, I receive the
following error:
Error in htmlParse("
http://accuscore.com/fantasy-sports/nfl-fantasy-sports/Rest-of-Season-RB/")
:
error in creating parser for
http://accuscore.com/fantasy-sports/nfl-fantasy-sports/Rest-of-Season-RB/
What's going on? Might this have some...
2010 Nov 04
3
postForm() in RCurl and library RHTMLForms
...to see the data on the website
url <- "http://www.nseindia.com/content/indices/ind_histvalues.htm"
for the index "S&P CNX NIFTY" for
dates "FromDate"="01-11-2010","ToDate"="02-11-2010"
then read the HTML table from the page using readHTMLTable()
I am using this code
webpage <- postForm(url,.params=list(
"FromDate"="01-11-2010",
"ToDate"="02-11-2010",
"IndexType"="S&P CNX NIFTY",
&...
2011 May 26
2
What am I doing wrong with sapply ?
...oking for.
9 : s <-sapply(unlist(v[c(1:length(v))]), max)
11: for(i in 1 :length(v)) v1[i] <- max(unlist(v[i]))
Shouldn't I get the same answer ?
library(XML)
rm(list=ls())
url <-
"http://webapp.montcopa.org/sherreal/salelist.asp?saledate=05/25/2011"
tbl <-data.frame(readHTMLTable(url))[2:404, c(3,5,6,8,9)]
names(tbl) <- c("Address", "Township", "Parcel", "SaleDate", "Costs");
rownames(tbl) <- c(1:length(tbl[,1]))
x <-tbl
v <- gregexpr("( aka )|( AKA )",x$Address)
s <-sapply(unlist(v[c(1:length(v))]),...