thr3ads.net - similar to: "Do colClasses in readHTMLTable (XML Package) work?"

How to suppress errors generated by readHTMLTable?

2009 Nov 26

library(XML)
download.file('http://polya.umdnj.edu/polya_db2/gene.php?llid=109079&unigene=&submit=Submit','index.html')
tables=readHTMLTable("index.html",error=function(...){})
tables

readHTMLTable gives me the following errors. Could somebody let me know how to suppress them?

Opening and ending tag mismatch: center and table
htmlParseEntityRef: expecting
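One approach that may help here is to parse the file first with htmlParse(), passing a do-nothing function as its error argument, and then hand the parsed document to readHTMLTable(). The sketch below is only an illustration under stated assumptions: it needs the XML package installed, the HTML string is made up so the example runs without a network, and the name quiet is hypothetical. It is not taken from any reply in the thread.

```r
# Assumes the XML package is installed; the markup below is invented for the demo.
library(XML)

# Deliberately malformed HTML: a stray </center> inside the table.
html <- "<html><body><table>
<tr><td>a</td><td>1</td></tr>
<tr><td>b</td><td>2</td></tr>
</center></table></body></html>"

# A handler that ignores whatever the parser reports.
quiet <- function(...) {}

# Parse once with the silent handler, keeping the recovered document.
doc <- htmlParse(html, asText = TRUE, error = quiet)

# readHTMLTable() also accepts an already parsed document.
tables <- readHTMLTable(doc, header = FALSE, stringsAsFactors = FALSE)
print(tables[[1]])
```

For a file on disk, the same pattern would use the file name in place of the string and drop asText = TRUE.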

readHTMLTable (XML package)

2013 Jan 15

Hi, I am using XML::readHTMLTable and getting the error below. Does anyone know why? Does this function not work with https? I didn't see anything in the help about that.

> library(XML)
> wampage<-readHTMLTable('https://hr-workforce-analytics.llnl.gov/wf_pi_pop.html',1)
Error in htmlParse(doc) : File https://hr-workforce-analytics.llnl.gov/wf_pi_pop.html does not exist

Dan

readHTMLTable function - unable to find an inherited method ~ for signature "NULL"

2012 Jun 14

Hi R experts, I have been playing with library(XML) recently and found that readHTMLTable works flawlessly for some websites, but for others it gives me an error like the one below:

Error in function (classes, fdef, mtable) : unable to find an inherited method for function "readHTMLTable", for signature "NULL"

Let's say, for example, this code works fine a

Problem with readHTMLTable

2012 May 26

Hello All, I was trying to simply run readHTMLTable on the example published in the package, and on a page I was working on. So running:

u = "http://en.wikipedia.org/wiki/List_of_countries_by_population"
tables = readHTMLTable(u)

returns the following error:

Error in tb[["thead"]] : subscript out of bounds

Looking up this error on the web didn't give me any hint. Is

reading tables from multiple HTML pages

2011 Aug 29

Hi, beginner to R and was having some problems scraping data from tables in html using the XML package. I have included some code below. I am trying to loop through a series of html pages, each of which contains a single table from which I want to scrape data. However, some of the pages are blank - and so it throws me an error message when it gets to htmlParse(). The loop then closes out and I
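One common way to keep a loop like this running is to wrap the parsing call in tryCatch(), so a page that fails yields NULL instead of stopping the loop. The poster's own code is not shown in this excerpt, so the sketch below is only an illustration: read_all_tables, its reader argument, and fake_reader are hypothetical names, and the stand-in reader exists purely so the example runs without a network. In practice reader might be XML::readHTMLTable.

```r
# Hypothetical helper: apply `reader` to each URL and return NULL for pages
# that raise an error (for example blank pages that htmlParse() rejects).
read_all_tables <- function(urls, reader) {
  results <- lapply(urls, function(u) {
    tryCatch(reader(u), error = function(e) {
      message("Skipping ", u, ": ", conditionMessage(e))
      NULL
    })
  })
  names(results) <- urls
  # Drop the pages that failed so only successfully read tables remain.
  Filter(Negate(is.null), results)
}

# Stand-in reader used only to demonstrate the control flow offline.
fake_reader <- function(u) {
  if (grepl("blank", u)) stop("empty document")
  data.frame(page = u, value = 1)
}

pages <- c("page1.html", "blank.html", "page3.html")
tables <- read_all_tables(pages, fake_reader)
print(names(tables))
```

Returning NULL and filtering afterwards keeps the loop body simple; logging the skipped URL with message() leaves a record of which pages were blank.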

htmlParse (from XML library) working sporadically in the same code

2013 Mar 20

I am using htmlParse from the XML library on a particular website. Sometimes the code works, but most of the time it doesn't, and I cannot see why. The file I am trying to parse is http://www.londonstockexchange.com/exchange/prices-and-markets/international-markets/indices/home/sp-500.html?page=0

Sometimes the following code works:

n<-readHTMLTable(htmlParse(url))

But most of the

Problems with time formats when importing data using readHTMLTable

2017 Jul 10

Not reproducible. [1][2][3] If our answers don't seem to apply to your situation, it will likely be because you did not explain your question clearly. Not plain text. This is a plain text mailing list, and the best-case scenario when you let your email program send HTML is that what you saw is not what we see (worst case is your email is scrambled on our end). Have you read the

issue with "strange" characters (readHTMLTable)

2011 May 05

Thank you. The line of code you give certainly resolves several of the issues. I didn't realize that font support is such a tough matter to realize. Let me express my gratitude to those who provide this for us in R. On 04-05-11, Prof Brian Ripley <ripley at stats.ox.ac.uk> wrote: Oh, please! This is about the contributed package XML, not R and not Windows. Some of

memory leak using XML readHTMLTable

2012 Sep 17

Hi, I'm using the XML package to scrape data and I'm trying to figure out how to eliminate the memory leak I'm currently experiencing. In the searches I've done, it sounds like the existence of the leak is fairly well known. What isn't as clear is exactly how to solve it. The general process I'm using is this: require(XML) myFunction <- function(URL) { html

read htm table error

2012 Aug 09

Hi, I am using R version 2.15 and I haven't been able to read an HTML table. Following are my code and the error message.

Error in htmlParse(doc) : error in creating parser for http://en.wikipedia.org/wiki/Brazil_national_football_team

theurl <- "http://en.wikipedia.org/wiki/Brazil_national_football_team"
tables <- readHTMLTable(theurl)

Regards, Kiung

Problems with time formats when importing data using readHTMLTable

2017 Jul 10

Hi, I am extracting position data from the marine traffic website. The table has a "Timestamp" column which, in the browser, appears with the format yyyy-mm-dd HH:MM (UTC), e.g. 2017-07-10 14:04 (UTC). When I import the table, the same date "2017-07-10 14:04 (UTC)" appears as "1499696500149969650021 minutes ago". This is the more recent date and time. Older

postForm() in RCurl and library RHTMLForms

2010 Nov 04

Hi RUsers, Suppose I want to see the data on the website

url <- "http://www.nseindia.com/content/indices/ind_histvalues.htm"

for the index "S&P CNX NIFTY" for dates "FromDate"="01-11-2010","ToDate"="02-11-2010" and then read the HTML table from the page using readHTMLTable(). I am using this code:

webpage <-

How to set cookies in RCurl

2012 Jun 07

Hi, I am trying to access a website and read its content. The website is a restricted-access website that I reach through a proxy server (which therefore requires me to enable cookies). I have problems getting RCurl to receive and send cookies. The following lines give me:

library(RCurl)
library(XML)
url <- "http://www.theurl.com"
content <- readHTMLTable(url)
content

Problems with time formats when importing data using readHTMLTable

2017 Jul 11

> On Jul 11, 2017, at 6:25 AM, Cristina Silva <csilva at ipma.pt> wrote:
>
> Dear Jeff,
>
> I am sorry, I didn't notice that it was not plain text. I hope that it is now in the correct format. I explain the problem again, now with more details.
> I am collecting the track positions of our research vessel from www.marinetraffic.com. In the page, the data appear in a

Problems with time formats when importing data using readHTMLTable

2017 Jul 11

Dear Jeff, I am sorry, I didn't notice that it was not plain text. I hope that it is now in the correct format. I explain the problem again, now with more details. I am collecting the track positions of our research vessel from www.marinetraffic.com. In the page, the data appear in a table: Timestamp Source Speed (kn) Latitude (°) Longitude (°) Course (°) Show on Map

Stuck ...can't get sapply and xmlTreeParse working

2011 Jul 05

Can't seem to get the code below working. It gets stuck on line 24 inside the function hm; comments show the line in question. The function hm is called by sapply and is at the bottom of the code. Other stuff above line 24 works correctly including the first couple of lines of the function hm. Should I be using a different apply function or am I doing something wrong with xmlTreeParse ?

import HTML tables

2009 May 12

Hello, I was wondering if there is a function in R that imports tables directly from an HTML document. I know there are functions (say, getURL() from {RCurl}) that download the entire page source, but here I am referring to something like Google Docs' function importHTML() (if you don't know this function, go check it out, it's very useful). Anyway, if someone knows of something that does this

Scraping data from website---Error in htmlParse: error in creating parser

2013 Feb 28

I'm trying to scrape football projections from accuscore.com for the different positions (right now the projections are set to zeros, but that will change). I can get the QB projections, but I can't get the projections for any of the other positions (e.g., RB). How can I get the RB projections? I'm not sure what the actual website for the RB and other projections is. When I go to

Problems with time formats when importing data using readHTMLTable

2017 Jul 11

Dear David, Thank you very much. You have solved my problem, not of importing in the right format but the date conversion. It works for me, and if I specify the timezone tz='GMT', I get exactly the same time of the webpage. Sincerely yours, Cristina On Tue, 11 Jul 2017 07:15:43 -0700, David Winsemius wrote: >> On Jul 11, 2017, at 6:25 AM, Cristina Silva <csilva at
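David's actual code is not shown in this excerpt. As a rough sketch of the kind of conversion being discussed, and assuming the first ten digits of the imported string are a Unix epoch in seconds, the value quoted earlier in the thread could be turned into a date-time as follows. The variable names are made up for illustration.

```r
# Assumption: the imported cell starts with a 10-digit Unix epoch (seconds).
raw <- "1499696500149969650021 minutes ago"

# Take the first ten characters and convert them to a number of seconds.
epoch <- as.numeric(substr(raw, 1, 10))

# Interpret the seconds as an instant; tz = "GMT" avoids local-time shifts.
stamp <- as.POSIXct(epoch, origin = "1970-01-01", tz = "GMT")

print(format(stamp, "%Y-%m-%d %H:%M", tz = "GMT"))
```

Without tz = "GMT" the result is displayed in the session's local time zone, which is presumably why specifying it made the output match the web page.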

issue with "strange" characters (locale settings)

2011 May 04

WinXP-x32, R-21.13.0

Dear list, I have a problem that (I think) relates to the interaction between Windows and R. I am trying to scrape a table with data on the Hawai'ian Islands. This is my code:

library(XML)
u <- "http://en.wikipedia.org/wiki/Hawaii"
tables <- readHTMLTable(u)
Islands <- tables[[5]]

The output is (first set of columns):
