Displaying 20 results from an estimated 900 matches similar to: "htmlParse pop ups over web pages"
2013 Mar 20
1
htmlParse (from XML library) working sporadically in the same code
I am using htmlParse from the XML library on a particular website. Sometimes the code works, but most of the time it fails, and I cannot see why. The file I am trying to parse is
http://www.londonstockexchange.com/exchange/prices-and-markets/international-markets/indices/home/sp-500.html?page=0
Sometimes the following code works
n<-readHTMLTable(htmlParse(url))
But most of the
2012 May 21
1
htmlParse Error
I am trying to parse a webpage using the htmlParse command in XML package as
follows:
library(XML)
u = "http://en.wikipedia.org/wiki/World_population"
doc = htmlParse(u)
I get the following error:
Error in htmlParse(u) :
error in creating parser for http://en.wikipedia.org/wiki/World_population
I am using R 2.13.1 (32-bit version) on 64-bit Windows. (I tried
installing it in
2012 Jan 30
1
Getting htmlParse to work with Hebrew? (on windows)
Hello dear R-help mailing list.
I would like htmlParse to work well with Hebrew, but it keeps
scrambling the Hebrew text in pages I feed into it.
For example:
# why can't I parse the Hebrew correctly?
library(RCurl)
library(XML)
u = "http://humus101.com/?p=2737"
a = getURL(u)
a # Here - the Hebrew is fine.
a2 <- htmlParse(a)
a2 # Here it is a mess...
None of
2012 Sep 04
0
get only little part of html with htmlParse
Here is my code.
There are three methods to get text to be parsed by the htmlParse function.
1. File on my computer
options(encoding="gbk")
library(XML)
xmltext1 <- htmlParse("/home/tiger/Desktop/27174.htm" )
#/home/tiger/Desktop/27174.htm is the file of http://www.jb51.net/article/27174.htm downloaded on my computer.
2. URL
options(encoding="gbk")
2009 Jun 30
1
How to pass parameters to htmlParse Bank of Canada html pages
To get USDCAD rates from the Bank of Canada, we first go to
url <- "http://banqueducanada.ca/en/rates/exchange-avg.html"
select 12 months for "Rates for the past" and click the "Get Rates" button. Then
the page moves to
address <- "http://banqueducanada.ca/cgi-bin/famecgi_fdps"
and the rates show in the html page.
htmlParse() can read the html document but
2011 Sep 05
2
htmlParse hangs or crashes
Dear colleagues,
Each time I use htmlParse, R crashes or hangs. The URL I'd like to parse is included below, as are the results of a series of basic commands that describe what I'm experiencing. The results of sessionInfo() are attached at the bottom of the message.
The thing is, htmlTreeParse appears to work just fine, although it doesn't appear to contain the information I need (the
2013 Feb 21
4
Getting htmlParse to work with Hebrew? (on windows)
Hello dear R-help mailing list.
Looks like the same issue in Russian:
library(RCurl)
library(XML)
u = "http://www.cian.ru/cat.php?deal_type=2&obl_id=1&room1=1"
a = getURL(u)
a # Here - the Russian is fine.
a2 <- htmlParse(a)
a2 # Here it is a mess...
None of these seem to fix it:
htmlParse(a, encoding = "windows-1251")
htmlParse(a, encoding =
2013 Feb 28
0
Scraping data from website---Error in htmlParse: error in creating parser
I'm trying to scrape football projections from accuscore.com for the
different positions (right now the projections are set to zeros, but that
will change). I can get the QB projections, but I can't get the
projections for any of the other positions (e.g., RB). How can I get the
RB projections?
I'm not sure what the actual website for the RB and other projections is.
When I go to
2018 May 23
0
Using R htmlParse() for manipulating URLs to access multiple pages
I am trying to scrape a manual from the web. For privacy reasons, I cannot give the exact URL here, but the structure is as follows:
https://home.lala.com/bibi/blabla/chapter_i_organization/101_contracts/whatever/,DanaInfo=intranet.lala.com+
https://home.lala.com/bibi/blabla/chapter_i_organization/125_bills/,DanaInfo=intranet.lala.com+
2012 May 19
1
Try Giving Invalid Argument Type Error
Dear R Helpers,
I am getting an error message from the try function that I don't
understand so I am hoping that someone can help.
I am scraping from web pages, but sometimes they disappear. When that
happens I need to control for it with some sort of function.
This web page is parsed without a problem.
exh<-"NASDAQ"
tic<-"EGHT"
2012 Oct 17
0
postForm() in RCurl and library RHTMLForms
Hi R Users,
I want to get the data from the url given from 10/09/2012 to 15/10/2012.
I don't know how to pass the parameters.
library(RHTMLForms)
>
> ff = getHTMLFormDescription("
2011 Aug 29
1
reading tables from multiple HTML pages
Hi, beginner to R and was having some problems scraping data from tables in
html using the XML package. I have included some code below.
I am trying to loop through a series of html pages, each of which contains a
single table from which I want to scrape data. However, some of the pages
are blank - and so it throws me an error message when it gets to
htmlParse(). The loop then closes out and I
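One way to keep a loop like this running is to wrap the parse step in tryCatch() and skip the pages that fail. The following is a minimal sketch, assuming the XML package is installed. `parse_or_null`, `parse_all`, and the `parser` argument are hypothetical names introduced here for illustration. They are not part of the XML package.

```r
# Hypothetical helper: return the parsed document, or NULL if parsing fails.
# `parser` defaults to XML::htmlParse; it is exposed as an argument only so
# the error handling can be exercised without network access.
parse_or_null <- function(src, parser = XML::htmlParse) {
  tryCatch(
    parser(src),
    error = function(e) {
      message("Skipping ", src, ": ", conditionMessage(e))
      NULL
    }
  )
}

# Parse every page, then keep only the ones that succeeded.
parse_all <- function(srcs, parser = XML::htmlParse) {
  docs <- lapply(srcs, parse_or_null, parser = parser)
  names(docs) <- srcs
  Filter(Negate(is.null), docs)
}
```

Because failures come back as NULL and are filtered out, the loop never terminates early. Each surviving document could then be passed on to a table reader such as readHTMLTable().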
2012 Jun 08
0
XML htmlTreeParse fails with no obvious error
Hi all,
Sorry for the rather uninformative subject, but the error I get is not very informative either.
When using the XML and RCurl package to retrieve the content of an html page, htmlTreeParse fails, printing out the beginning of the HTML:
Error in htmlTreeParse(getURL(url)) :
File <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
2010 Mar 11
1
parse an HTML page with verbose error message (using XML)
I'm using the function htmlParse() in the XML package, and I need a
little bit of help with error handling while parsing an HTML page. So far I
can use either the default way:
# error = xmlErrorCumulator(), by default
library(XML)
doc = htmlParse("http://www.public.iastate.edu/~pdixon/stat500/")
# the error message is:
# htmlParseStartTag: invalid element name
or the tryCatch()
2008 Dec 31
1
Chinese characters encoding problem with XML
XML is a good tool for reading data from the web within R, but I wonder how I can get the encoding right.
library(XML)
url <- 'http://www.szitic.com/docc/jz-lmzq.html'
xml <- htmlTreeParse(url, useInternalNodes = TRUE)
q <- "//tbody/tr/td"
dat <- unlist(xpathApply(xml, q, xmlValue))
df <- as.data.frame(t(matrix(dat, 4)))
dt<-as.character(df[15,1])
The first column of df
2007 May 31
1
Mac OS X crash bug?
Hi all,
I want to check if this is a bug for which I should file a report.
I am using R2.5.0 on OS X 10.4.9. When I invoke the data editor and
when I change the values of individual cells, it seems to work as
intended. However, when I try to delete/add a row/column, R.app
crashes. I've attached the crash log.
Best,
-Nathan
2012 Apr 12
3
Remove superscripts from HTML objects
Is there some way to remove superscripts from objects returned by
html/xmlParse (XML package)?
h <- "<html><p>Cat<sup>a</sup></p><p>Dog</p></html>"
doc <- htmlParse(h)
xpathSApply(doc, "//p", xmlValue)
[1] "Cata" "Dog"
I could probably remove the <sup> tags from the "h" object above,
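An alternative to editing the string is to drop the <sup> nodes from the parsed tree before extracting the text. The following is a sketch only, assuming the XML package's removeNodes() and getNodeSet() behave as documented:

```r
library(XML)

h <- "<html><p>Cat<sup>a</sup></p><p>Dog</p></html>"
doc <- htmlParse(h, asText = TRUE)

# Detach every <sup> element from the tree, then read the paragraph text.
removeNodes(getNodeSet(doc, "//sup"))
txt <- xpathSApply(doc, "//p", xmlValue)
```

This modifies the parsed document in place, so the original string `h` is left untouched.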
2012 Mar 21
1
Trouble installing the XML package
Hello everyone,
I am probably not the only one having trouble with this package but here goes.
I want to install XML on Ubuntu. I installed libxml2-dev and
everything works out fine until I get the following:
Error in reconcilePropertiesAndPrototype(name, slots, prototype,
superClasses, :
No definition was found for superclass "namedList" in the
specification of class
2012 Aug 09
2
read htm table error
Hi, I am using R version 2.15 and I haven't been able to read an HTML table. The following is my code and error message.
Error in htmlParse(doc) :
error in creating parser for http://en.wikipedia.org/wiki/Brazil_national_football_team
theurl <- "http://en.wikipedia.org/wiki/Brazil_national_football_team"
tables <- readHTMLTable(theurl)
Regards,
Kiung
2019 Apr 04
2
Unable to verify of llvm sources with the .sig files
With the new signature file I was able to verify, but there was
still a bad signature: "gpg: key 0x0FC3042E345AD05D: 1 bad signature"
which I highlighted below. Didn't seem to be a problem, but thought
I'd point it out. I'd be glad to do additional tests if you'd like.
$ gpg --list-keys
/home/wink/.gnupg/pubring.kbx
-----------------------------
pub