Displaying 20 results from an estimated 900 matches similar to: "XML and RCurl: problem with encoding (htmlTreeParse)"
2009 Dec 31
3
XML and RCurl: problem with encoding (htmlTreeParse)
Hi,
I'm trying to get data from web page and modify it in R. I have a
problem with encoding. I'm not able to get
encoding right in htmlTreeParse command. See below
> library(RCurl)
> library(XML)
>
> site <- getURL("http://www.aarresaari.net/jobboard/jobs.html")
> txt <- readLines(tc <- textConnection(site)); close(tc)
> txt <- htmlTreeParse(txt,
2009 Oct 15
1
Removing Embedded Null characters from text/html
Hi,
I'm trying to download some data from the web and am running into
problems with 'embedded null' characters. These seem to indicate to R
that it should stop processing the page so I'd like to remove them.
I've been looking around and can't seem to identify exactly what the
character is and consequently how to remove it.
# THE CODE WORKS ON THIS PAGE
library(RCurl)
2008 Dec 17
1
Extract Data from a Webpage
Hi All:
I would like to extract the provider name, address, and phone number
from multiple webpages like this:
http://oasasapps.oasas.state.ny.us/portal/pls/portal/oasasrep.providersearch.take_to_rpt?P1=3489&P2=11490
Based on searching R-help archives, it seems like the XML package
might have something useful for this task. I can load the XML package
and supply the url as an argument to
2012 Jun 07
1
How to set cookies in RCurl
Hi,
I am trying to access a website and read its content. The website is a
restricted access website that I access through a proxy server (which
therefore requires me to enable cookies). I have problems in allowing Rcurl
to receive and send cookies.
The following lines give me:
library(RCurl)
library(XML)
url <- "http://www.theurl.com"
content <- readHTMLTable(url)
content
2012 Aug 09
2
read htm table error
Hi I am using Version R 2.15 and I haven't been able read html table. Following is my code and error message.
Error in htmlParse(doc) :
error in creating parser for http://en.wikipedia.org/wiki/Brazil_national_football_team
theurl <- "http://en.wikipedia.org/wiki/Brazil_national_football_team"
tables <- readHTMLTable(theurl)
Regards,
Kiung
[[alternative HTML version
2010 Oct 06
2
Converting scraped data
Dear Colleagues,
I used this code to scrape data from the URL conatined within. This
code should be reproducible.
require("XML")
library(XML)
theurl <- "http://www.queensu.ca/cora/_trends/mip_2006.htm"
tables <- readHTMLTable(theurl)
n.rows <- unlist(lapply(tables, function(t) dim(t)[1]))
class(tables)
test<-data.frame(tables, stringsAsFactors=FALSE)
2010 Oct 10
1
Create single vector after looping through multiple data frames with GREP
Hello all,
I changed the subject line of the e-mail, because the question I''m posing now is different than the first one. I hope that this is proper etiquette. However, the original chain is included below.
I've incorporated bits of both Ethan and Brian's code into the script below, but there's one aspect I can't get my head around. I'm totally new to programming
2011 Nov 16
1
Checking for monotonic sequence
I am scraping data from a web page using XML (excellent package BTW - that's scraping data the easy way!).
So far, I've got the code:
tables <- readHTMLTable(theurl)
rhf <- tables$tabResHistFull
div1 <- rhf[which(rhf$V1=="Div ps"),]
div1
which is giving me the result:
V1 V2 V3 V4 V5 V6 V7 V8 V9 V10 V11 V12 V13 V14 V15
15
2017 Nov 09
3
Not able to list domain in new samba DC
It’s Centos 7 and I thought all I had to do was set up nsswitch.conf for it to work.
cordially yours,
Sina Owolabi
Mob: +2348034022578
Skype: darkchild2011
On 9 Nov 2017, 4:24 PM +0100, Rowland Penny via samba <samba at lists.samba.org>, wrote:
> On Thu, 9 Nov 2017 15:58:04 +0100
> Sina Owolabi <notify.sina at gmail.com> wrote:
>
> > Yes I did setup libnss_winbind.
2012 Jun 08
0
XML htmlTreeParse fails with no obvious error
Hi all,
Sorry for the rather uninformative subject, but the error I get is not very informative either.
When using the XML and RCurl package to retrieve the content of an html page, htmlTreeParse fails, printing out the beginning of the HTML:
Error in htmlTreeParse(getURL(url)) :
File <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
2011 Aug 25
1
R hangs after htmlTreeParse
Dear colleagues,
I'm trying to parse the html content from this webpage:
2009 Oct 14
2
puzzle using gsub (and encodings maybe)
Hello,
Below is some output that shows my issue.
I have a variable x that I read from a file (more on this below)
> x
[1] "NEW YORK NEW ENGLAND"
> gsub(" -", "-", x) # this does not work!
[1] "NEW YORK NEW ENGLAND"
> Encoding(x) # is x in a special encoding? no
[1] "unknown"
> y = "NEW YORK -NEW
2010 Mar 15
1
XML: Slower parsing over time with htmlTreeParse()
Sorry, I listed the wrong package in the header of my previous post!
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Dear List,
has anyone of you experienced a significant increase in the time it takes to
parse an URL via "htmlTreeParse()" when this function is called
2010 Mar 15
0
RMySQL: Slower parsing over time with htmlTreeParse()
Dear List,
has anyone of you experienced a significant increase in the time it takes to
parse an URL via "htmlTreeParse()" when this function is called repeatedly
every minute over a couple of hours?
Initially, a single parse takes about 0.5 seconds on my machine (Quad Core,
2.67 GHz, 8 MB RAM, Windows 7 64 Bit), . After some time, this can go up to
15 seconds or more.
2008 Nov 04
2
How to suppress errors from htmlTreeParse() function in XML package?
Dear R-help,
The following code downloads an html document into variable 'doc' and
then stores an internal representation into variable 'html.tree'. Even
if the html code is malformed, this still works which is fantastic.
However, as in the example below, i do get some ouput from R in the
console which i would like to suppress somehow, so i can keep my
window a bit cleaner.
I
2009 Nov 29
2
Time Series Rating Model
To R programming experts,
I am a undergraduate student, and now doing research personally. I apply diagonal bivariate poisson (R package "bivpois") with stochatics weighted function (refer to dixoncoles97 section 4.5 to 4.7). However I dont know how to fit this stochatical weighted function to the completed bivariate poisson model.
I know that some other references for dynamic soccer
2017 Nov 09
2
Not able to list domain in new samba DC
Yes I did setup libnss_winbind.
wbinfo -u and -g on the domain member both work:
[root at testfsrv ~]# wbinfo -u
SAMDOM\testakin
SAMDOM\testsina
SAMDOM\testigein
SAMDOM\administrator
SAMDOM\krbtgt
SAMDOM\guest
[root at testfsrv ~]# wbinfo -g
SAMDOM\allowed rodc password replication group
SAMDOM\enterprise read-only domain controllers
SAMDOM\denied rodc password replication group
SAMDOM\read-only
2008 Jan 25
1
Join me on Last.fm!
Hi asterisk-users at lists.digium.com,
Add me as a friend on Last.fm so we can share our music taste :)?
Check out what I'm listening to: http://www.last.fm/user/shina01/?invitedby=shina01&tp=ff_tp_b
--------
I also sent you a personal note:
"boo!"
Signing up is free and takes less than a minute.
Just click the link to automatically become my friend.
2001 Feb 28
1
Install samba for Aix 4.3
Hi all
Have anybody installed Samba on Aix 4.3? I try to install ,but failed. I have download samba 2.0.7 , when I want to run configure according to the Step 1 of Unix_install.txt ,system always report "syntax error". So i can't set env for compiling it.
Can anybody help me?, pls write detail install step for Aix4.3.
Thx a lot
Felix
______________________________________
2015 Jan 20
1
Samba 4 and Windows Rights Management
On 20/01/15 20:15, Marc Muehlfeld wrote:
> Hello Sina,
>
> Am 20.01.2015 um 18:48 schrieb Sina Owolabi:
>> Please I need to answer this question... I have a friendly organization
>> profiling Linux software solutions as an alternative to a Windows-based
>> back end.
>> I've been asked to find out if there's an alternative to the Windows Rights
>>