Displaying 8 results from an estimated 8 matches for "basictextgatherer".
2012 May 14
3
Scraping a web page.
...he <a> tag is always on it's own line so some kind of line-by-line grep should suffice once I figure out the following:
What is the best package/command to use to get the source of a web page. I tried using something like:
if(url.exists("http://www.omegahat.org/RCurl")) {
h = basicTextGatherer()
curlPerform(url = "http://www.omegahat.org/RCurl", writefunction = h$update)
# Now read the text that was cumulated during the query response.
h$value()
}
which works except that I get one long streamed html doc without the line breaks.
Thanks in advance for your help,
KW
[...
2007 Nov 12
1
Microsoft SOAP - Help!!
Hello,
I am trying to access Microsoft Live Search Using SOAP through R.
In R I am using the RCurl packages to make the calls.
I have the following situation that looks crazy and cannot figure out how to
solve it:
#SOAP Request
library(RCurl)
h = basicTextGatherer()
body='<?xml version="1.0" encoding="ISO-8859-15"?>
<SOAP-ENV:Envelope
SOAP-ENV:encodingStyle=" http://schemas.xmlsoap.org/soap/encoding/"
xmlns:SOAP-ENV="http://schemas.xmlsoap.org/soap/envelope/"
xmlns:xsd=" http://www.w3.org/2001/XMLSch...
2007 Jul 13
1
R and HTTP get 'has file changed'
Is there a way, maybe using Duncan TL's RCurl, to efficiently test whether
an URL such as
http://$CRAN/src/contrib/
has changed? I.e. one way is via a test of a page in that directory as per
(sorry about the long line, and this would be on Linux with links and awk
installed)
> strptime(system("links -width 160 -dump http://cran.r-project.org/src/contrib/ | awk
2011 Feb 11
0
RCurl - HTTP request of header ONLY
...www.something.com/
However, I can't figure out how to do this when using 'getURL()', for
example.
Here's what I tried:
FIRST TRY
txt <- getURL("http://www.something.com/", verbose=TRUE, header=TRUE)
cat(txt)
This gives me header AND content.
SECOND TRY
headers <- basicTextGatherer()
txt <- getURL("http://www.something.com/", header=TRUE, trace=TRUE,
headerfunction=headers$update)
cat(headers$value())
This gives me the header, but the content is also requested and sent to
'txt'.
I was looking for a RCurl option like 'head', but only found ...
2013 Apr 24
0
string size limits in RCurl
...st(userpwd = paste(user, ":", key, sep = ""),
httpheader = "Content-Type: application/json")
request <- paste(.base.url, q, sep = "")
txt <- getURL(url = request, curl = session, .opts = curl.opts,
write = basicTextGatherer())
or
r = dynCurlReader()
curlPerform(url = request, writefunction = r$update, curl = session,
.opts = curl.opts)
My guess is that the `update` or `value` functions in the `basicTextGather` or `dynCurlReader` text handler objects are having trouble with the large strings....
2010 Apr 16
0
RCurl slow when sending data over 1kb
I am using RCurl's curlPerform command to send an XML string to an HTTP server running on the localhost. The command is something like this:
reader <- basicTextGatherer()
curlPerform(url="http://127.0.0.1/",
httpheader=c('Content-Type' = "text/xml; charset=utf-8"),
postfields=toString.XMLNode(xmlRoot(xdoc)),
writefunction=reader$update,
.opts=curlOptions(noproxy="*"))
When sending a short string, the web server responds...
2008 May 07
1
[BioC] RCurl loading problem with 64 bit linux distribution
...html latex
> > RCurl-internal text html latex
> > Note: removing empty section \seealso
> > base64 text html latex example
> > basicHeaderGatherer text html latex example
> > basicTextGatherer text html latex example
> > Note: removing empty section \details
> > binaryBuffer text html latex example
> > chunkToLineReader text html latex example
> > Note: removing empty section \value
&...
2008 May 07
1
[BioC] RCurl loading problem with 64 bit linux distribution
...html latex
> > RCurl-internal text html latex
> > Note: removing empty section \seealso
> > base64 text html latex example
> > basicHeaderGatherer text html latex example
> > basicTextGatherer text html latex example
> > Note: removing empty section \details
> > binaryBuffer text html latex example
> > chunkToLineReader text html latex example
> > Note: removing empty section \value
&...