search for: basictextgatherer

Displaying 8 results from an estimated 8 matches for "basictextgatherer".

2012 May 14
3
Scraping a web page.
...he <a> tag is always on it's own line so some kind of line-by-line grep should suffice once I figure out the following: What is the best package/command to use to get the source of a web page. I tried using something like: if(url.exists("http://www.omegahat.org/RCurl")) { h = basicTextGatherer() curlPerform(url = "http://www.omegahat.org/RCurl", writefunction = h$update) # Now read the text that was cumulated during the query response. h$value() } which works except that I get one long streamed html doc without the line breaks. Thanks in advance for your help, KW [...
2007 Nov 12
1
Microsoft SOAP - Help!!
Hello, I am trying to access Microsoft Live Search Using SOAP through R. In R I am using the RCurl packages to make the calls. I have the following situation that looks crazy and cannot figure out how to solve it: #SOAP Request library(RCurl) h = basicTextGatherer() body='<?xml version="1.0" encoding="ISO-8859-15"?> <SOAP-ENV:Envelope SOAP-ENV:encodingStyle=" http://schemas.xmlsoap.org/soap/encoding/" xmlns:SOAP-ENV="http://schemas.xmlsoap.org/soap/envelope/" xmlns:xsd=" http://www.w3.org/2001/XMLSch...
2007 Jul 13
1
R and HTTP get 'has file changed'
Is there a way, maybe using Duncan TL's RCurl, to efficiently test whether an URL such as http://$CRAN/src/contrib/ has changed? I.e. one way is via a test of a page in that directory as per (sorry about the long line, and this would be on Linux with links and awk installed) > strptime(system("links -width 160 -dump http://cran.r-project.org/src/contrib/ | awk
2011 Feb 11
0
RCurl - HTTP request of header ONLY
...www.something.com/ However, I can't figure out how to do this when using 'getURL()', for example. Here's what I tried: FIRST TRY txt <- getURL("http://www.something.com/", verbose=TRUE, header=TRUE) cat(txt) This gives me header AND content. SECOND TRY headers <- basicTextGatherer() txt <- getURL("http://www.something.com/", header=TRUE, trace=TRUE, headerfunction=headers$update) cat(headers$value()) This gives me the header, but the content is also requested and sent to 'txt'. I was looking for a RCurl option like 'head', but only found ...
2013 Apr 24
0
string size limits in RCurl
...st(userpwd = paste(user, ":", key, sep = ""), httpheader = "Content-Type: application/json") request <- paste(.base.url, q, sep = "") txt <- getURL(url = request, curl = session, .opts = curl.opts, write = basicTextGatherer()) or r = dynCurlReader() curlPerform(url = request, writefunction = r$update, curl = session, .opts = curl.opts) My guess is that the `update` or `value` functions in the `basicTextGather` or `dynCurlReader` text handler objects are having trouble with the large strings....
2010 Apr 16
0
RCurl slow when sending data over 1kb
I am using RCurl's curlPerform command to send an XML string to an HTTP server running on the localhost. The command is something like this: reader <- basicTextGatherer() curlPerform(url="http://127.0.0.1/", httpheader=c('Content-Type' = "text/xml; charset=utf-8"), postfields=toString.XMLNode(xmlRoot(xdoc)), writefunction=reader$update, .opts=curlOptions(noproxy="*")) When sending a short string, the web server responds...
2008 May 07
1
[BioC] RCurl loading problem with 64 bit linux distribution
...html latex > > RCurl-internal text html latex > > Note: removing empty section \seealso > > base64 text html latex example > > basicHeaderGatherer text html latex example > > basicTextGatherer text html latex example > > Note: removing empty section \details > > binaryBuffer text html latex example > > chunkToLineReader text html latex example > > Note: removing empty section \value &...
2008 May 07
1
[BioC] RCurl loading problem with 64 bit linux distribution
...html latex > > RCurl-internal text html latex > > Note: removing empty section \seealso > > base64 text html latex example > > basicHeaderGatherer text html latex example > > basicTextGatherer text html latex example > > Note: removing empty section \details > > binaryBuffer text html latex example > > chunkToLineReader text html latex example > > Note: removing empty section \value &...