Displaying 6 results from an estimated 6 matches for "replaceentities".
2011 Apr 06
1
Treatment of xml-stylesheet processing instructions in XML module
Hello again,
Another stumble here that is defeating me.
I try:
a<-readLines(url("http://feeds.feedburner.com/grokin"))
t<-XML::xmlTreeParse(a, ignoreBlanks=TRUE, replaceEntities=FALSE,
asText=TRUE)
elem<- XML::getNodeSet(XML::xmlRoot(t),"/rss/channel/item")[[1]]
And I get:
Start tag expected, '<' not found
Error: 1: Start tag expected, '<' not found
When I modify the second line in "a" to remove the following (just
leaving the...
2013 Mar 20
1
htmlParse (from XML library) working sporadically in the same code
...))
But most of the time it would return the following error coming from htmlParse:
Error: failed to load HTTP resource
Error is coming from the following line in htmlParse code:
ans <- .Call("RS_XML_ParseTree", as.character(file), handlers, as.logical(ignoreBlanks), as.logical(replaceEntities), as.logical(asText), as.logical(trim), as.logical(validate), as.logical(getDTD), as.logical(isURL), as.logical(addAttributeNamespaces), as.logical(useInternalNodes), as.logical(isHTML), as.logical(isSchema), as.logical(fullNamespaceInfo), as.character(encoding), as.logical(useDotNames), xinclude,...
2011 Jun 07
1
XML segfault on some architectures
...ed Sep 29 13:30:21 EDT 2010 x86_64 x86_64 x86_64 GNU/Linux
> x <- xmlTreeParse(fileName)
*** caught segfault ***
address 0x500001c4f, cause 'memory not mapped'
Traceback:
1: .Call("RS_XML_ParseTree", as.character(file), handlers, as.logical(ignoreBlanks), as.logical(replaceEntities), as.logical(asText), as.logical(trim), as.logical(validate), as.logical(getDTD), as.logical(isURL), as.logical(addAttributeNamespaces), as.logical(useInternalNodes), FALSE, as.logical(isSchema), as.logical(fullNamespaceInfo), as.character(encoding), as.logical(useDotNames), xinclud...
2012 May 15
1
KEGGSOAP installation error
...stalling help indices
** building package indices
** testing if installed package can be loaded
*** caught segfault ***
address 0x500001c4f, cause 'memory not mapped'
Traceback:
1: .Call("RS_XML_ParseTree", as.character(file), handlers,
as.logical(ignoreBlanks), as.logical(replaceEntities),
as.logical(asText), as.logical(trim), as.logical(validate),
as.logical(getDTD), as.logical(isURL),
as.logical(addAttributeNamespaces), as.logical(useInternalNodes), FALSE,
as.logical(isSchema), as.logical(fullNamespaceInfo),
as.character(encoding), as.logical(useDotNames), xinclud...
2011 Apr 06
0
Curious treatment of entities in xmlTreeParse
...AIK, this is a correct encoding,
Parsing this with the following two lines followed by inspecting "t"
shows that the & references have been translated to "&" while other
entity refs have not.
a<-readLines(url(as.character(feeds[2,2])))
t<-XML::xmlTreeParse(a, replaceEntities=FALSE, asText=TRUE)
I'm guessing this is what breaks things when I try to do things with tm:
rss2Reader <- readXML(
spec = list(
Author = list("node", "/item/creator"),
Content = list("node", "/item/description"),
DateTimeStamp = list("f...
2010 Oct 22
0
NUT and AsciiDoc (was: Next AsciiDoc release)
...se_starttag(i)
> File "/usr/lib/python2.6/HTMLParser.py", line 249, in parse_starttag
> attrvalue = self.unescape(attrvalue)
> File "/usr/lib/python2.6/HTMLParser.py", line 387, in unescape
> return re.sub(r"&(#?[xX]?(?:[0-9a-fA-F]+|\w{1,8}));", replaceEntities,
> s)
> File "/usr/lib/python2.6/re.py", line 151, in sub
> return _compile(pattern, 0).sub(repl, string, count)
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 6:
> ordinal not in range(128)
>
> This error is due to a limitation...