search for: xmlerrorcumulator

Displaying 4 results from an estimated 4 matches for "xmlerrorcumulator".

2010 Mar 11
1
parse an HTML page with verbose error message (using XML)
I'm using the function htmlParse() in the XML package, and I need a little bit help on error handling while parsing an HTML page. So far I can use either the default way: # error = xmlErrorCumulator(), by default library(XML) doc = htmlParse("http://www.public.iastate.edu/~pdixon/stat500/") # the error message is: # htmlParseStartTag: invalid element name or the tryCatch() approach: # error = NULL, errors to be caught by tryCatch() tryCatch({ doc = htmlParse("http://www.pu...
2010 Mar 15
1
XML: Slower parsing over time with htmlTreeParse()
...4 Bit), . After some time, this can go up to 15 seconds or more. I''ve tried garbage collect, "catalogClearTable()" (though I don''t think that has anything to do with the issue) and lately wondered if it maybe had to do with the accumulation of errors over time ("xmlErrorCumulator()"). Are parsing errors cumulated globally in the workspace over distinct calls to this function? If so, is there a way to "clean the buffer"? I would greatly appreciate if anyone had an idea about how to keep request/parsing time fairly constant at the initial low level of 0.5 s...
2010 Mar 15
0
RMySQL: Slower parsing over time with htmlTreeParse()
...4 Bit), . After some time, this can go up to 15 seconds or more. I''ve tried garbage collect, "catalogClearTable()" (though I don''t think that has anything to do with the issue) and lately wondered if it maybe had to do with the accumulation of errors over time ("xmlErrorCumulator()"). Are parsing errors cumulated globally in the workspace over distinct calls to this function? If so, is there a way to "clean the buffer"? I would greatly appreciate if anyone had an idea about how to keep request/parsing time fairly constant at the initial low level of 0.5 s...
2008 Nov 04
2
How to suppress errors from htmlTreeParse() function in XML package?
Dear R-help, The following code downloads an html document into variable 'doc' and then stores an internal representation into variable 'html.tree'. Even if the html code is malformed, this still works which is fantastic. However, as in the example below, i do get some ouput from R in the console which i would like to suppress somehow, so i can keep my window a bit cleaner. I