Displaying 4 results from an estimated 4 matches for "readreut21578xmlasplain".

2010 Apr 23
2
Library (tm) Error: could not find function "TermDocMatrix".
Hi List, I have the following code and error. I have tried other code and get the same problem. > reut21578 <- system.file("texts", "crude", package = "tm") > (r <- Corpus(DirSource(reut21578), readerControl = list(reader = readReut21578XMLasPlain))) A corpus with 20 text documents > summary(r) A corpus with 20 text documents The metadata consists of 2 tag-value pairs and a data frame Available tags are: create_date crea...
2009 Dec 11
0
readHTML within tm package
...rpus. However, when I try to use that routine I get an error. When I run getReaders (below), readHTML isn't listed. > getReaders() [1] "readDOC" "readGmane" [3] "readPDF" "readReut21578XML" [5] "readReut21578XMLasPlain" "readPlain" [7] "readRCV1" "readTabular" Am I missing something? Is there an extra install I'm missing, or has the routine been removed or replaced? Thanks, Peter Oh, yes, running the latest R release on Mac OS 10.6.2 --...
2010 Feb 04
1
How to read HTML or TEXT file with tm package
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20100204/a3069c99/attachment.pl>
2010 Feb 16
0
tm package
Hi, I'm using version 0.5.1 of the tm package with R 2.10.1. It looks to me as if after the following: reuters21578 <- Corpus(DirSource(corpusDir), readerControl = list(reader = readReut21578XMLasPlain)) reuters21578 <- tm_map(reuters21578, stripWhitespace) reuters21578 <- tm_map(reuters21578, tolower) reuters21578 <- tm_map(reuters21578, removePunctuation) reuters21578 <- tm_map(reuters21578, removeNumbers) reuters21578.dtm <- DocumentTermMatrix(reuters21578)...