Displaying 4 results from an estimated 4 matches for "readreut21578xmlasplain".
2010 Apr 23
2
Library (tm) Error: could not find function "TermDocMatrix".
Hi List
I have the next code and the error. I have try with other codes and I have
the same problem.
> reut21578 <- system.file("texts", "crude", package = "tm")
> (r <- Corpus(DirSource(reut21578), readerControl = list(reader =
> readReut21578XMLasPlain)))
A corpus with 20 text documents
> (r <- Corpus(DirSource(reut21578), readerControl = list(reader =
> readReut21578XMLasPlain)))> >
> summary(r)
A corpus with 20 text documents
The metadata consists of 2 tag-value pairs and a data frame
Available tags are:
create_date crea...
2009 Dec 11
0
readHTML within tm package
...rpus.
However, when I try to use that routine I get an error. When I run
getReaders (below) readHTML isn't listed.
> getReaders()
[1] "readDOC" "readGmane"
[3] "readPDF" "readReut21578XML"
[5] "readReut21578XMLasPlain" "readPlain"
[7] "readRCV1" "readTabular"
I'm a missing something? Is there an extra install I'm missing, or has the
routine been removed or replaced?
Thanks, Peter
Oh, yes, running the latest R release on Mac OS 10.6.2
--...
2010 Feb 04
1
How to read HTML or TEXT file with tm package
??????????????????????????????????????????...
????: ????
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20100204/a3069c99/attachment.pl>
2010 Feb 16
0
tm package
Hi,
I'm using version 0.5.1 of tm package with R 2.10.1. It looks to me
as if after the following
reuters21578 <- Corpus(DirSource(corpusDir), readerControl =
list(reader = readReut21578XMLasPlain))
reuters21578 <- tm_map(reuters21578, stripWhitespace)
reuters21578 <- tm_map(reuters21578, tolower)
reuters21578 <- tm_map(reuters21578, removePunctuation)
reuters21578 <- tm_map(reuters21578, removeNumbers)
reuters21578.dtm <- DocumentTermMatrix(reuters21578)...