Erin Hodgess
2014-Dec-05 03:53 UTC
[R] using inspect with a TermDocumentMatrix to convert to a data frame
Hello! I am working through the "Social Media Mining with R" book and I have something that is a bit problematic. Here is the code: hash2_tdm <- TermDocumentMatrix(hash2_corpus) print(hash2_tdm) print(findFreqTerms(hash2_tdm,lowfreq=10)) hash3_tdm <- removeSparseTerms(hash2_tdm,0.92) hash3.df <- as.data.frame(inspect(hash3_tdm)) Now when the hash3.df is created, the entire data frame is printed on the console. That's ok if the data frame is relatively small, but is not acceptable for a large data frame. Has anyone run into this before, please? I have tried all kinds of other options for converting to a data frame, but to no avail. This is on R-3.1.2, on Ubuntu 14.0.4 Thanks! Sincerely, Erin -- Erin Hodgess Associate Professor Department of Mathematical and Statistics University of Houston - Downtown mailto: erinm.hodgess at gmail.com [[alternative HTML version deleted]]
Wush Wu
2014-Dec-05 05:48 UTC
[R] using inspect with a TermDocumentMatrix to convert to a data frame
Dear Erin, For the issue of printing big data.frame, you could define a customized `print.data.frame` in the user environment to prevent R prints all the data. For example: ```r print.data.frame <- function(df) { base::print.data.frame(head(df)) cat("===\n") base::print.data.frame(tail(df)) } ``` Hope that helps. Regards, Wush 2014-12-05 11:53 GMT+08:00 Erin Hodgess <erinm.hodgess at gmail.com>:> Hello! > > I am working through the "Social Media Mining with R" book and I have > something that is a bit problematic. > > Here is the code: > > hash2_tdm <- TermDocumentMatrix(hash2_corpus) > print(hash2_tdm) > print(findFreqTerms(hash2_tdm,lowfreq=10)) > hash3_tdm <- removeSparseTerms(hash2_tdm,0.92) > > hash3.df <- as.data.frame(inspect(hash3_tdm)) > > Now when the hash3.df is created, the entire data frame is printed on the > console. That's ok if the data frame is relatively small, but is not > acceptable for a large data frame. > > Has anyone run into this before, please? I have tried all kinds of other > options for converting to a data frame, but to no avail. > > > This is on R-3.1.2, on Ubuntu 14.0.4 > > Thanks! > Sincerely, > Erin > > > -- > Erin Hodgess > Associate Professor > Department of Mathematical and Statistics > University of Houston - Downtown > mailto: erinm.hodgess at gmail.com > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >[[alternative HTML version deleted]]
Erin Hodgess
2014-Dec-05 14:03 UTC
[R] using inspect with a TermDocumentMatrix to convert to a data frame
Great! Thank you! On Fri, Dec 5, 2014 at 12:48 AM, Wush Wu <wush978 at gmail.com> wrote:> Dear Erin, > > For the issue of printing big data.frame, you could define a customized > `print.data.frame` in the user environment > > to prevent R prints all the data. For example: > > ```r > print.data.frame <- function(df) { > base::print.data.frame(head(df)) > cat("===\n") > base::print.data.frame(tail(df)) > } > ``` > > Hope that helps. > > Regards, > Wush > > > 2014-12-05 11:53 GMT+08:00 Erin Hodgess <erinm.hodgess at gmail.com>: > >> Hello! >> >> I am working through the "Social Media Mining with R" book and I have >> something that is a bit problematic. >> >> Here is the code: >> >> hash2_tdm <- TermDocumentMatrix(hash2_corpus) >> print(hash2_tdm) >> print(findFreqTerms(hash2_tdm,lowfreq=10)) >> hash3_tdm <- removeSparseTerms(hash2_tdm,0.92) >> >> hash3.df <- as.data.frame(inspect(hash3_tdm)) >> >> Now when the hash3.df is created, the entire data frame is printed on the >> console. That's ok if the data frame is relatively small, but is not >> acceptable for a large data frame. >> >> Has anyone run into this before, please? I have tried all kinds of other >> options for converting to a data frame, but to no avail. >> >> >> This is on R-3.1.2, on Ubuntu 14.0.4 >> >> Thanks! >> Sincerely, >> Erin >> >> >> -- >> Erin Hodgess >> Associate Professor >> Department of Mathematical and Statistics >> University of Houston - Downtown >> mailto: erinm.hodgess at gmail.com >> >> [[alternative HTML version deleted]] >> >> ______________________________________________ >> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see >> https://stat.ethz.ch/mailman/listinfo/r-help >> PLEASE do read the posting guide >> http://www.R-project.org/posting-guide.html >> and provide commented, minimal, self-contained, reproducible code. >> > >-- Erin Hodgess Associate Professor Department of Mathematical and Statistics University of Houston - Downtown mailto: erinm.hodgess at gmail.com [[alternative HTML version deleted]]