Anindya Sankar Dey
2015-Jun-08 06:54 UTC
[R] Text Mining - Remove punctuation not removing quotes and dashes
Hi,
I have been doing some text mining. I created the document-term matrix (DTM)
using the following steps:
library(tm)

corpus1 <- VCorpus(VectorSource(resume1$Dat1))
corpus1 <- tm_map(corpus1, content_transformer(tolower))
dtm <- DocumentTermMatrix(corpus1,
                          control = list(removePunctuation = TRUE,
                                         removeNumbers = TRUE,
                                         removeSparseTerms = TRUE,
                                         stopwords = TRUE))
After running all of this, I am still getting terms like -quotation, "fun,
model", etc.
What can I do about it? I do not need these dashes and extra quotation marks.
--
Anindya Sankar Dey
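
One possible cause, stated here as an assumption because the data is not shown, is that the leftover quotes and dashes are typographic Unicode characters (curly quotes, en or em dashes) rather than plain ASCII punctuation. A minimal sketch of a workaround is to clean the text with a Unicode-aware regular expression before building the corpus. The helper name `strip_punct` and the sample strings are hypothetical; only base R is used.

```r
# Hypothetical helper (not part of tm): remove ASCII and Unicode punctuation.
strip_punct <- function(x) {
  # \p{P} matches Unicode punctuation (curly quotes, en/em dashes, ...),
  # \p{S} matches symbols; perl = TRUE is needed for these classes.
  x <- gsub("[\\p{P}\\p{S}]+", "", x, perl = TRUE)
  # Collapse any runs of whitespace left behind and trim both ends.
  trimws(gsub("\\s+", " ", x))
}

# Made-up sample strings resembling the terms reported above.
samples <- c("-quotation", "\u201cfun, model\u201d", "a \u2013 b")
cleaned <- strip_punct(samples)
print(cleaned)
```

If this matches the problem, the helper could be applied to the text column before `VectorSource()`, or passed to `tm_map()` through `content_transformer()` in the same way as `tolower` in the post. Note that it also removes apostrophes and intra-word hyphens, which may or may not be wanted.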