Displaying 1 result from an estimated 1 matches for "happpended".
Did you mean:
happended
2011 Apr 18
0
Help with cleaning a corpus
Hi!
I created a corpus and I started to clean through this piece of code:
txt <-tm_map(txt,removeWords, stopwords("spanish"))
txt <-tm_map(txt,stripWhitespace)
txt <-tm_map(txt,tolower)
txt <-tm_map(txt,removeNumbers)
txt <-tm_map(txt,removePunctuation)
But something happpended: some of the documents in the corpus became empty,
this is a problem when i try to make a document term matrix with tfidf.
Is there any way to eliminate automatically a document if it become empty?
Or manually, how could i get the lenght of every document?
hope you can help me! thanks a lot...