2011 Sep 12
findFreqTerms vs minDocFreq in Package 'tm'
...w:
findFreqTerms identifies 3140 words that appear at least 5 times, but
minDocFreq identifies only 659 terms. Can someone please explain the reason
for the difference, or whether I have misunderstood their definitions?
> tdm1 <- TermDocumentMatrix(tr1, control = list(weighting = weightBin))
> freq_terms <- findFreqTerms(tdm1, lowfreq = 5, highfreq = Inf)
> str(freq_terms)
 chr [1:3140] "abc" "abil" "abl" "abnorm" "abort" "absenc" ...
> tdm2 <- TermDocumentMatrix(tr1, control = list(minDocFreq = 5, minWordLength = 1))
> str(tdm2)...