Displaying 1 result from an estimated 1 matches for "docfrequency".
2006 Oct 04
0
FW: new to R: don't understand errors
...y insight to my lsa problems (also listed below) would be of great
> help.
from what I see, the problem probably indeed lies within the
textfiles: for performance reasons, it was not possible to
include any "check" routines that exclude a file if it contains
no words (or words below a docFrequency) and thus produces
an empty column-vector.
I am pretty sure that you do not want to use docFrequency
with a value like 50 (it would mean that a term in a document
is only included if it appears more than 50 times in *that*
document).
I will send you the alpha-release of the updated lsa package
in...