Displaying 2 results from an estimated 2 matches for "minglobfreq".
2006 Oct 04
0
FW: new to R: don't understand errors
...re that you do not want to use docFrequency
with a value like 50 (it would mean that a term in a document
is only included if it appears more than 50 times in *that*
document).
I will send you the alpha-release of the updated lsa package
in a separate message which also includes a parameter called
minGlobFreq which is filtering out terms that appear less
than x times in the whole document collection. I guess that is
what you were looking for.
Considering the sanitizing: if you set minDocFreq to 1
and set minWordLength to 1, you should not get an error
with your document collection as you then are basic...
2012 Feb 22
0
LSA package: problem with textmatrix()
...his:
correlation
between
maternal
and
fetal
plasma
levels
of
glucose
and
free
fatty
acids
.
correlation
coefficients
have
been
determined
between
the
the command I am using looks like this, with the resulting error below:
>
> dtm <- textmatrix(LSAwork, stemming=TRUE, stopwords=StopListm, minGlobFreq=1, minWordLength=2, removeNumbers=TRUE)
Error in data.frame(docs = basename(file), terms = names(tab), Freq = tab, :
arguments imply differing number of rows: 1, 0
In addition: Warning message:
In FUN(c("LSAWork/med.000001", "LSAWork/med.000002", "LSAWork/med.000003&quo...