Displaying 6 results from an estimated 6 matches for "stopwords_en".
2006 Oct 03
1
new to R: don't understand errors
...er as
previous working collections, I still get the errors. So I am wondering
if it might be something in the files themselves...
At any rate I routinely get these two errors. The first is generated
when I include a minDocFreq=x, and it looks a little like this when I
run it:
> data(stopwords_en)
> CCauto = textmatrix( "CultureMineTXT" , minWordLength=3,
minDocFreq=50, stopwords=stopwords_en)
> Error in data.frame(docs = basename(file), terms = names(tab),
Freq = tab, :
> arguments imply differing number of rows: 1, 0
If I remove...
2008 Mar 25
0
Error "... x must be atomic" when using lsa (latent semantic analysis) package
...ning
into a problem that seems to be related to the number of documents being
processed. Here's the code I'm running (after loading the lsa and rstem
packages), and the error message:
> SnippetsPath <- "c:\\OED\\AuditExplain\\" # path where to find text
snippets
> data(stopwords_en)
> tdm <- textmatrix(SnippetsPath, stopwords=stopwords_en)
I get this error message with ~ 280 documents: "Error in sort(
unique.default(x), na.last = TRUE) : 'x' must be atomic"
The error won't occur if I reduce the number of documents (say to 220, for
instance). I&...
2008 Mar 25
0
Solution to: Error "... x must be atomic" when using lsa (latent semantic analysis) package
...ning
into a problem that seems to be related to the number of documents being
processed. Here's the code I'm running (after loading the lsa and rstem
packages), and the error message:
> SnippetsPath <- "c:\\OED\\AuditExplain\\" # path where to find text
snippets
> data(stopwords_en)
> tdm <- textmatrix(SnippetsPath, stopwords=stopwords_en)
I get this error message with ~ 280 documents: "Error in sort(
unique.default(x), na.last = TRUE) : 'x' must be atomic"
The error won't occur if I reduce the number of documents (say to 220, for
instance). I&...
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all,
I have some troubles using the stemming algorithm provided by the tm
(text mining) + Snowball packages.
Here is my config:
MacOS 10.5
R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions)
I have installed all the needed packages (tm, rJava, rWeka, Snowball)
+ dependencies. I have desactivated AWT (like written in
2017 Feb 28
0
Solr 6.4.1 config
...You may want to tweak it - this is adapted
from my version 4 config and is set for English. If you look in
solrbasedir/example/files/conf/lang or
/solrbasedir/server/solr/configsets/basic_configs/conf/lang you'll find
alternatives for "stopwords.txt" - I simply copied the
"stopwords_en.txt" file directly to "conf/stopwords.txt".
Assuming you extracted the Solr distribution to a base folder, such as
/opt/solr-6.4.1, change to that folder.
Then execute (again, /tmp/solr-dovecot is the path you extracted my
config into):
bin/solr create_collection -c dovecot -d /...
2017 Feb 09
3
Solr 6.4.1
Does this work (pcap attached)?
Daniel
On 2/8/2017 10:57 PM, Aki Tuomi wrote:
>
> On 09.02.2017 07:54, Daniel Miller wrote:
>> I've been running Solr for a while (4.10.3) - wanted to make the jump
>> to the latest & greatest. I installed 6.4.1, copied over my
>> schema.xml - and after a couple false starts where I needed to tweak
>> it work with the new