search for: stopwords_en

Displaying 6 results from an estimated 6 matches for "stopwords_en".

2006 Oct 03
1
new to R: don't understand errors
...er as previous working collections, I still get the errors. So I am wondering if it might be something in the files themselves... At any rate I routinely get these two errors. The first is generated when I include a minDocFreq=x, and it looks a little like this when I run it: > data(stopwords_en) > CCauto = textmatrix( "CultureMineTXT" , minWordLength=3, minDocFreq=50, stopwords=stopwords_en) > Error in data.frame(docs = basename(file), terms = names(tab), Freq = tab, : > arguments imply differing number of rows: 1, 0 If I remove...
2008 Mar 25
0
Error "... x must be atomic" when using lsa (latent semantic analysis) package
...ning into a problem that seems to be related to the number of documents being processed. Here's the code I'm running (after loading the lsa and rstem packages), and the error message: > SnippetsPath <- "c:\\OED\\AuditExplain\\" # path where to find text snippets > data(stopwords_en) > tdm <- textmatrix(SnippetsPath, stopwords=stopwords_en) I get this error message with ~ 280 documents: "Error in sort( unique.default(x), na.last = TRUE) : 'x' must be atomic" The error won't occur if I reduce the number of documents (say to 220, for instance). I&...
2008 Mar 25
0
Solution to: Error "... x must be atomic" when using lsa (latent semantic analysis) package
...ning into a problem that seems to be related to the number of documents being processed. Here's the code I'm running (after loading the lsa and rstem packages), and the error message: > SnippetsPath <- "c:\\OED\\AuditExplain\\" # path where to find text snippets > data(stopwords_en) > tdm <- textmatrix(SnippetsPath, stopwords=stopwords_en) I get this error message with ~ 280 documents: "Error in sort( unique.default(x), na.last = TRUE) : 'x' must be atomic" The error won't occur if I reduce the number of documents (say to 220, for instance). I&...
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all, I have some troubles using the stemming algorithm provided by the tm (text mining) + Snowball packages. Here is my config: MacOS 10.5 R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions) I have installed all the needed packages (tm, rJava, rWeka, Snowball) + dependencies. I have desactivated AWT (like written in
2017 Feb 28
0
Solr 6.4.1 config
...You may want to tweak it - this is adapted from my version 4 config and is set for English. If you look in solrbasedir/example/files/conf/lang or /solrbasedir/server/solr/configsets/basic_configs/conf/lang you'll find alternatives for "stopwords.txt" - I simply copied the "stopwords_en.txt" file directly to "conf/stopwords.txt". Assuming you extracted the Solr distribution to a base folder, such as /opt/solr-6.4.1, change to that folder. Then execute (again, /tmp/solr-dovecot is the path you extracted my config into): bin/solr create_collection -c dovecot -d /...
2017 Feb 09
3
Solr 6.4.1
Does this work (pcap attached)? Daniel On 2/8/2017 10:57 PM, Aki Tuomi wrote: > > On 09.02.2017 07:54, Daniel Miller wrote: >> I've been running Solr for a while (4.10.3) - wanted to make the jump >> to the latest & greatest. I installed 6.4.1, copied over my >> schema.xml - and after a couple false starts where I needed to tweak >> it work with the new