Displaying 4 results from an estimated 4 matches for "ngramtoken".

2013 Sep 26
0
R hangs at NGramTokenizer
...yCorpus, control = list(wordLengths = c(1,Inf))) Everything works fine up to this stage, as long as I do not include tokenizing. However, when I run the code with the following alteration:
> dictCorpus <- myCorpus
> myDtm <- DocumentTermMatrix(myCorpus, control = list(wordlengths=c(1,Inf), tokenize=NGramTokenizer, dictionary=dictCorpus))
it hangs. I have kept it running overnight, but got no results. Any help would be much appreciated. Thanks -- Neep Hazarika [[alternative HTML version deleted]]
2016 Apr 05
8
RWeka Error
When I use any function of the RWeka package in RStudio, I get the error "Error in .jnew (name): java.lang.ClassFormatError." Can anyone guide me on this? [[alternative HTML version deleted]]
2009 Aug 13
1
using package tm to find phrases
I am using the package "tm" for text mining of abstracts and would like to use it to find instances of gene names that may contain white space, for instance "gene regulatory protein 1". The default behavior of tm is to parse this into 4 separate words, but I would like to use the class constructor "dictionary" to define phrases such as the one just mentioned. Is this
2016 Apr 05
0
RWeka Error
Read the Posting Guide mentioned at the bottom of this email. Highlights you should be sure to address:
* HTML formatted email gets messed up on the R mailing lists, so post in plain text. Yes, you can and need to do this.
* Make sure the problem occurs in R by trying it without RStudio. Sometimes RStudio interferes with R, and you have to ask elsewhere about such problems.
* Give us details