similar to: Troubles with stemming (tm + Snowball packages) under MacOS

Displaying 20 results from an estimated 600 matches similar to: "Troubles with stemming (tm + Snowball packages) under MacOS"

2011 Jun 04
1
Problem with Snowball & RWeka
I too have this problem. Everything worked fine last year, but after updating R and packages I can no longer do word stemming. Unfortunately, I didn't save the old binaries, otherwise I would just revert back. Hoping someone finds a solution for R on Windows. Thanks! There is a potential solution for R on Mac OS from Kurt Hornik copied below, but I cannot get this to work on Windows.
2011 Mar 24
2
Problem with Snowball & RWeka
Dear Forum, when I try to use SnowballStemmer() I get the following error message: "Could not initialize the GenericPropertiesCreator. This exception was produced: java.lang.NullPointerException" It seems to have something to do with either Snowball or RWeka, however I can't figure out, what to do myself. If you could spend 5 minutes of your valuable time, to help me or give me a
2012 Dec 13
2
Tamaño de la matriz de términos y memoria. Paquete TM
Hola a todos! Tengo algunos problemas con el tamaño de la matriz de términos que obtengo. Los comandos que utilizo son los siguientes: # carga librerias library(tm) library(wordcloud) library(Rstem) library(Snowball) # lee el documento UTF-8 y lo convierte a ASCII txt <-
2012 Apr 13
4
Help with stemDocument
Hi, All: I am new to R and tm package. I'm trying to do the stemming using tm_map() and it doesn't seem to work: *I used:* > stemDocument(t_cmts[[100]]) *Where t_cmts is the corpus object, the results is:* bottle loose box abt airpak sections top plastic bottle squashed nearly flush neck previous shipments bottle wrapped securely bubble wrap wno bottle damage packaging poor
2011 Apr 29
0
Trying to get RWeka/Snowball to work
Hi! I was trying to install RWeka to be able to use SnowballStemmer in a Mac OS X 10.6.7 environment... but coudn't do it... I get error messages after: > library(RWeka); > install(Snowball); > ## Test the supplied vocabulary for the default stemmer ('porter'): > source <- readLines(system.file("words", "porter","voc.txt", +
2009 Nov 12
2
package "tm" fails to remove "the" with remove stopwords
I am using code that previously worked to remove stopwords using package "tm". Even manually adding "the" to the list does not work to remove "the". This package has undergone extensive redevelopment with changes to the function syntax, so perhaps I am just missing something. Please see my simple example, output, and sessionInfo() below. Thanks! Mark require(tm)
2012 Feb 26
2
tm_map help
Hi all, I am trying to do some text mining with twitter and I am getting the error: Error in structure(names(sapply(possibleCompletions, "[", 1)), names = x) : 'names' attribute [1] must be the same length as the vector [0] When I use tm_map. Has anyone had/seen this error before? The code I have is shown below and this error only occurs with #qantas, hashtags like #asx,
2012 Jan 27
2
tm package: handling contractions
I tried making a wordcloud of Obama's State of the Union address using the tm package to process the text sotu <- scan(file="c:/R/data/sotu2012.txt", what="character") sotu <- tolower(sotu) corp <-Corpus(VectorSource(paste(sotu, collapse=" "))) corp <- tm_map(corp, removePunctuation) corp <- tm_map(corp, stemDocument) corp <- tm_map(corp,
2014 Jun 17
2
No es un problema de tm tienes doc.corpus vacío
No es un problema de tm ni de SnowfallC ni de mcapply (por el path utilizas linux, en windows mcapply según el manual no va bien) No defines bien los objetos que pasas. Pasas doc.corpus en lugar de corpus ( o asignas a corpus en lugar de a doc.corpus) . Depura los programas cuando salga un error de objeto, como te pone en el Error que pasas . Temporalmente lo tienes arreglado en
2007 Jan 09
2
non-snowball stemmer
Hi! I am going to use non-snowball russian stemmer with Xapian. There is a good one at http://www.aot.ru. I've found that current implementation of Xapian::Stem does not allow it (there is no public interface for Xapian::Stem::Internal). Do you apply patches? Are there any recommendations for writing patches? Regards, Oleg Obolenskiy highpower at mail.ru
2014 Jun 18
2
No es un problema de tm tienes doc.corpus vacío
Creo que lo que quieres hacer necesita esta línea de código justo después de cargar el paquete tm: inmortal = unlist(strsplit(inmortal, " ", fixed = T)) De esta forma, trabajas con palabras, y NO con las frases enteras... Un saludo Isidro Hidalgo Arellano Observatorio Regional de Empleo Consejería de Empleo y Economía http://www.jccm.es > -----Mensaje original----- > De:
2014 Jun 18
3
No es un problema de tm tienes doc.corpus vacío
Muchas gracias isidro, a la noche reinstalo R y os digo si me ha funcionado. Perdona mi ignorancia de novato pero no he entendido muy bien eso de avisar al desarrollador. Entiendo que es a los de los paquetes, no? un saludo! ruben El 18 de junio de 2014, 13:10, Isidro Hidalgo <ihidalgo@jccm.es> escribió: > Ya he visto que tampoco así funciona. > Sí te puedo decir que me ha dejado
2011 Sep 05
0
Stemming functions only work on the last word of plain text documents
Hello, I want to use the SnowballStemmer on a collection of plain text documents. However, when I apply it to my corpus using the tm_map function it only stems the last word of each document (The problem is the for wordStem and stemDocument does not work at all).  An example: > path <- c("c:\path\to\directory")       # collection of plain text documents > corp <-
2011 Jul 28
2
R CMD CHECK doens't run configure when testing install? (Revised)
I'm trying to get ready to submit a package to CRAN, but in order for the package to install on OS X, I need to temporarily set the environment variable NOAWT=1. I put 'export NOAWT=1' in my package's 'configure' script, and 'R CMD INSTALL MyPackage' returns with no errors. However when I do 'R CMD CHECK MyPackage', the installation test fails, because
2011 Jul 28
2
R CMD CHECK doens't run configure when testing install? (Revised)
I'm trying to get ready to submit a package to CRAN, but in order for the package to install on OS X, I need to temporarily set the environment variable NOAWT=1. I put 'export NOAWT=1' in my package's 'configure' script, and 'R CMD INSTALL MyPackage' returns with no errors. However when I do 'R CMD CHECK MyPackage', the installation test fails, because
2013 Sep 26
0
R hangs at NGramTokenizer
Hi: I try to construct a Document-Term Meatrix from a corpus. The commands I used are: > library(parallel)> library(tm)> library(RWeka)> library(topicmodels)> library(RTextTools)> cl=makeCluster(detectCores())> invisible(clusterEvalQ(cl, library(tm)))> invisible(clusterEvalQ(cl, library(RWeka))) > invisible(clusterEvalQ(cl, library(topicmodels)))>
2011 Jan 31
0
Problem with loading the Snowball package
I tried using the "Snowball" package for performing stemming in text mining. But when I tried to load the package the following error is thrown: Error : .onLoad failed in loadNamespace() for 'Snowball', details: call: NULL error: .onLoad failed in loadNamespace() for 'rJava', details: call: hive[[hive$CurrentVersion]] error: attempt to select less than one
2011 Jul 16
0
Problem with Snowball & RWeka
The Java error when attempting to use the stemmers in the Snowball or tm packages on Windows machines is caused by Quicktime. See prior posts in this thread. The workaround is to uninstall Quicktime. After much trial and error on machines spanning WinXP/2k/Vista/7, I finally verified this as follows: 1) Fresh installation of Windows/Java/R. Snowball package works perfectly. 2) Install Quicktime.
2011 Apr 18
0
Help with cleaning a corpus
Hi! I created a corpus and I started to clean through this piece of code: txt <-tm_map(txt,removeWords, stopwords("spanish")) txt <-tm_map(txt,stripWhitespace) txt <-tm_map(txt,tolower) txt <-tm_map(txt,removeNumbers) txt <-tm_map(txt,removePunctuation) But something happpended: some of the documents in the corpus became empty, this is a problem when i try to make a
2009 Apr 27
5
ruby jasper report
Hi, I am trying to create a pdf using ruby jasper. For this I am referring this url:- http://oldwiki.rubyonrails.org/rails/pages/howtointegratejasperreports I have done everything whatever is in that link. Still I am not able to get the pdf. I am not getting any error in log file. My log file is:- "Processing AccountController#customer_report (for 127.0.0.1 at 2009-04-27 13:19:34) [GET]