Displaying 20 results from an estimated 800 matches similar to: "Problem with Snowball & RWeka"
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all,
I have some troubles using the stemming algorithm provided by the tm
(text mining) + Snowball packages.
Here is my config:
MacOS 10.5
R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions)
I have installed all the needed packages (tm, rJava, rWeka, Snowball)
+ dependencies. I have desactivated AWT (like written in
2011 Jun 04
1
Problem with Snowball & RWeka
I too have this problem. Everything worked fine last year, but after
updating R and packages I can no longer do word stemming.
Unfortunately, I didn't save the old binaries, otherwise I would just
revert back.
Hoping someone finds a solution for R on Windows. Thanks!
There is a potential solution for R on Mac OS from Kurt Hornik copied
below, but I cannot get this to work on Windows.
2011 Apr 29
0
Trying to get RWeka/Snowball to work
Hi!
I was trying to install RWeka to be able to use SnowballStemmer in a Mac OS
X 10.6.7 environment... but coudn't do it... I get error messages after:
> library(RWeka);
> install(Snowball);
> ## Test the supplied vocabulary for the default stemmer ('porter'):
> source <- readLines(system.file("words", "porter","voc.txt",
+
2012 Apr 13
4
Help with stemDocument
Hi, All:
I am new to R and tm package. I'm trying to do the stemming using tm_map()
and it doesn't seem to work:
*I used:*
> stemDocument(t_cmts[[100]])
*Where t_cmts is the corpus object, the results is:*
bottle loose box abt airpak sections top plastic bottle squashed nearly
flush neck previous shipments bottle wrapped securely bubble wrap wno
bottle damage packaging poor
2011 Jul 16
0
Problem with Snowball & RWeka
The Java error when attempting to use the stemmers in the Snowball or
tm packages on Windows machines is caused by Quicktime. See prior
posts in this thread.
The workaround is to uninstall Quicktime.
After much trial and error on machines spanning WinXP/2k/Vista/7, I
finally verified this as follows:
1) Fresh installation of Windows/Java/R. Snowball package works perfectly.
2) Install Quicktime.
2012 Dec 13
2
Tamaño de la matriz de términos y memoria. Paquete TM
Hola a todos!
Tengo algunos problemas con el tamaño de la matriz de términos que obtengo. Los comandos que utilizo son los siguientes:
# carga librerias
library(tm)
library(wordcloud)
library(Rstem)
library(Snowball)
# lee el documento UTF-8 y lo convierte a ASCII
txt <-
2007 Jan 09
2
non-snowball stemmer
Hi!
I am going to use non-snowball russian stemmer with Xapian. There is a
good one at http://www.aot.ru. I've found that current implementation of
Xapian::Stem does not allow it (there is no public interface for
Xapian::Stem::Internal). Do you apply patches? Are there any
recommendations for writing patches?
Regards,
Oleg Obolenskiy
highpower at mail.ru
2009 Nov 12
2
package "tm" fails to remove "the" with remove stopwords
I am using code that previously worked to remove stopwords using package
"tm". Even manually adding "the" to the list does not work to remove "the".
This package has undergone extensive redevelopment with changes to the
function syntax, so perhaps I am just missing something.
Please see my simple example, output, and sessionInfo() below.
Thanks!
Mark
require(tm)
2012 Jan 27
2
tm package: handling contractions
I tried making a wordcloud of Obama's State of the Union address using
the tm package to process the text
sotu <- scan(file="c:/R/data/sotu2012.txt", what="character")
sotu <- tolower(sotu)
corp <-Corpus(VectorSource(paste(sotu, collapse=" ")))
corp <- tm_map(corp, removePunctuation)
corp <- tm_map(corp, stemDocument)
corp <- tm_map(corp,
2014 Jun 17
2
No es un problema de tm tienes doc.corpus vacío
No es un problema de tm ni de SnowfallC ni de mcapply (por el path
utilizas linux, en windows mcapply según el manual no va bien)
No defines bien los objetos que pasas. Pasas doc.corpus en lugar de
corpus ( o asignas a corpus en lugar de a doc.corpus) .
Depura los programas cuando salga un error de objeto, como te pone en el
Error que pasas .
Temporalmente lo tienes arreglado en
2013 Sep 03
1
tm::stemDocument function not work
https://gist.github.com/rpietro/6430771
stemDocument function doesn't seem to be working. Tried to look up and
a few people have reported the problem, but no solution that I could
find.
would appreciate any help
2014 Jun 18
2
No es un problema de tm tienes doc.corpus vacío
Creo que lo que quieres hacer necesita esta línea de código justo después de
cargar el paquete tm:
inmortal = unlist(strsplit(inmortal, " ", fixed = T))
De esta forma, trabajas con palabras, y NO con las frases enteras...
Un saludo
Isidro Hidalgo Arellano
Observatorio Regional de Empleo
Consejería de Empleo y Economía
http://www.jccm.es
> -----Mensaje original-----
> De:
2012 Apr 17
1
differents behaviour of packages depending on rJava under 32 and 64 bit versions of R 2.14.2
Hi listeRs,
Maybe I have missed something, but I am facing a problem I don't understand
even after reading archives and manuals (…maybe not carefully enough !) :
On my PC (64-bit OS, Intel Xeon CPU , two L5506@2.13 GHz processors, 12 Go
RAM) under Windows 7 SP1,
I run both 32-bit (i386-pc-mingw32/i386) and 64-bit (x86_64-pc-mingw32/x64)
version 2.14.2 (2012-02-29) of R, packages
2009 Jan 10
1
Help needed for Loading "tm" package
Howdy Gurus again
Thanks to Tony.Breyal, I was able to writing the following script for
analyzing a text document.
But I got an error with "tm' package. I don't why I got the error from the R
script below. I think I followed proccess of R tm manual.
I use R v2.8.1. and tm_0.3-3.zip under Win XP.
Thanks in advance,
Kum Hwang
> # setting directory
> my.path
2014 Jun 18
3
No es un problema de tm tienes doc.corpus vacío
Muchas gracias isidro,
a la noche reinstalo R y os digo si me ha funcionado. Perdona mi ignorancia
de novato pero no he entendido muy bien eso de avisar al desarrollador.
Entiendo que es a los de los paquetes, no?
un saludo!
ruben
El 18 de junio de 2014, 13:10, Isidro Hidalgo <ihidalgo@jccm.es> escribió:
> Ya he visto que tampoco así funciona.
> Sí te puedo decir que me ha dejado
2012 Oct 25
2
Minería de texto
Cordial Saludo
Actualmente estoy realizando una función para gráficar una nube de palabras el código que tengo es el siguiente:
library(twitteR)library(tm)library(wordcloud)library(RXKCD)library(RColorBrewer)
tweets=searchTwitter(''@afflorezr'', n=1500)
generateCorpus= function(tweets,my.stopwords=c(),min.freq){ #Install the textmining library require(tm) require(wordcloud)
2011 Jan 31
0
Problem with loading the Snowball package
I tried using the "Snowball" package for performing stemming in text mining.
But when I tried to load the package the following error is thrown:
Error : .onLoad failed in loadNamespace() for 'Snowball', details:
call: NULL
error: .onLoad failed in loadNamespace() for 'rJava', details:
call: hive[[hive$CurrentVersion]]
error: attempt to select less than one
2014 Jul 29
2
wordcloud y tabla de palabras [Avanzando]
Buenas tardes grupo. Saludos cordiales Carlos J., muchas gracias por
tu orientación. Efectivamente, me había dado cuenta que la razón por
la que no se aplicaba colnames era porque no tenía columnas. La
cuestión es que no logro visualizar completamente/claramente en qué
parte del proceso de creación del corpus se puede hacer.
Sin embargo, siguiendo el ejemplo de
2014 Nov 22
2
Problemas con tm
Estimados compañeros tengo un problema con la librería tm o con windows
8.1 o con algo que no controlo.
Hace tiempo con windows 7 y una versión anterior de R ejecutaba este código:
library(tm)
data("crude")
crude <- tm_map(crude, tolower)
tdm<-TermDocumentMatrix(crude)
y sin problemas me creaba tdm. Ahora si lo ejecuto me da el siguiente error:
Error: inherits(doc,
2014 Jul 22
2
Ayuda Error in `colnames<-`(`*tmp*`, value = c(
Buenas tardes, grupo.
Estoy tratando de hacer la comparación de dos archivos de una misma
organización para encontrar las diferencias entre su informe del tema
edl año 2005 y el del año 2013:
Todos los comandos van bien, a exepción del último "colnames", como se
ve en la siguiente secuencia:
> pdf1<-"./PLAN de INSPECCIONES/05_seguridad_ciudadana.pdf"
>