Displaying 20 results from an estimated 800 matches similar to: "Help needed for Loading "tm" package"
2009 Jan 09
1
[R] how to build TermDocMatrix in tm text mining package of R
Howdy Gurus,
I'd like to ask how to build a TermDocMatrix in the tm text mining
package.
It is not clear to me how to import a plain text file and then convert that
text file into a TermDocMatrix.
How can I build a TermDocMatrix from a plain text document file for text
association?
Or are there any good manuals?
Thank you in advance,
--
Kum-Hoe Hwang, Ph.D.
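For later readers, here is a minimal sketch of building a term-document matrix. It assumes a current tm release, where the class is called TermDocumentMatrix rather than TermDocMatrix. The sample texts are made up for illustration; for a real file the character vector could come from readLines().

```r
library(tm)

# Made-up sample texts; for a real file use e.g. docs <- readLines("myfile.txt")
docs <- c("Research on text mining in R",
          "Text mining research with the tm package",
          "A note on regression models")

corpus <- VCorpus(VectorSource(docs))        # one document per vector element
tdm <- TermDocumentMatrix(corpus,
                          control = list(removePunctuation = TRUE,
                                         stopwords = TRUE))
inspect(tdm)                                 # terms in rows, documents in columns
findAssocs(tdm, "research", corlimit = 0.5)  # terms correlated with "research"
```

With only a handful of documents most correlations come out as exactly 1 or -1, so association scores only become informative on a larger corpus.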
2009 Jan 15
1
How to Solve the Error (Error: cannot allocate vector of size 1.1 Gb)
Hi, Gurus
Thanks to your kind help, I have managed to start using a text
mining package called "tm" in R under Windows XP.
However, while running the tm package, I ran into another problem, this time with memory.
What is the best way to solve this memory problem: adding more
physical RAM, or some other remedy?
###############################
###### my R
2010 Apr 23
2
Library (tm) Error: could not find function "TermDocMatrix".
Hi List
I have the following code and error. I have tried other code and I get
the same problem.
> reut21578 <- system.file("texts", "crude", package = "tm")
> (r <- Corpus(DirSource(reut21578), readerControl = list(reader = readReut21578XMLasPlain)))
A corpus with 20 text documents
> (r <- Corpus(DirSource(reut21578), readerControl =
2009 Mar 30
1
Help with tm association analysis and Rgraphviz installation.
THANK YOU IN ADVANCE
Question 1:
I saved two txt files in C:\textfile
Each txt file contains only one text column, and both have 100 records.
I know the term “research” occurs 49 times, so I want to find out which other
words are correlated with this word, and I got tons of associations of ‘1’.
I tried other terms, and no
2006 Oct 19
2
How to get multiple Correlation Coefficients
Hi
I have used the polycor package for categorical correlation coefficients.
I ran the following script, but there were no results.
Could you tell me how to correct the script?
Thanks in advance,
vars <- names(sdi)
for (i in 1:length(vars)) {
  for (j in 1:length(vars)) {
    paste(vars[i]," and ", vars[j])
    polychor(vars[i], vars[j])
    # corr
  }
}
--
Kum-Hoe Hwang, Ph.D. Phone :
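The loop in this post prints nothing because values are not auto-printed inside a for loop, and because polychor() is handed the column names rather than the columns themselves. Below is a minimal sketch of one way around both. The helper pairwise_stat() is hypothetical, the data are made up, and base R's cor() stands in for polychor() so the example runs without extra packages.

```r
# Hypothetical helper: apply a two-argument statistic to every pair of columns
# and collect the results in a matrix.
pairwise_stat <- function(df, fun) {
  vars <- names(df)
  out <- matrix(NA_real_, length(vars), length(vars),
                dimnames = list(vars, vars))
  for (i in seq_along(vars)) {
    for (j in seq_along(vars)) {
      # pass the columns themselves, not their names
      out[i, j] <- fun(df[[vars[i]]], df[[vars[j]]])
    }
  }
  out
}

# Made-up data standing in for sdi
demo <- data.frame(a = c(1, 2, 3, 4, 5),
                   b = c(2, 1, 4, 3, 5),
                   c = c(5, 4, 3, 2, 1))
res <- pairwise_stat(demo, cor)
print(round(res, 2))   # explicit print(), as needed inside loops and scripts
```

With the polycor package installed, pairwise_stat(sdi, polycor::polychor) should give the polychoric version, assuming every column of sdi is an ordered categorical variable. The diagonal compares each variable with itself and can be skipped.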
2005 Jul 06
2
How to sample x-y coordinates from GIS files
Hi Gurus!
I have a job that requires drawing random samples from point-based GIS data
(so-called shape GIS files) with the total sum restricted.
For example, I would like to take random samples of up to 1000 persons in
each city.
The number of randomly sampled persons should not exceed 1000 in any case.
Thank you in advance,
--
Kum-Hoe Hwang, Ph.D.
Phone : 82-31-250-3283
Email : phdhwang@gmail.com
2006 Mar 05
2
What is the scale function? Is it for variable transformation?
Howdy
I read in R books about the scale function for variable transformation.
According to this book,
the scale function leads to better regression results. Or am I wrong?
I hope somebody can tell me about the scale function.
Is it for variable transformation?
--
Kum-Hoe Hwang, Phone : 82-31-250-3516 Email : phdhwang@gmail.com
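For reference, here is a small sketch of what scale() does, using made-up numbers. By default it subtracts each column's mean and divides by its standard deviation. That is a linear rescaling, so in lm() it changes the units of the coefficients but not the fitted values.

```r
# Made-up data for illustration
x <- c(10, 20, 30, 40, 50)
y <- c(1.2, 1.9, 3.2, 3.8, 5.1)

z <- scale(x)                      # one-column matrix: (x - mean(x)) / sd(x)
manual <- (x - mean(x)) / sd(x)    # the same computation by hand
print(cbind(scaled = as.numeric(z), manual = manual))

fit_raw    <- lm(y ~ x)
fit_scaled <- lm(y ~ scale(x))
# Same fit, different units: the slope is multiplied by sd(x)
print(coef(fit_raw))
print(coef(fit_scaled))
print(all.equal(unname(fitted(fit_raw)), unname(fitted(fit_scaled))))
```

So scaling by itself does not make a regression fit better. It mainly makes coefficients comparable across predictors measured in different units.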
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all,
I have some trouble using the stemming algorithm provided by the tm
(text mining) + Snowball packages.
Here is my config:
MacOS 10.5
R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions)
I have installed all the needed packages (tm, rJava, RWeka, Snowball)
+ dependencies. I have deactivated AWT (as written in
2007 Feb 14
1
Any packages for conducting AHP (Analytic Hierarchy Process) data
Hi, R Lovers!
I have some survey data. I'd like to use R or R packages to process data
entered
from an AHP (Analytic Hierarchy Process) survey.
Are there any R packages or substitutes for processing data from an AHP survey?
Thanks in advance,
--
Kum-Hoe Hwang, Ph.D. Phone : 82-31-250-3516 Email : phdhwang@gmail.com
2006 May 18
1
how to get coefficients of regression or Anova
Hi R Gurus!
I conducted a regression and an anova as follows:
Coefficients:
                       Estimate Std. Error t value Pr(>|t|)
(Intercept)            6.07e-01   5.95e-02   10.19  < 2e-16 ***
nemp                   2.87e-06   1.04e-07   27.63  < 2e-16 ***
as.factor(corridor1)A -8.81e-02   2.13e-02   -4.14  3.6e-05 ***
as.factor(corridor1)B
2005 Feb 11
3
How to solve error : "cannot allocate vector of size 1208235 Kb"
Howdy R gurus !
I am a newbie to R.
I use R 2.0.1 on Windows XP. When I run R,
I get the following memory error.
My physical memory size is 3 Gb.
R hit the memory problem when it reached
about 2 Gb.
Thanks in advance,
> library(spdep)
> sfr.lagsarlm <- lagsarlm(sfr.data$Bldgsqft ~ sfr.data$Ncounty + sfr.data$Nugb + sfr.data$Ngroup, data=sfr.data, listw=sfr.listw,
2011 Jan 24
1
Extracting information from text data
Hi R-Users,
Thanks in advance.
I am using R-2.12.0 on Windows XP.
I am trying to produce an n x m matrix from text data stored in different files, where n is the number of words (say w1, w2, …, wn) and m is the number of documents (say d1, d2, …, dm).
A. Using package tm
I am using package tm to do the job. I have provided the code below:
> my.corpus <- Corpus(DirSource(my.path),
2008 Oct 18
2
sorting matrix output alphabetically
Hello,
I have been using the tm package to create a TermDocMatrix, which I
have saved as a matrix so that I can view word frequencies. Below is
a section of the code that I have used and an excerpt of the output:
What I want to be able to do is view the output alphabetically, so that,
rather than the results being sorted by frequency as below, an
alphabetical list would be generated. This
2007 Dec 12
1
tm package - how to transform a TermDocMatrix to a data.frame
An embedded and charset-unspecified text was scrubbed...
Name: não disponível
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20071212/2483b4bc/attachment.pl
2006 May 18
1
how to get correct coefficients from lm model
Howdy
I apologize for the duplicate posting, but I decided to correct my previous
posting.
I obtained the regression results using
r <- lm(Y ~ nemp + as.factor(devt), data=d).
First, here is the result of anova(r). Here I could not find the regression
coefficients.
Response: Y
     Df Sum Sq Mean Sq F value Pr(>F)
nemp  1   58.2    58.2 1233.23 < 2e-16 ***
2010 Feb 16
2
Error of Stepwise Regression with number of rows in use has changed: remove missing values?
Howdy, R Gurus
I have enjoyed R, but there is one problem I cannot easily solve. Please help.
When I tried the R script, I got the following error. This error
results from an input data file exported from Excel spreadsheet
software.
Error in step(lm(pop.rate ~ as.numeric(year) + as.factor(policy) +
as.numeric(nation.grant) + … :
  number of rows in use has changed: remove missing values?
2009 Apr 17
0
question about the Text Mining package tm
Hello. I am trying to work with the text mining package tm.
I have a directory called textsTweet1 which contains three files
short.txt
myTextFile.txt
myTextFile.csv
short.txt contains one line: THE CAT IN THE HAT\n
myTextFile contains some tweets from Twitter. The first few lines of
myTextFile.txt are:
@oliviamunn I miss a good Yakaniku...I miss Japan...I NEED COCO EVERYBODY. I
NEED TO GET ON
2006 Mar 05
2
RWeka
Hi,
I downloaded RWeka successfully. (At least I do not see, where I could have made a mistake.) Then I tried to load it by the library-command. To my surprise this did not work. Result:
library(RWeka)
Fehler in .jinit(c(system.file("jar", "weka.jar", package = "RWeka"), system.file("jar", :
Cannot create Java Virtual Machine
Fehler: .onLoad in
2009 Aug 15
1
Error in running RWeka Clusterers
Hi,
I have a question about using RWeka Clusterers. If you could supply an answer or
insight, I would really appreciate it.
When I run simple code that uses a clusterer from RWeka, I get an error.
The sample code and error are shown below.
Code:
library(RWeka)
Cobweb(iris[,-5],control=NULL)
Error:
Error in names(class_ids) <- nms :
'names' attribute [150] must be the same
2005 Oct 31
2
nls() fit to Kahneman/Tversky function
Dear WizaRds,
I would like to fit a curve to ten points with nls() for one
unknown parameter gamma in the Kahneman/Tversky function, but somehow
it won't work and I am unable to locate my mistake.
p.kum <- seq(0.1,1, by=0.1)
felt.prob.kum <- c(0.16, 0.23, 0.36, 0.49, 0.61, 0.71, 0.85, 0.89, 0.95,
1) ## how to find a function that fits these points nicely?
plot(p.kum,