similar to: [R] how to build TermDocMatrix in tm text mining package of R

2009 Jan 10
1
Help needed for Loading "tm" package
Howdy Gurus again. Thanks to Tony.Breyal, I was able to write the following script for analyzing a text document. But I got an error with the "tm" package. I don't know why I got the error from the R script below. I think I followed the process in the R tm manual. I use R v2.8.1 and tm_0.3-3.zip under Win XP. Thanks in advance, Kum Hwang > # setting directory > my.path
2009 Jan 15
1
How to Solve the Error (error: cannot allocate vector of size 1.1 Gb)
Hi, Gurus. Thanks to your kind help, I have managed to start using a text mining package called "tm" in R under Win XP. However, while running the tm package, I ran into another problem, this time with memory. What is the best way to solve this memory problem: adding physical RAM, or some other remedy? ############################### ###### my R
2007 Feb 14
1
Any packages for conducting AHP (Analytic Hierarchy Process) data
Hi, R Lovers! I have some survey data. I'd like to use R or R packages to process data from an AHP (Analytic Hierarchy Process) survey. Are there any R packages or substitutes for analyzing data from an AHP survey? Thanks in advance, -- Kum-Hoe Hwang, Ph.D. Phone : 82-31-250-3516 Email : phdhwang@gmail.com
2006 Oct 19
2
How to get multiple Correlation Coefficients
Hi, I have used the polycor package for categorical correlation coefficients. I ran the following script, but there were no results. Could you tell me how to correct the script? Thanks in advance, vars <- names(sdi) for (i in 1:length(vars)) { for (j in 1:length(vars)) { paste(vars[i]," and ", vars[j]) polychor(vars[i], vars[j]) # corr } } -- Kum-Hoe Hwang, Ph.D. Phone :
2005 Jul 06
2
How to sample x-y coordinates from GIS files
Hi Gurus! I have a job that requires drawing random samples from point-based GIS data (so-called shape GIS files) with the total sum restricted. For example, I would like to take random samples of up to 1000 persons in each city. The randomly sampled persons should not exceed 1000 in any case. Thank you in advance, -- Kum-Hoe Hwang, Ph.D. Phone : 82-31-250-3283 Email : phdhwang@gmail.com
2006 Mar 05
2
What is the scale function? Is it for variable transformation?
Howdy, I read in R books about the scale function for variable transformation. According to one book, the scale function leads to better regression results. Or am I wrong? I hope somebody can tell me about the scale function. Is it for variable transformation? -- Kum-Hoe Hwang, Phone : 82-31-250-3516 Email : phdhwang@gmail.com
2005 Feb 11
3
How to solve error : "cannot allocate vector of size 1208235 Kb"
Howdy R gurus! I am a newbie to R. I use R 2.0.1 on Windows XP. When I run R I get the following memory error. My physical memory size is 3 Gb. R hit the memory problem when it reached about 2 Gb. Thanks in advance, > library(spdep) > sfr.lagsarlm <- lagsarlm(sfr.data$Bldgsqft ~ sfr.data$Ncounty + sfr.data$Nugb + sfr.data$Ngroup, data=sfr.data, listw=sfr.listw,
2006 May 18
1
how to get coefficients of regression or Anova
Hi R Gurus! I conducted a regression and anova as follows: Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) 6.07e-01 5.95e-02 10.19 < 2e-16 *** nemp 2.87e-06 1.04e-07 27.63 < 2e-16 *** as.factor(corridor1)A -8.81e-02 2.13e-02 -4.14 3.6e-05 *** as.factor(corridor1)B
2010 Feb 16
2
Error of Stepwise Regression with number of rows in use has changed: remove missing values?
Howdy, R Gurus. I have enjoyed R, but there is one problem I cannot solve easily. Please help. When I ran the R script, I got the following error. The error results from an input data file exported from Excel spreadsheet software. Error in step(lm(pop.rate ~ as.numeric(year) + as.factor(policy) + as.numeric(nation.grant) + ... : number of rows in use has changed: remove missing values?
2006 May 18
1
how to get correct coefficients from lm model
Howdy, I apologize for the duplicate posting, but I decided to correct my previous post. I got the regression results using r <- lm(Y ~ nemp + as.factor(devt), data=d). First, here is the result of anova(r), in which I could not find the regression coefficients. Response: Y Df Sum Sq Mean Sq F value Pr(>F) nemp 1 58.2 58.2 1233.23 < 2e-16 ***
2010 Apr 23
2
Library (tm) Error: could not find function "TermDocMatrix".
Hi List, I have the following code and error. I have tried other code and get the same problem. > reut21578 <- system.file("texts", "crude", package = "tm") > (r <- Corpus(DirSource(reut21578), readerControl = list(reader = readReut21578XMLasPlain))) A corpus with 20 text documents > (r <- Corpus(DirSource(reut21578), readerControl =
2006 Oct 12
1
How to Get Categorical Correlation Coefficient
Howdy Gurus ! I have a different correlation result from the same data. The "corridor1" string variable is expressed as a number like the "corridor2" number variable. -------------------------------------------------------------------------- > levels(corridor1) [1] "A" "B" "C" "D" "E" "F" >
2005 Feb 19
2
best analysis method : for time series and cross sectional data
Howdy, what I'd like to analyze with a large data set on building permits is the time series effect of urban policy on buildings, as well as cross-sectional effects, if any. In 1990 the specialZone urban policy was introduced. I guess that the effects of this specialZone policy would differ among counties. There are counties that do not welcome this specialZone forced to design it. One of
2007 Dec 12
1
tm package - how to transform a TermDocMatrix to a data.frame
An embedded and charset-unspecified text was scrubbed... Name: não disponível Url: https://stat.ethz.ch/pipermail/r-help/attachments/20071212/2483b4bc/attachment.pl
2009 Mar 30
1
Help with tm association analysis and Rgraphviz installation.
THANK YOU IN ADVANCE. Question 1: I saved two txt files in C:\textfile. Each txt file contains only one text column, and both have 100 records. I know the term “research” occurs 49 times, so I want to find out which other words are correlated with this word, and I got tons of associations of ‘1’. I tried other terms, and no
2011 Jan 24
1
Extracting information from text data
Hi R-Users, Thanks in advance. I am using R-2.12.0 on Windows XP. I am trying to produce an n x m matrix from text data stored in different files, where n is the number of words (say w1, w2, …, wn) and m is the number of documents (say d1, d2, …, dm). A. Using package tm: I am using package tm to do the job. I have provided the code below: > my.corpus <- Corpus(DirSource(my.path),
2009 Apr 17
0
question about the Text Mining package tm
Hello. I am trying to work with the text mining package tm. I have a directory called textsTweet1 which contains three files short.txt myTextFile.txt myTextFile.csv short.txt contains one line: THE CAT IN THE HAT\n myTextFile contains some tweets from Twitter. The first few lines of myTextFile.txt are: @oliviamunn I miss a good Yakaniku...I miss Japan...I NEED COCO EVERYBODY. I NEED TO GET ON
2007 Nov 30
0
How to Compute envelope of Khat in Splancs Package
Howdy Gurus, I am trying to compute the envelope of Khat from simulations of complete spatial randomness using the R package "Splancs." I got the following error: > UL.khat <- Kenv.csr(length(X_coord), bnd, nsim=100, s) Doing simulation 1 Error in runif(n, min, max) : invalid arguments In addition: Warning message: NAs introduced by coercion I think that I have a problem with
2008 Oct 18
2
sorting matrix output alphabetically
Hello, I have been using the TM package to create a TermDocMatrix, which I have saved as a matrix so that I can view word frequencies. Below is a section of the code that I have used and an excerpt of the output: What I wanted to be able to do is to view the output alphabetically - rather than the results being sorted by frequency as below, that an alphabetical list would be generated. This
2006 Jul 15
0
How to Interpret Results of Regression in R
Howdy, Gurus. I am applying an R package for regression analysis as follows. The dependent variable is jhnet, the ratio of internal trips to all trips in a traffic zone. There are many independent variables, including factor or dummy variables such as parkfee, ohouse, Devt2, corridor1.