similar to: question about the Text Mining package tm

Displaying 20 results from an estimated 110 matches similar to: "question about the Text Mining package tm"

2009 May 24
2
accuracy of a neural net
Hi. I started with a file which was a sparse 982x923 matrix and where the last column was a variable to be predicted. I did principle component analysis on it and arrived at a new 982x923 matrix. Then I ran the code below to get a neural network using nnet and then wanted to get a confusion matrix or at least know how accurate the neural net was. I used the first 22 principle components only for
2010 Apr 23
2
Library (tm) Error: could not find function "TermDocMatrix".
Hi List I have the next code and the error. I have try with other codes and I have the same problem. > reut21578 <- system.file("texts", "crude", package = "tm") > (r <- Corpus(DirSource(reut21578), readerControl = list(reader = > readReut21578XMLasPlain))) A corpus with 20 text documents > (r <- Corpus(DirSource(reut21578), readerControl =
2009 Jan 09
1
[R} how to build TermDocMatrix in tm text mining package of R
Howdy Gurus I 'd like to ask a question about how to build TermDocMatrix in tm text mining package. It is not clear about importing a plain text file, and them converting that text file into TermDocMatrix file, etc to me. How can I build a TermDocMatrix of " a plain text document file for text association? Or are there any good manuals? Thank you in advance, -- Kum-Hoe Hwang, Ph.D.
2011 Jan 24
1
Extracting information from text data
Hi R-Users,   Thanks in advance.   I am using R-2.12.0 on Windows XP.   I am trying to produce an n X m matrix from text data stored in different files. Where n = number of words (say w1, w2, …, wn). M is the number of documents (say d1, d2, …, dm)   A. Using package tm   I am using package tm to do the job. I have provided the code below:   > my.corpus <- Corpus(DirSource(my.path),
2003 Dec 11
4
Probelm with read.table
Hi All, I have the following text file (mytextfile.txt) 738307 527178 714456 557955 #N/A 17.42 6.22 4.73 #N/A 17.3 6.23 4.75 #N/A 17.29 6.17 4.7 #N/A 17.07 6.12 4.6 #N/A 17.27 6.19 4.7 #N/A 17.72 6.4 4.78 #N/A 17.12 6.19 4.75 #N/A 17.07 6.15 4.65 #N/A 17.03 6.07 4.64 #N/A 17.38 6.13 4.7 #N/A 17.38 6.13 4.7 #N/A 17.38 6.13 4.7 #N/A 17.38 6.13 4.7 #N/A
2008 Oct 18
2
sorting matrix output alphabetically
Hello, I have been using the TM package to create a TermDocMatrix, which I have saved as a matrix so that I can view word frequencies. Below is a section of the code that I have used and an excerpt of the output: What I wanted to be able to do is to view the output alphabetically - rather than the results being sorted by frequency as below, that an alphabetical list would be generated. This
2009 Jan 10
1
Help needed for Loading "tm" package
Howdy Gurus again Thanks to Tony.Breyal, I was able to writing the following script for analyzing a text document. But I got an error with "tm' package. I don't why I got the error from the R script below. I think I followed proccess of R tm manual. I use R v2.8.1. and tm_0.3-3.zip under Win XP. Thanks in advance, Kum Hwang > # setting directory > my.path
2012 Feb 27
2
export 3D array to .txt file
Hey, I have an array with the dimensions 200x6x200 and I want to save it in a textfile. I want to be able to reload the text file to get the same array again: new.array<-load(mytextfile.txt) or something similar. I was not able to figure out a way to do that with write.table without having to save 200 text files. Is there another way? Thank you! Frauke -- View this message in context:
2007 Jul 14
0
Extending Matrix class
This is from a private question which I'm given permission to answer in public: >>>>> "IF" == Ingo Feinerer <h0125130 at wu-wien.ac.at> >>>>> on Fri, 13 Jul 2007 16:14:07 +0200 writes: IF> Hello, We tried to derive a class from Matrix but had IF> some problems. Maybe you can help us: library("Matrix") m <-
2009 Mar 30
1
Help with tm assocation analysis and Rgraphviz installation.
Help with tm assocation analysis and Rgraphviz installation. THANK YOU IN ADVANCE Question 1: I saved two txt file in C:\textfile And each txt file contents only one text column, and both have 100 records. I know term “research” occurs 49 times, so I want to find out which other words are correlated to this word, and I got tons of association ‘1’ . I tried other terms, and no
2009 Apr 20
1
factanal error
Hi. I tried running the following code. I don't understand the error. mydata <- read.table("C:/dataForR/radiology/wordFrequencies.csv", header=TRUE, sep=",") > dim(mydata) [1] 982 924 mydataN<-mydata[,-923] > dim(mydataN) [1] 982 923 cor(mydataN) factanal(mydataN, factors=3) Error in solve.default(cv) : Lapack routine dgesv: system is exactly singular
2009 Jan 15
1
How to Solve the Error( error:cannot allocate vector of size 1.1 Gb)
Hi, Gurus Thanks to your good helps, I have managed starting the use of a text mining package so called "tm" in R under the OS of Win XP. However, during running the tm package, I got another mine like memory problem. What is a the best way to solve this memory problem among increasing a physical RAM, or doing other recipes, etc? ############################### ###### my R
2009 Apr 20
0
system is exactly singular
Hi. I have a csv file. I imported it with mydata<-read.table("C:/dataForR/radiology/WordFrequency.csv", header=TRUE, sep=",") > dim(mydata) [1] 982 925 The first column had the doc numbers like doc1, doc2, etc. so I did mydataNum<-mydata[,-1] > dim(mydataNum) [1] 982 924 The second to last column was also not numeric and so I did >
2005 Nov 23
8
getting started, reading listing and saving data
Dear List I am new to R and to the list and will try best as I can be clear and concise. My apologies if anything I write contravenes the posting code on this list. I would also like to say I have run through most of the material on the R website before writing this email however, I am stuck. Here is what I want to do and what I have done 1. Read a comma seperated text file into R I have used
2007 Dec 12
1
tm package - how to transform a TermDocMatrix to a data.frame
An embedded and charset-unspecified text was scrubbed... Name: n?o dispon?vel Url: https://stat.ethz.ch/pipermail/r-help/attachments/20071212/2483b4bc/attachment.pl
2003 Sep 17
0
cannot add machine account with ldapsam
Hello, Im using samba-3.0.0rc4 with ldapsam. adding users works fine but when i try to add a machine account i alway get this error: pdc:/usr/bin# pdbedit -v -a -m -u test_pc ldapsam_modify_entry: Failed to add user dn= uid=test_pc$,ou=Systems,dc=ovid,dc=evva,dc=com with: Object class violation object class 'sambaSamAccount' requires attribute 'sambaSID'
2009 Oct 02
1
text mining
The following code is derived from a paper titled "Text Mining Infrastructure in R" (http://www.jstatsoft.org/v25/i05/paper). The example below seems to load some default documents for analysis, some sort of latin document. I cannot for the life of me figure out to load my own document let alone an entire corpus. I have searched the above documenet as well as related documentation.
2006 Sep 06
2
Skydome not working
Hi all, I've searched all over to the answer for this but no luck. In short, skydome doesn't seem to be working, I only have a black background. Skydome is pointing at the following: $ file plaza.png plaza.png: PNG image data, 2048 x 1024, 8-bit/color RGB, non-interlaced In the "Gnome Compiz Preferences" app, on the "Workspaces" tab, under "Cube", I
2009 Jan 15
2
Interface to open source Reporting tools
Hi, I am a new user of R 2.8.1. I use Tinn-R for code editing. I use a windows 2003 system with 1 GB RAM. I am interested to generate dashboard and reports based on data from MS Access. These reports need to be posted on a weekly basis to the web. The reporting interface should provide facilities for "what if" scenarios. Is it possible to interface R analysis results to good open
2007 Jul 13
1
Cygwin: store authorized_keys in /etc/ssh/user/authorized_keys?
Hi folks, If I try to login on a Cygwin host via ssh, then my .ssh on a network drive is unaccessible until I login. I have to enter my password, even if my authorized_keys would allow me to login without. This is fatal, since it forces me to use an interactive session for working on a Windows host. Unusable for automatic builds and tests managed from a central machine, for example. There is no