Displaying 20 results from an estimated 110 matches similar to: "question about the Text Mining package tm"
2009 May 24
2
accuracy of a neural net
Hi. I started with a file which was a sparse 982x923 matrix and where the
last column was a variable to be predicted. I did principle component
analysis on it and arrived at a new 982x923 matrix.
Then I ran the code below to get a neural network using nnet and then wanted
to get a confusion matrix or at least know how accurate the neural net was.
I used the first 22 principle components only for
2010 Apr 23
2
Library (tm) Error: could not find function "TermDocMatrix".
Hi List
I have the next code and the error. I have try with other codes and I have
the same problem.
> reut21578 <- system.file("texts", "crude", package = "tm")
> (r <- Corpus(DirSource(reut21578), readerControl = list(reader =
> readReut21578XMLasPlain)))
A corpus with 20 text documents
> (r <- Corpus(DirSource(reut21578), readerControl =
2009 Jan 09
1
[R} how to build TermDocMatrix in tm text mining package of R
Howdy Gurus
I 'd like to ask a question about how to build TermDocMatrix in tm text
mining package.
It is not clear about importing a plain text file, and them converting that
text file into TermDocMatrix file, etc to me.
How can I build a TermDocMatrix of " a plain text document file for text
association?
Or are there any good manuals?
Thank you in advance,
--
Kum-Hoe Hwang, Ph.D.
2011 Jan 24
1
Extracting information from text data
Hi R-Users,
Thanks in advance.
I am using R-2.12.0 on Windows XP.
I am trying to produce an n X m matrix from text data stored in different files. Where n = number of words (say w1, w2, …, wn). M is the number of documents (say d1, d2, …, dm)
A. Using package tm
I am using package tm to do the job. I have provided the code below:
> my.corpus <- Corpus(DirSource(my.path),
2003 Dec 11
4
Probelm with read.table
Hi All,
I have the following text file (mytextfile.txt)
738307 527178 714456 557955
#N/A 17.42 6.22 4.73
#N/A 17.3 6.23 4.75
#N/A 17.29 6.17 4.7
#N/A 17.07 6.12 4.6
#N/A 17.27 6.19 4.7
#N/A 17.72 6.4 4.78
#N/A 17.12 6.19 4.75
#N/A 17.07 6.15 4.65
#N/A 17.03 6.07 4.64
#N/A 17.38 6.13 4.7
#N/A 17.38 6.13 4.7
#N/A 17.38 6.13 4.7
#N/A 17.38 6.13 4.7
#N/A
2008 Oct 18
2
sorting matrix output alphabetically
Hello,
I have been using the TM package to create a TermDocMatrix, which I
have saved as a matrix so that I can view word frequencies. Below is
a section of the code that I have used and an excerpt of the output:
What I wanted to be able to do is to view the output alphabetically -
rather than the results being sorted by frequency as below, that an
alphabetical list would be generated. This
2009 Jan 10
1
Help needed for Loading "tm" package
Howdy Gurus again
Thanks to Tony.Breyal, I was able to writing the following script for
analyzing a text document.
But I got an error with "tm' package. I don't why I got the error from the R
script below. I think I followed proccess of R tm manual.
I use R v2.8.1. and tm_0.3-3.zip under Win XP.
Thanks in advance,
Kum Hwang
> # setting directory
> my.path
2012 Feb 27
2
export 3D array to .txt file
Hey,
I have an array with the dimensions 200x6x200 and I want to save it in a
textfile. I want to be able to reload the text file to get the same array
again: new.array<-load(mytextfile.txt) or something similar. I was not
able to figure out a way to do that with write.table without having to save
200 text files. Is there another way?
Thank you!
Frauke
--
View this message in context:
2007 Jul 14
0
Extending Matrix class
This is from a private question which I'm given permission to
answer in public:
>>>>> "IF" == Ingo Feinerer <h0125130 at wu-wien.ac.at>
>>>>> on Fri, 13 Jul 2007 16:14:07 +0200 writes:
IF> Hello, We tried to derive a class from Matrix but had
IF> some problems. Maybe you can help us:
library("Matrix")
m <-
2009 Mar 30
1
Help with tm assocation analysis and Rgraphviz installation.
Help with tm assocation analysis and Rgraphviz installation.
THANK YOU IN ADVANCE
Question 1:
I saved two txt file in C:\textfile
And each txt file contents only one text column, and both have 100 records.
I know term “research” occurs 49 times, so I want to find out which other
words are correlated to this word, and I got tons of association ‘1’ .
I tried other terms, and no
2009 Apr 20
1
factanal error
Hi. I tried running the following code. I don't understand the error.
mydata <- read.table("C:/dataForR/radiology/wordFrequencies.csv",
header=TRUE, sep=",")
> dim(mydata)
[1] 982 924
mydataN<-mydata[,-923]
> dim(mydataN)
[1] 982 923
cor(mydataN)
factanal(mydataN, factors=3)
Error in solve.default(cv) :
Lapack routine dgesv: system is exactly singular
2009 Jan 15
1
How to Solve the Error( error:cannot allocate vector of size 1.1 Gb)
Hi, Gurus
Thanks to your good helps, I have managed starting the use of a text
mining package so called "tm" in R under the OS of Win XP.
However, during running the tm package, I got another mine like memory problem.
What is a the best way to solve this memory problem among increasing a
physical RAM, or doing other recipes, etc?
###############################
###### my R
2009 Apr 20
0
system is exactly singular
Hi. I have a csv file. I imported it with
mydata<-read.table("C:/dataForR/radiology/WordFrequency.csv", header=TRUE,
sep=",")
> dim(mydata)
[1] 982 925
The first column had the doc numbers like doc1, doc2, etc. so I did
mydataNum<-mydata[,-1]
> dim(mydataNum)
[1] 982 924
The second to last column was also not numeric and so I did
>
2005 Nov 23
8
getting started, reading listing and saving data
Dear List
I am new to R and to the list and will try best as I can be clear and
concise. My apologies if anything I write contravenes the posting code
on this list. I would also like to say I have run through most of the
material on the R website before writing this email however, I am
stuck.
Here is what I want to do and what I have done
1. Read a comma seperated text file into R
I have used
2007 Dec 12
1
tm package - how to transform a TermDocMatrix to a data.frame
An embedded and charset-unspecified text was scrubbed...
Name: n?o dispon?vel
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20071212/2483b4bc/attachment.pl
2003 Sep 17
0
cannot add machine account with ldapsam
Hello,
Im using samba-3.0.0rc4 with ldapsam.
adding users works fine but when i try to add a machine account i alway
get this error:
pdc:/usr/bin# pdbedit -v -a -m -u test_pc
ldapsam_modify_entry: Failed to add user dn=
uid=test_pc$,ou=Systems,dc=ovid,dc=evva,dc=com with: Object class violation
object class 'sambaSamAccount' requires attribute 'sambaSID'
2009 Oct 02
1
text mining
The following code is derived from a paper titled "Text Mining Infrastructure
in R" (http://www.jstatsoft.org/v25/i05/paper). The example below seems to
load some default documents for analysis, some sort of latin document. I
cannot for the life of me figure out to load my own document let alone an
entire corpus. I have searched the above documenet as well as related
documentation.
2006 Sep 06
2
Skydome not working
Hi all,
I've searched all over to the answer for this but no luck. In short, skydome doesn't seem to be working, I only have a black background. Skydome is pointing at the following:
$ file plaza.png
plaza.png: PNG image data, 2048 x 1024, 8-bit/color RGB, non-interlaced
In the "Gnome Compiz Preferences" app, on the "Workspaces" tab, under "Cube", I
2009 Jan 15
2
Interface to open source Reporting tools
Hi,
I am a new user of R 2.8.1. I use Tinn-R for code editing. I use a windows
2003 system with 1 GB RAM.
I am interested to generate dashboard and reports based on data from MS
Access. These reports need to be posted on a weekly basis to the web. The
reporting interface should provide facilities for "what if" scenarios.
Is it possible to interface R analysis results to good open
2007 Jul 13
1
Cygwin: store authorized_keys in /etc/ssh/user/authorized_keys?
Hi folks,
If I try to login on a Cygwin host via ssh, then my
.ssh on a network drive is unaccessible until I login.
I have to enter my password, even if my authorized_keys
would allow me to login without. This is fatal, since it
forces me to use an interactive session for working on a
Windows host. Unusable for automatic builds and tests
managed from a central machine, for example.
There is no