Displaying 9 results from an estimated 9 matches for "textmine".
Did you mean:
textline
2011 Jan 24
1
Extracting information from text data
...f documents (say d1, d2, …, dm)
A. Using package tm
I am using package tm to do the job. I have provided the code below:
> my.corpus <- Corpus(DirSource(my.path), readerControl = list (reader=readPlain))
In readLines(y, encoding = x$Encoding) :
incomplete final line found on 'M:\textmine/slr.txt'
> x <- TermDocMatrix(my.corpus)
Error: could not find function "TermDocMatrix"
B. Using package(s) other than tm
Once again, thank you very much for the time you have given.
Regards,
Deb
The code:
library(tm)
my.path<- 'M:\\textmine'
my.corpus...
2010 Aug 22
2
CRAN (and crantastic) updates this week
CRAN (and crantastic) updates this week
New packages
------------
* DCGL (1.0)
Bao-Hong Liu
http://crantastic.org/packages/DCGL
Functions for basic differential coexpression analyses: gene
filtering, link filtering, DCG (Differentially-Coexpressed Gene)
identification and DCL (Differentially-Coexpressed Links)
identification.Two algorithms,named DCP and DCe, are provided for
2012 Oct 25
2
Minería de texto
Cordial Saludo
Actualmente estoy realizando una función para gráficar una nube de palabras el código que tengo es el siguiente:
library(twitteR)library(tm)library(wordcloud)library(RXKCD)library(RColorBrewer)
tweets=searchTwitter(''@afflorezr'', n=1500)
generateCorpus= function(tweets,my.stopwords=c(),min.freq){ #Install the textmining library require(tm) require(wordcloud)
2008 Jan 07
1
glibc detected *** /usr/lib64/R/bin/exec/R: double free or corruption ???? tm package
Hi,
I have a collection of .txt documents in my working folder for which I want to do some text mining. If I run TextDocCol from the tm package, R crashes with some memory issues. Does anyone has any idea if this is related to R itself or to the tm package?
Below you can find what is happening here.
> setwd("/home/jan/Work/2008/Profacts/textmining/tryouts/workfolder")
>
2012 Apr 15
2
Cluster Analysis
Hi,
I was wondering what the best equivalent to SAS's FASTCLUS and PROC CLUSTER would be. I need to be able to test the significance of the clusters by comparing the probability of obtaining an equal or greater pseudo F to the Bonferroni-corrected level. I will also need to plot r squared against the number of clusters.
Thanks so much,
Taisa
[[alternative HTML version deleted]]
2012 Mar 23
1
how to cluster rows of words in a text file
Hi:
I am trying to cluster the rows of a text file with kmeans:
I load the data as follows
file1 <- read.csv("somefile.csv")
and the file can be viewed having the following line of words
> file1
1 word1 word3 word4 word1
2 word1 word4 word3 word1
3 word4 word2 word4 word3
4 word4 word2 word1 word3
5 word2 word2 word4 word2
file_as_matrix <- as.matrix(file1);
Now,
2006 Feb 07
15
So, this search thing...
I am using ferret right now, and it works great for all my regular text
documents/information. My problem arises when I want to index/search all of
our assets (mostly pdf files). Currently, there is no way to READ pdfs from
Ruby. Because of this I have to resort to using Java to read the PDF''s and
then Lucene to index them. My problem here is a couple things.
One, to index a asset I have
2012 Apr 13
4
Help with stemDocument
Hi, All:
I am new to R and tm package. I'm trying to do the stemming using tm_map()
and it doesn't seem to work:
*I used:*
> stemDocument(t_cmts[[100]])
*Where t_cmts is the corpus object, the results is:*
bottle loose box abt airpak sections top plastic bottle squashed nearly
flush neck previous shipments bottle wrapped securely bubble wrap wno
bottle damage packaging poor
2011 Aug 14
2
Problem installing R Commander plugin...
Hi ho folks:
I'm running the indicated version of R on Hardy Heron Ubuntu.
(Yes, I am quite aware that it is considered old news but then I don't
run the "latest and greatest" computer gear. I've tried both Gnome and
KDE editions of Ubuntu Lynx and even a current run of Fedora. I find
Heron simply works better on my machine.)
When I try to install RcmdrPlugin.PT most of