Displaying 20 results from an estimated 5000 matches similar to: "tm 0.1 uploaded to CRAN"
2012 May 29
1
package tm: reading XML files
Dear fellow R users,
I'm using the package tm for text mining, and have a problem with
reading in a corpus from XML files.
When I copy the example from "Introduction to the tm package" of the
small reuters subset "crude", everything goes well, and I get a corpus
with the required meta data.
When I read in the entire reuters21578 corpus in XML format however (or
a
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all,
I have some troubles using the stemming algorithm provided by the tm
(text mining) + Snowball packages.
Here is my config:
MacOS 10.5
R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions)
I have installed all the needed packages (tm, rJava, rWeka, Snowball)
+ dependencies. I have desactivated AWT (like written in
2010 Feb 16
0
tm package
Hi,
I'm using version 0.5.1 of tm package with R 2.10.1. It looks to me
as if after the following
reuters21578 <- Corpus(DirSource(corpusDir), readerControl =
list(reader = readReut21578XMLasPlain))
reuters21578 <- tm_map(reuters21578, stripWhitespace)
reuters21578 <- tm_map(reuters21578, tolower)
reuters21578 <- tm_map(reuters21578, removePunctuation)
2006 Nov 04
0
Ferret 0.10.6 released (and some benchmarks)
Hey folks,
** Description **
Firstly for those who don''t know, Ferret is a full-text search library
which makes adding search to your application a breeze. It''s much
faster than MySQL full-text search as well most other search libraries
out there. It allows you to do Boolean (+ruby + rails -jewelry) and
phrase queries ("the quick brown fox") as well as some more
2011 May 18
0
text mining problem using TM package
Hi, I’m using R (TM package) for text mining and I’m having problems
filtering articles out of my data set by local meta data.
Here is the code:
*data <- ("C:/… /19970331")*
* *
* *
*rs <- ReutersSource(data , encoding = "UTF-8")*
*RC <- VCorpus(DirSource(data), readerControl = list(reader =
readRCV1asPlain,*
*
language = "en_US",*
*
load =
2010 Mar 18
0
error while usig "tm" package
I have recently started using "tm" package by Feinerer, K. Hornik, and D.
Meyer.
While trying to create a term-document matrix from a corpus (approxly 440
docs)
I get the following error:
tdm <- TermDocumentMatrix(tmp, control=list(weighting=weightTfIdf,
minDocFreq=2, minWordLength=3))
*Error in rowSums(m > 0) : 'x' must be an array of at least two dimensions*
This error
2007 Jul 14
0
Extending Matrix class
This is from a private question which I'm given permission to
answer in public:
>>>>> "IF" == Ingo Feinerer <h0125130 at wu-wien.ac.at>
>>>>> on Fri, 13 Jul 2007 16:14:07 +0200 writes:
IF> Hello, We tried to derive a class from Matrix but had
IF> some problems. Maybe you can help us:
library("Matrix")
m <-
2009 Aug 13
1
using package tm to find phrases
I am using the package "tm" for text-mining of abstracts and would like to
use it to find instances of gene names that may contain white space. For
instance "gene regulatory protein 1". The default behavior of tm is to parse
this into 4 separate words, but I would like to use the class constructor
"dictionary" to define phrases such as just mentioned.
Is this
2007 Jun 08
1
data mining/text mining?
Dear R-user,
Could anybody tell me of the key difference between data mining and text
mining?
Please make a list for packages about data/text mining.
And give me an example of text mining with R (any relating materials
will be highly appreciated), because a vignette written by Ingo Feinerer
seems too concise for me.
Thanks
_____________________________________________
Dr.Ruixin ZHU
Shanghai
2004 Nov 15
1
[Spam] Any plans implement MaxAuthTriesLog?
I'll rephrase my question...
When a user gets their password wrong more than MaxAuthTries times why
isn't the message "Too many authentication failures for %.100s" written
to syslog? The user seems to get it (in a dialog in putty) but it
doesn't get logged. The usual "Failed password for..." messages are
logged.
Regards,
Richard Dickens
-----Original Message-----
2003 May 07
1
Bug report: deletion of files only on the target is not logged
Please see the attached file and let me know if you need any more
information.
/Sam
Sam Sexton <mailto:sam.sexton@reuters.com>
Reuters Coventry
Automated Dealing Technologies
Phone: +44 24 7625 6562
Fax: +44 24 7655 5203
--------------------------------------------------------------- -
Visit our Internet site at http://www.reuters.com
Get closer to the
2009 Nov 12
2
package "tm" fails to remove "the" with remove stopwords
I am using code that previously worked to remove stopwords using package
"tm". Even manually adding "the" to the list does not work to remove "the".
This package has undergone extensive redevelopment with changes to the
function syntax, so perhaps I am just missing something.
Please see my simple example, output, and sessionInfo() below.
Thanks!
Mark
require(tm)
2003 May 09
1
No subject
I sent three messages to the list on Tuesday, but none of them have
reached the archive, and I've not seen a reply. I'll send the other two
separately. The problem may be at my end, but I didn't get any error
messages.
/Sam
Sam Sexton <mailto:sam.sexton@reuters.com>
Reuters Coventry
Automated Dealing Technologies
Phone: +44 24 7625 6562
Fax: +44 24 7655 5203
2003 Jun 19
0
passwd and secrets files (2.5.6)
I am a little confused regarding the above files. As I read the man
pages, the passwd file is for the password of the user as which the
rsync server runs - on the server machine. The secrets files (AFAIK)
contain the name:password for the valid users of rsync. Some problems
that arose ("address family not supported") went away after I updated
both this file and the password file. These
2006 Nov 30
2
non-searchable columns, normalization
Hello. I am new to Ferret. I am using it through Acts as Ferret.
Let''s say I have such a table, and all columns are indexed using the
default behavior provided by acts_as_ferret:
ARTICLES
-id
-year
-body
[1] A typical request will be "select id from articles where KEYWORDS
% body". Will id be indexed for fulltext searching? clearly the
fulltext index on id will never be
2006 Mar 29
2
1.1 ruby schema breaks MySQL FULLTEXT index
If you used MySQL FULLTEXT indexes and just upgraded to Rails 1.1, watch
out for this one. Rails 1.1 now defaults to Ruby as your schema format.
So instead of development-structure.sql, Rake will use schema.rb.
That''s portable and good stuff overall but there is a slight
compatibility issue. Ruby schema doesn''t know how to do FULLTEXT index
syntax for MySQL.
To work around it,
2003 May 01
4
--exclude-from works but "exclude from" in rsyncd.conf doesn't ?
I'm setting rsync up for the first time and would prefer to have the
exclude file defined in the conf file, but the exclusions aren't
honoured when I define the parameter in rsyncd.conf - although they are
when I specify the file in an argument. The server is the remote system
and both rsyncd.conf and the exclude file are the same on both local and
remote systems. I'm attaching the
2003 May 24
1
Deletion of files only on remote system is not logged
I've only just seen the response from May 9, as I'm not enrolled in the
list (too much mail already!). Here is a cut-down example as requested,
showing the same behaviour:
Script started on Fri May 23 15:18:37 2003
15:18
[220](H:root@Edelfelt)/tmp: ls -l /tmp/testdir
total 32
-rw-r----- 1 root other 9 May 23 15:15 both
-rw-r----- 1 root other 7 May 23 14:59
2004 Nov 11
1
Any plans implement MaxAuthTriesLog?
Hello there,
I've just upgraded to 3.9p1 and I notice that I'm not getting any "Too
many authorization failure" messages written to my syslog.
This seems to be controlled by MaxAuthTriesLog in Sun's implementation
of SSH. Are there any plans to do the same in OpenSSH?
Regards,
Richard Dickens
-----------------------------------------------------------------
2005 Aug 11
2
Porting to Dynix 4.1.3...
Hello there,
I am trying to build OpenSSH on the above (very old) version of unix.
With a bit of hacking, I have managed to get everything to compile and
run. Scp works fine however when I log in via ssh I end up with a
session without a controlling terminal.
I have followed the code in pty_make_controlling_tty and it seems to go
as follows:
1. Starts with a controlling tty.
2. setsid works,