similar to: tm 0.1 uploaded to CRAN

Displaying 20 results from an estimated 5000 matches similar to: "tm 0.1 uploaded to CRAN"

2012 May 29
1
package tm: reading XML files
Dear fellow R users, I'm using the package tm for text mining, and have a problem with reading in a corpus from XML files. When I copy the example from "Introduction to the tm package" of the small reuters subset "crude", everything goes well, and I get a corpus with the required meta data. When I read in the entire reuters21578 corpus in XML format however (or a
2012 Jan 13
4
Troubles with stemming (tm + Snowball packages) under MacOS
Dear all, I have some troubles using the stemming algorithm provided by the tm (text mining) + Snowball packages. Here is my config: MacOS 10.5 R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions) I have installed all the needed packages (tm, rJava, rWeka, Snowball) + dependencies. I have desactivated AWT (like written in
2010 Feb 16
0
tm package
Hi, I'm using version 0.5.1 of tm package with R 2.10.1. It looks to me as if after the following reuters21578 <- Corpus(DirSource(corpusDir), readerControl = list(reader = readReut21578XMLasPlain)) reuters21578 <- tm_map(reuters21578, stripWhitespace) reuters21578 <- tm_map(reuters21578, tolower) reuters21578 <- tm_map(reuters21578, removePunctuation)
2006 Nov 04
0
Ferret 0.10.6 released (and some benchmarks)
Hey folks, ** Description ** Firstly for those who don''t know, Ferret is a full-text search library which makes adding search to your application a breeze. It''s much faster than MySQL full-text search as well most other search libraries out there. It allows you to do Boolean (+ruby + rails -jewelry) and phrase queries ("the quick brown fox") as well as some more
2011 May 18
0
text mining problem using TM package
Hi, I’m using R (TM package) for text mining and I’m having problems filtering articles out of my data set by local meta data. Here is the code: *data <- ("C:/… /19970331")* * * * * *rs <- ReutersSource(data , encoding = "UTF-8")* *RC <- VCorpus(DirSource(data), readerControl = list(reader = readRCV1asPlain,* * language = "en_US",* * load =
2010 Mar 18
0
error while usig "tm" package
I have recently started using "tm" package by Feinerer, K. Hornik, and D. Meyer. While trying to create a term-document matrix from a corpus (approxly 440 docs) I get the following error: tdm <- TermDocumentMatrix(tmp, control=list(weighting=weightTfIdf, minDocFreq=2, minWordLength=3)) *Error in rowSums(m > 0) : 'x' must be an array of at least two dimensions* This error
2007 Jul 14
0
Extending Matrix class
This is from a private question which I'm given permission to answer in public: >>>>> "IF" == Ingo Feinerer <h0125130 at wu-wien.ac.at> >>>>> on Fri, 13 Jul 2007 16:14:07 +0200 writes: IF> Hello, We tried to derive a class from Matrix but had IF> some problems. Maybe you can help us: library("Matrix") m <-
2009 Aug 13
1
using package tm to find phrases
I am using the package "tm" for text-mining of abstracts and would like to use it to find instances of gene names that may contain white space. For instance "gene regulatory protein 1". The default behavior of tm is to parse this into 4 separate words, but I would like to use the class constructor "dictionary" to define phrases such as just mentioned. Is this
2007 Jun 08
1
data mining/text mining?
Dear R-user, Could anybody tell me of the key difference between data mining and text mining? Please make a list for packages about data/text mining. And give me an example of text mining with R (any relating materials will be highly appreciated), because a vignette written by Ingo Feinerer seems too concise for me. Thanks _____________________________________________ Dr.Ruixin ZHU Shanghai
2004 Nov 15
1
[Spam] Any plans implement MaxAuthTriesLog?
I'll rephrase my question... When a user gets their password wrong more than MaxAuthTries times why isn't the message "Too many authentication failures for %.100s" written to syslog? The user seems to get it (in a dialog in putty) but it doesn't get logged. The usual "Failed password for..." messages are logged. Regards, Richard Dickens -----Original Message-----
2003 May 07
1
Bug report: deletion of files only on the target is not logged
Please see the attached file and let me know if you need any more information. /Sam Sam Sexton <mailto:sam.sexton@reuters.com> Reuters Coventry Automated Dealing Technologies Phone: +44 24 7625 6562 Fax: +44 24 7655 5203 --------------------------------------------------------------- - Visit our Internet site at http://www.reuters.com Get closer to the
2009 Nov 12
2
package "tm" fails to remove "the" with remove stopwords
I am using code that previously worked to remove stopwords using package "tm". Even manually adding "the" to the list does not work to remove "the". This package has undergone extensive redevelopment with changes to the function syntax, so perhaps I am just missing something. Please see my simple example, output, and sessionInfo() below. Thanks! Mark require(tm)
2003 May 09
1
No subject
I sent three messages to the list on Tuesday, but none of them have reached the archive, and I've not seen a reply. I'll send the other two separately. The problem may be at my end, but I didn't get any error messages. /Sam Sam Sexton <mailto:sam.sexton@reuters.com> Reuters Coventry Automated Dealing Technologies Phone: +44 24 7625 6562 Fax: +44 24 7655 5203
2003 Jun 19
0
passwd and secrets files (2.5.6)
I am a little confused regarding the above files. As I read the man pages, the passwd file is for the password of the user as which the rsync server runs - on the server machine. The secrets files (AFAIK) contain the name:password for the valid users of rsync. Some problems that arose ("address family not supported") went away after I updated both this file and the password file. These
2006 Nov 30
2
non-searchable columns, normalization
Hello. I am new to Ferret. I am using it through Acts as Ferret. Let''s say I have such a table, and all columns are indexed using the default behavior provided by acts_as_ferret: ARTICLES -id -year -body [1] A typical request will be "select id from articles where KEYWORDS % body". Will id be indexed for fulltext searching? clearly the fulltext index on id will never be
2006 Mar 29
2
1.1 ruby schema breaks MySQL FULLTEXT index
If you used MySQL FULLTEXT indexes and just upgraded to Rails 1.1, watch out for this one. Rails 1.1 now defaults to Ruby as your schema format. So instead of development-structure.sql, Rake will use schema.rb. That''s portable and good stuff overall but there is a slight compatibility issue. Ruby schema doesn''t know how to do FULLTEXT index syntax for MySQL. To work around it,
2003 May 01
4
--exclude-from works but "exclude from" in rsyncd.conf doesn't ?
I'm setting rsync up for the first time and would prefer to have the exclude file defined in the conf file, but the exclusions aren't honoured when I define the parameter in rsyncd.conf - although they are when I specify the file in an argument. The server is the remote system and both rsyncd.conf and the exclude file are the same on both local and remote systems. I'm attaching the
2003 May 24
1
Deletion of files only on remote system is not logged
I've only just seen the response from May 9, as I'm not enrolled in the list (too much mail already!). Here is a cut-down example as requested, showing the same behaviour: Script started on Fri May 23 15:18:37 2003 15:18 [220](H:root@Edelfelt)/tmp: ls -l /tmp/testdir total 32 -rw-r----- 1 root other 9 May 23 15:15 both -rw-r----- 1 root other 7 May 23 14:59
2004 Nov 11
1
Any plans implement MaxAuthTriesLog?
Hello there, I've just upgraded to 3.9p1 and I notice that I'm not getting any "Too many authorization failure" messages written to my syslog. This seems to be controlled by MaxAuthTriesLog in Sun's implementation of SSH. Are there any plans to do the same in OpenSSH? Regards, Richard Dickens -----------------------------------------------------------------
2005 Aug 11
2
Porting to Dynix 4.1.3...
Hello there, I am trying to build OpenSSH on the above (very old) version of unix. With a bit of hacking, I have managed to get everything to compile and run. Scp works fine however when I log in via ssh I end up with a session without a controlling terminal. I have followed the code in pty_make_controlling_tty and it seems to go as follows: 1. Starts with a controlling tty. 2. setsid works,