similar to: About R and text mining

Displaying 20 results from an estimated 3000 matches similar to: "About R and text mining"

2011 May 25
0
text mining - text comparing
Hi all, I'll try to explain what I would like to achieve. I have two problems that I would need help on if someone has a clue. 1.) I have a TXT file containing two fields: USCS and Description. For each field of USCS I have a field Description that contains a lot of words that describe that particular USCS type. What I would like to do is to mine the text using tm
2013 Nov 21
2
RStudio and R.app "segmentation fault" errors
R-helpers, I'm using system() to run a shell script that uses a library written in C++ to analyze natural language (FreeLing: http://nlp.lsi.upc.edu/freeling). When I run the following code in RStudio (0.97.248) and R.app (1.62) on Mac OS X (10.7.5): > cmd <- "analyze -f /usr/local/share/freeling/config/es.cfg --lang es --outf tagged </Users/earlbrown/temp_input.txt" >
2009 Oct 02
1
text mining
The following code is derived from a paper titled "Text Mining Infrastructure in R" (http://www.jstatsoft.org/v25/i05/paper). The example below seems to load some default documents for analysis, some sort of Latin document. I cannot for the life of me figure out how to load my own document, let alone an entire corpus. I have searched the above document as well as related documentation.
2009 Jun 15
1
non-english text mining with tm package
Hello, I'm having issues working with the tm package on non-English languages. Are there any extensions that enable the package to work with Hebrew (and other non-Roman-letter languages, for that matter)? (E.g., although I can construct a Corpus that shows the Hebrew documents all right, I cannot create a term-document matrix (it does not identify any of the Hebrew words, so it thinks there
2017 Sep 03
2
readLines() segfaults on large file & question on how to work around
Jeroen: Thank you for pointing me to ndjson, which I had not heard of and is exactly my case. My experience: jsonlite::stream_in - segfaults; ndjson::stream_in - my fault, I am running Ubuntu 14.04 and it is too old so it won't compile the package; corpus::read_ndjson - works!!! Of course it does a different simplification than jsonlite::fromJSON, so I have to change some code, but
2006 Jan 16
2
automon - one touch record
Actually the docs for the Queue application say: 'w' -- allow the called user to write the conversation to disk via Monitor; 'W' -- allow the calling user to write the conversation to disk via Monitor. I couldn't get these to work, though. Does this mean I can do one touch recording with agents, or does it mean I can use the monitor() command? Very confusing... Doug.
2012 Feb 01
0
Startup Chile Company Looking For Founding Developer/CTO
Hi everyone, My name is Jennifer Turliuk. I'm currently in Santiago, Chile for the next 6 months as part of the Startup Chile program. I think you may be able to help me out. We are looking to bring on a developer ASAP (see description below). We'd love to hear from you. Or, if you know of anyone that may be interested, we'd be very grateful if you would pass this
2017 Sep 04
0
readLines() segfaults on large file & question on how to work around
Although the problem can apparently be avoided in this case, readLines causing a segfault still seems unwanted behaviour to me. I can replicate this with the example below (sessionInfo is further down): # Generate an example file l <- paste0(sample(c(letters, LETTERS), 1E6, replace = TRUE), collapse="") con <- file("test.txt", "wt") for (i in
2002 Oct 07
2
Samba file server and Active Directories
I am trying to set up our HP Unix server with Samba 2.2.5 to authenticate via our AD user directory. Our NT structure is set up as follows: Active Directory in mixed mode - 6 domain controllers, one BDC. Here is a copy of my samba.conf: # Samba config file created using SWAT # from ws00478.rbinc.com (10.27.52.184) # Date: 2002/10/04 08:38:08 # Global parameters [global] client code page = 437
2002 Oct 30
2
Groups and Samba and Winbind
I am having an issue with winbind/samba and my groups. When I type the following: $ groups ntdomain\\jfountain I get nothing. When I type this (logged in as jfountain): $ groups I get the following: $ 50000 I am assuming this means domain admins. When I type $./wbinfo -r jfountain I get the following: 50016 50017 50018 50019 50020 50021 50010 50022 50023 50000 I
2002 Dec 10
2
Installed New version of Samba for HPUX 2.2.7 - Winbind doesn't work
Errors: could not check secret; 'ping' to winbindd failed. winbind is in my nsswitch. Any ideas? Thanks. Thank you, Jenn Fountain 215.712.5156
2009 Apr 17
0
question about the Text Mining package tm
Hello. I am trying to work with the text mining package tm. I have a directory called textsTweet1 which contains three files short.txt myTextFile.txt myTextFile.csv short.txt contains one line: THE CAT IN THE HAT\n myTextFile contains some tweets from Twitter. The first few lines of myTextFile.txt are: @oliviamunn I miss a good Yakaniku...I miss Japan...I NEED COCO EVERYBODY. I NEED TO GET ON
2011 Feb 10
2
Help using "tm" text mining package - preprocessing
Thanks all for your help. I fear text mining is an abstract little corner of "R". I have imported 3228 text (.txt) files, each a news story, into R using [tm]: textd <- Corpus(DirSource("other/docs"), readerControl = list(reader =readPlain)) I can pre-process each individual document using tolower(textd[[1]]) however, when I try to run tmTolower() I get a no such command
2011 May 18
0
text mining problem using TM package
Hi, I’m using R (TM package) for text mining and I’m having problems filtering articles out of my data set by local meta data. Here is the code: data <- ("C:/… /19970331") rs <- ReutersSource(data , encoding = "UTF-8") RC <- VCorpus(DirSource(data), readerControl = list(reader = readRCV1asPlain, language = "en_US", load =
2008 May 30
1
existing package (mmlcr) modification -- appropriate process?
All: I am new to R and would like your help in identifying the appropriate process to follow in order to modify the output from an existing package. I've had difficulty finding an answer online, perhaps because I am using incorrect terminology. A package that I am using (mmlcr) invokes another package (multinom). An output of multinom is the standard errors, but this output is not
2005 May 12
1
Queue/Agent recording and configuration
Good Morning, Does anyone know a way around my problem? The call is from a queue. I need to know how to play a message to the customer (terms & conditions), keep the agent with the call while the message is played, and record only a small portion of the call (the caller's acceptance of the terms and conditions). Kind regards Jenn Hales
2004 May 24
0
Errors authenticating on schedualed tasks
I run a Samba 3.0.2a domain controller under Solaris X86, with a Windows 2003 server computer joined to the domain. I run into a problem when I try to schedule a task (using Task Scheduler) with the "Run As" feature. I attempt to put the user DOMAIN\user for "Run As". Windows reports the error "0x80070005: Access is denied" and when I check
2010 Sep 29
0
Transforming/appending data (words in IMDB)
Hi everyone, I am doing an analysis of reviews in IMDB and am running into trouble getting my data into the right shape for analysis. Key question: I want to know for each word in the IMDB, whether it is over- or under-represented in a particular category (Rating x Genre). I was figuring on estimating this with a g-test, fwiw. But the basic question I'm asking here is about data
2002 Oct 08
4
problem with samba server
here is my config file - # Samba config file created using SWAT # from ws09573.rb.net (10.27.52.177) # Date: 2002/10/08 10:25:55 # Global parameters [global] client code page = 437 workgroup = domain1 netbios name = server1 server string = ares interfaces = lan4 127.0.0.1 bind interfaces only = Yes security = DOMAIN encrypt passwords = Yes update encrypted = Yes allow trusted domains
2008 Oct 08
1
Suspicious output from lme4-mcmcsamp
Hello, R community, I have been using the lmer and mcmcsamp functions in R with some difficulty. I do not believe this is my code or data, however, because my attempts to use the sample code and 'sleepstudy' data provided with the lme4 package (and used on several R-Wiki pages) do not return the same results as those indicated in the help pages. For instance: > sessionInfo() R