similar to: How does findAssocs() calculate the correlation value ??

Displaying 20 results from an estimated 100 matches similar to: "How does findAssocs() calculate the correlation value ??"

2011 Sep 26
2
findAssocs()
I am trying to find the math behind the "tm" package findAssocs() ?findAssocs does not say anything besides "association" and "correlate" Usually entering "findAssocs" at the CLI gives the code for a R function, but in this case I obtain: function (x, term, corlimit) UseMethod("findAssocs", x) <environment: namespace:tm> Any ideas?
2012 Mar 23
1
how to cluster rows of words in a text file
Hi: I am trying to cluster the rows of a text file with kmeans: I load the data as follows file1 <- read.csv("somefile.csv") and the file can be viewed having the following line of words > file1 1 word1 word3 word4 word1 2 word1 word4 word3 word1 3 word4 word2 word4 word3 4 word4 word2 word1 word3 5 word2 word2 word4 word2 file_as_matrix <- as.matrix(file1); Now,
2010 Dec 30
1
recursively count the words occurrence in the text files
I just can't google for it: I'm searching for a "bash" "one liner" (awk, perl, or anything) for this: there are text files, in several directories: mkdir one mkdir two mkdir three echo "word1 word2 word3" > one/asf.txt echo "word2 word4, word5" > one/asfcxv saf.txt echo "word1. word2" > one/dsgsdg.txt echo "word6,
2005 Nov 08
0
sorting during xtabs? sorting by "individual" order?
Hey alltogether, refacturing a package (before it will be released), I ran across the following problem. I have two directories with different text files, I want to read the first and construct a document-term matrix from it (every term=word in a row, every file in a column, occurrence frequencies form the values). The second directory contains different files. It needs to be read in to also
2017 Nov 15
2
NEED HELP : Association in single DTM
I have free text data in a single text document. I create a corpus, and then a document term matrix out of it. I can create a word cloud too. But when I do word association for the same, using "findAssocs(), it always returns numeric(0). EX : findAssocs(dtm, "king" ,000000000000000000000.1) I read on stack overflow that it is because I have a single document. What is the
2017 Nov 15
0
NEED HELP : Association in single DTM
If you consider the definition of a DTM, and that findAssoc() computes associations between words as correlations across documents(!), you will realize that you can't what you want from a single document. Indeed, what kind of an "association" would you even be looking for? B. > On Nov 15, 2017, at 12:40 AM, Rahul singh <rahulutube69 at gmail.com> wrote: > > I have
2009 Nov 03
3
re ading tokens
Greetings, I am not familiar with processing text in R. Can someone tell me how to read each line of words as separate elements in a list? FE, I would like to turn: word1 word2 word3 word2 word4 into a list of length two with three character elements in the first list and two elements in the second. I know that this should be easy, but I am a little confused by the text functions. Thanks in
2017 Nov 16
2
NEED HELP : Association in single DTM
Hi Boris, In that case, if I have lot of free text data (let us assume part of an Election speech) in one single TEXT document, and i want to find the association of the top 3 most frequently occurring words with the other words in the speech, what method do I adopt ? On Wed, Nov 15, 2017 at 7:08 PM, Boris Steipe <boris.steipe at utoronto.ca> wrote: > If you consider the definition of
2017 Nov 16
0
NEED HELP : Association in single DTM
In general, statistical methodology queries, which seems to be your concern, are offtopic here.This list is about R programming. Consider stats.stackexchange.com for statistical queries. However, the CRAN task view on natural language processing might be useful, so you may wish to check it: https://cran.r-project.org/web/views/NaturalLanguageProcessing.html Cheers, Bert Bert Gunter
2009 Jan 15
1
How to Solve the Error( error:cannot allocate vector of size 1.1 Gb)
Hi, Gurus Thanks to your good helps, I have managed starting the use of a text mining package so called "tm" in R under the OS of Win XP. However, during running the tm package, I got another mine like memory problem. What is a the best way to solve this memory problem among increasing a physical RAM, or doing other recipes, etc? ############################### ###### my R
2007 Jul 07
2
Extending/Modifying QueryParser
Hi, I''ve implemented synonym searching in my rails application but have an idea I''d like to implement but can''t figure out how to do. The idea is that I''d like to give the end user the choice on whether to search for the synonym of a word or not. Preferably by extending the query language to parse a construct similar to ''%word1'' and
2009 Mar 30
1
Help with tm assocation analysis and Rgraphviz installation.
Help with tm assocation analysis and Rgraphviz installation. THANK YOU IN ADVANCE Question 1: I saved two txt file in C:\textfile And each txt file contents only one text column, and both have 100 records. I know term “research” occurs 49 times, so I want to find out which other words are correlated to this word, and I got tons of association ‘1’ . I tried other terms, and no
2010 Jan 28
2
Constrained vector permutation
Hello, I'm trying to permute a vector of positive integers > 0 with the constraint that each element must be <= twice the element before it (i.e. for some vector x, x[i] <= 2*x[i-1]), assuming the "0th" element is 1. Hence the first element of the vector must always be 1 or 2 (by assuming the "0th" element is 1). Similarly, the 2nd must always be below/= 4, the
2011 Nov 05
1
3-D ellipsoid equations
+ Hello, The parametric equations of an ellipsoid can be written in terms of spherical coordinates. The three spherical coordinates are converted to Cartesian coordinates by X=a cos (α) sin(θ) Y=b sin(α) sin(θ) Z=c cos(θ) for α and θ The parameter α varies from 0 to 2 π and θ varies from 0 to π . Here ( X o , Y o ,Z o ) is the center of the ellipsoid, and θ is the angle
2009 Apr 03
2
Geometric Brownian Motion Process with Jumps
Hi, I have been using maxLik to do some MLE of Geometric Brownian Motion Process and everything has been going fine, but know I have tried to do it with jumps. I have create a vector of jumps and then added this into my log-likelihood equation, know I am getting a message: NA in the initial gradient My codes is hear # n<-length(combinedlr) j<-c(1,2,3,4,5,6,7,8,9,10)
2010 Jun 03
1
Question about avoid the for loop
Dear R-helpers, I would like to generate a binary random variable within a stratum's stratum. Here is a simple example. ## x is the first level strata index, here I have 3 strata. x=c(rep(1,5), rep(2,5), rep(3,5)) ## within x, there is a second strata indexed by t=0 and t=1 t=rep(c(0,0,1,1,1),3) ## and within strata i and t=0 and t=1, I generate the random binomial variable respectively,
2010 Apr 04
1
How to add a column to dtm showing a part from directory source?
Hello Experts, I'm new with R and having troubles doing my graduation project.I have 20 subfolders including almost 20000 txt files.What i need to do is to create a dtm and add a column to it showing a "class" information of the txt files. My directory source is like "C:\\R\\20news-18828\\comp.graphics" for the comp.graphic subfolder.I need to take only
2011 Mar 22
1
help need on working in subset within a dataframe
Dear R-experts Execuse me for an easy question, but I need help, sorry for that. >From days I have been working with a large dataset, where operations are needed within a component of dataset. Here is my question: I have big dataset where x1:.....x1000 or so. What I need to do is to work on 4 consequite variables to calculate a statistics and output. So far so good. There are more vector
2011 Aug 25
1
Combining a greek letter and a newline on the same label
Hi, I would like to add a two-lines label consisting of: "Delta word1\n word2" (with Delta being the actual greek letter). However, when I try "expression(paste(Delta,"word1\nword2"))", I obtain the following display: "word1\n Delta word2". Do you have any idea on how to obtain the desired display? Thank you for your help! Sebastien [[alternative HTML
2008 Oct 26
1
Transferring results from R to MS Word2
Tom: Supposse your Excel file is named "myExcelFile" Open myExcelFile and SaveAs "myExcelFile.csv" or "myExcelFile.txt" Then go to the RConsole and type: #Option # 1 mydata <- read.csv("PathTomyExcelFile.csv",header=T) mydata # You should be able to see your dataset in R # Option # 2 # If you saved your Excel file as text then you would do: mydata