Displaying 8 results from an estimated 8 matches for "word3".
2012 Mar 23
1
how to cluster rows of words in a text file
Hi:
I am trying to cluster the rows of a text file with kmeans:
I load the data as follows:
file1 <- read.csv("somefile.csv")
and the file contains the following lines of words:
> file1
1 word1 word3 word4 word1
2 word1 word4 word3 word1
3 word4 word2 word4 word3
4 word4 word2 word1 word3
5 word2 word2 word4 word2
file_as_matrix <- as.matrix(file1);
Now, I want to apply some clustering algorithm such as kmeans to
cluster the rows in the file to get the following output:
Cluster1...
2011 Sep 26
2
findAssocs()
I am trying to find the math behind the findAssocs() function in the "tm" package.
?findAssocs does not say anything beyond "association" and "correlate".
Usually, entering "findAssocs" at the CLI gives the code for an R
function, but in this case I obtain:
function (x, term, corlimit)
UseMethod("findAssocs", x)
<environment: namespace:tm>
Any ideas?
2009 Nov 03
3
reading tokens
Greetings,
I am not familiar with processing text in R. Can someone tell me how to
read each line of words as separate elements in a list?
For example, I would like to turn:
word1 word2 word3
word2 word4
into a list of length two with three character elements in the first list
and two elements in the second. I know that this should be easy, but I am a
little confused by the text functions.
Thanks in advance!
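The transformation asked for above (one list entry per line, each holding that line's words) can be sketched as follows. This is a minimal illustration in Python rather than R, and the helper name read_token_lines is hypothetical. In R, combining readLines() with strsplit() should give a comparable list.

```python
def read_token_lines(text):
    """Split text into lines, then split each non-empty line on whitespace."""
    return [line.split() for line in text.splitlines() if line.strip()]


# The two example lines from the question.
sample = "word1 word2 word3\nword2 word4\n"
tokens = read_token_lines(sample)
print(tokens)  # [['word1', 'word2', 'word3'], ['word2', 'word4']]
```

The result has length two, with three elements in the first entry and two in the second, which matches the shape described in the question.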
2010 Dec 30
1
recursively count word occurrences in text files
I just can't google for it:
I'm searching for a "bash" "one liner" (awk, perl, or anything) for this:
there are text files, in several directories:
mkdir one
mkdir two
mkdir three
echo "word1 word2 word3" > one/asf.txt
echo "word2 word4, word5" > "one/asfcxv saf.txt"
echo "word1. word2" > one/dsgsdg.txt
echo "word6, word3!" > "two/sdgsd dsf.txt"
echo "word6" > two/ergd.txt
echo "asdf, word2" > three/werdf.txt
echo "word7, w...
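Since the question accepts "anything", here is one possible approach sketched in Python rather than as a shell one-liner. The function name count_words is hypothetical, and the sketch assumes that a "word" is a run of letters, digits or underscores, so trailing punctuation such as "," and "." is dropped. The demo recreates only the three files under one/ whose echo commands are shown in full above.

```python
import os
import re
import tempfile
from collections import Counter


def count_words(root):
    """Walk root recursively and count every word found in every file."""
    counts = Counter()
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            with open(path, encoding="utf-8", errors="replace") as handle:
                # Assumption: a word is a run of \w characters.
                counts.update(re.findall(r"\w+", handle.read()))
    return counts


# Demo on a throwaway tree mirroring the files under "one" above.
with tempfile.TemporaryDirectory() as root:
    os.mkdir(os.path.join(root, "one"))
    samples = {
        "asf.txt": "word1 word2 word3\n",
        "asfcxv saf.txt": "word2 word4, word5\n",
        "dsgsdg.txt": "word1. word2\n",
    }
    for name, text in samples.items():
        with open(os.path.join(root, "one", name), "w", encoding="utf-8") as handle:
            handle.write(text)
    totals = count_words(root)

for word, n in sorted(totals.items()):
    print(n, word)
```

File names containing spaces need no special handling here, because the paths are never passed through a shell.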
2005 Nov 08
0
sorting during xtabs? sorting by "individual" order?
...so construct a document-term
matrix -- however, in the same "term-order" to enable
similarity comparisons in a vector space of the
same format.
Let's make a (fake) example:
(1) support function
# directory 1 contains 2 files (F1 & F2):
F1 = c("word4", "word3", "word2")
F2 = c("word1", "word4", "word2")
# directory 2 also contains 2 files (F3 & F4):
F3 = c("word1", "word2", "bla")
F4 = c("word1", "word2", "word3")
#...
2017 Jul 07
1
How does findAssocs() calculate the correlation value ??
...().
I found that someone had asked this question before and received a good explanation from Rick.
http://r.789695.n4.nabble.com/findAssocs-td3845751.html
But I still don't understand how to calculate the correlation value between the two vectors.
For example:
# Correlation of word2 with word3
cor(c(0,1,1,1,1),c(0,0,1,1,1))
[1] 0.6123724
How does it calculate cor(c(0,1,1,1,1),c(0,0,1,1,1)) to get 0.6123724?
Eva
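The value can be reproduced by hand. R's cor() uses the Pearson correlation by default: the sum of products of deviations from the means, divided by the square root of the product of the two sums of squared deviations. Here the means are 0.8 and 0.6, the sum of products is 0.6, and the sums of squares are 0.8 and 1.2, so r = 0.6 / sqrt(0.8 * 1.2) = 0.6 / sqrt(0.96), which is about 0.6123724. A minimal sketch of the same arithmetic, written in Python rather than R (the function name pearson is hypothetical):

```python
import math


def pearson(x, y):
    """Pearson correlation computed from deviations about the means."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of products of deviations (numerator).
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Sums of squared deviations (denominator terms).
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


r = pearson([0, 1, 1, 1, 1], [0, 0, 1, 1, 1])
print(round(r, 7))  # 0.6123724
```

Dividing both the numerator and the sums of squares by n - 1, as the sample covariance and variances do, leaves the ratio unchanged, so the result is the same either way.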
2007 Jul 07
2
Extending/Modifying QueryParser
...do. The
idea is that I'd like to give the end user the choice on whether to
search for the synonym of a word or not. Preferably by extending the
query language to parse a construct similar to '%word1' and then have
the word turned into an OR list (i.e., word1|word2|word3|...).
Currently, the query parser constantly calls SynonymTokenFilter to
get synonyms for each token. Is there a way I can go about achieving
this functionality?
Here's an overview of what I've done so far:
My model classes in my rails app use acts_as_ferret with a call tha...
2010 Apr 04
1
How to add a column to dtm showing a part from directory source?
Hello Experts,
I'm new to R and am having trouble with my graduation project. I have 20
subfolders containing almost 20000 txt files. What I need to do is create a
dtm and add a column to it showing the "class" information of the txt files.
My directory source is like "C:\\R\\20news-18828\\comp.graphics" for the
comp.graphics subfolder. I need to take only