Displaying 8 results from an estimated 8 matches for "word3".

2012 Mar 23
1
how to cluster rows of words in a text file
Hi: I am trying to cluster the rows of a text file with kmeans: I load the data as follows file1 <- read.csv("somefile.csv") and the file can be viewed having the following line of words > file1 1 word1 word3 word4 word1 2 word1 word4 word3 word1 3 word4 word2 word4 word3 4 word4 word2 word1 word3 5 word2 word2 word4 word2 file_as_matrix <- as.matrix(file1); Now, I want to apply some clustering algorithm such as kmeans to cluster the rows in the file to get the following output: Cluster1...
2011 Sep 26
2
findAssocs()
I am trying to find the math behind the "tm" package's findAssocs(). ?findAssocs does not say anything besides "association" and "correlate". Usually entering "findAssocs" at the CLI gives the code for an R function, but in this case I obtain: function (x, term, corlimit) UseMethod("findAssocs", x) <environment: namespace:tm> Any ideas?
2009 Nov 03
3
reading tokens
Greetings, I am not familiar with processing text in R. Can someone tell me how to read each line of words as separate elements in a list? FE, I would like to turn: word1 word2 word3 word2 word4 into a list of length two with three character elements in the first list and two elements in the second. I know that this should be easy, but I am a little confused by the text functions. Thanks in advance! -- View this message in context: http://old.nabble.com/reading-tokens-tp261...
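A minimal sketch of the structure asked for above, written in Python rather than R purely for illustration. It assumes the input is two lines, "word1 word2 word3" and "word2 word4" (the line break is flattened in the snippet), and the helper name `read_token_lines` is hypothetical.

```python
def read_token_lines(text):
    """Split each non-empty line into its whitespace-separated tokens."""
    return [line.split() for line in text.splitlines() if line.strip()]


# Assumed input: the two lines of words from the question.
sample = "word1 word2 word3\nword2 word4"
tokens = read_token_lines(sample)

print(len(tokens))  # number of lines read
print(tokens)       # one list of tokens per line
```

In R itself, the usual route to the same shape is probably `strsplit()` applied to the result of `readLines()`, which also returns one character vector per line.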
2010 Dec 30
1
recursively count the words occurrence in the text files
I just can't google for it: I'm searching for a "bash" "one liner" (awk, perl, or anything) for this: there are text files, in several directories: mkdir one mkdir two mkdir three echo "word1 word2 word3" > one/asf.txt echo "word2 word4, word5" > one/asfcxv saf.txt echo "word1. word2" > one/dsgsdg.txt echo "word6, word3!" > two/sdgsd dsf.txt echo "word6" > two/ergd.txt echo "asdf, word2" > three/werdf.txt echo "word7, w...
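The question asks for a shell one-liner; the following is instead a short Python sketch of the same idea, walking a directory tree and tallying words across all files. The function name `count_words`, the temporary directory, and the choice of `\w+` as the word pattern (which drops the punctuation) are assumptions, and only three of the files from the snippet are recreated.

```python
import os
import re
import tempfile
from collections import Counter


def count_words(root):
    """Count word occurrences in every file below root, recursively."""
    counts = Counter()
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            with open(path, encoding="utf-8", errors="replace") as fh:
                # \w+ keeps letters, digits and underscores; punctuation is dropped.
                counts.update(re.findall(r"\w+", fh.read()))
    return counts


# Recreate a few of the example files in a throwaway directory.
examples = [
    ("one", "asf.txt", "word1 word2 word3"),
    ("one", "dsgsdg.txt", "word1. word2"),
    ("two", "ergd.txt", "word6"),
]
with tempfile.TemporaryDirectory() as root:
    for subdir, name, text in examples:
        os.makedirs(os.path.join(root, subdir), exist_ok=True)
        with open(os.path.join(root, subdir, name), "w", encoding="utf-8") as fh:
            fh.write(text)
    counts = count_words(root)

for word, n in sorted(counts.items()):
    print(word, n)
```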
2005 Nov 08
0
sorting during xtabs? sorting by "individual" order?
...so construct a document-term matrix -- however, in the same "term-order" to enable similarity comparisons in a vector space of the same format. Let's make a (fake) example: (1) support function # directory 1 contains 2 files (F1 & F2): F1 = c("word4", "word3", "word2") F2 = c("word1", "word4", "word2") # directory 2 contains also 2 files (F3 & F4): F3 = c("word1", "word2", "bla") F4 = c("word1", "word2", "word3") #...
2017 Jul 07
1
How does findAssocs() calculate the correlation value ??
...(). I have found that someone had asked the question before, and there is a good explanation by Rick: http://r.789695.n4.nabble.com/findAssocs-td3845751.html But I still don't understand how to calculate the correlation value between the two vectors. For example: # Correlation word2 with word3 cor(c(0,1,1,1,1),c(0,0,1,1,1)) [1] 0.6123724 How does it calculate cor(c(0,1,1,1,1),c(0,0,1,1,1)) to get 0.6123724? Eva
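R's cor() computes the Pearson correlation by default, so the value in the question can be reproduced by hand. The sketch below does so in Python for illustration; the function name `pearson` is hypothetical, and the two vectors are the ones quoted above.

```python
import math


def pearson(x, y):
    """Pearson correlation: co-deviation divided by the product of spreads."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


word2 = [0, 1, 1, 1, 1]  # mean 0.8
word3 = [0, 0, 1, 1, 1]  # mean 0.6
r = pearson(word2, word3)
print(round(r, 7))  # 0.6123724
```

With means 0.8 and 0.6, the summed products of deviations are 0.6 and the summed squared deviations are 0.8 and 1.2, giving 0.6 / sqrt(0.8 * 1.2) = 0.6 / sqrt(0.96), which is approximately 0.6123724.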
2007 Jul 07
2
Extending/Modifying QueryParser
...do. The idea is that I'd like to give the end user the choice on whether to search for the synonym of a word or not. Preferably by extending the query language to parse a construct similar to '%word1' and then have the word turned into an OR list (i.e., word1|word2|word3|...). Currently, the query parser constantly calls SynonymTokenFilter to get synonyms for each token. Is there a way I can go about achieving this functionality? Here's an overview of what I've done so far: My model classes in my rails app use acts_as_ferret with a call tha...
2010 Apr 04
1
How to add a column to dtm showing a part from directory source?
Hello Experts, I'm new to R and having trouble with my graduation project. I have 20 subfolders containing almost 20000 txt files. What I need to do is create a dtm and add a column to it showing the "class" information of the txt files. My directory source is like "C:\\R\\20news-18828\\comp.graphics" for the comp.graphic subfolder. I need to take only