Displaying 19 results from an estimated 19 matches for "word2".
Did you mean:
word
2012 Mar 23
1
how to cluster rows of words in a text file
Hi:
I am trying to cluster the rows of a text file with kmeans:
I load the data as follows
file1 <- read.csv("somefile.csv")
and the file can be viewed having the following line of words
> file1
1 word1 word3 word4 word1
2 word1 word4 word3 word1
3 word4 word2 word4 word3
4 word4 word2 word1 word3
5 word2 word2 word4 word2
file_as_matrix <- as.matrix(file1);
Now, I want to apply some clustering algorithm such as kmeans to
cluster the rows in the file to get the following output:
Cluster1
word1 word3 word4 word1
word1 word4 word3 word1
Cl...
2009 Nov 03
3
re ading tokens
Greetings,
I am not familiar with processing text in R. Can someone tell me how to
read each line of words as separate elements in a list?
FE, I would like to turn:
word1 word2 word3
word2 word4
into a list of length two with three character elements in the first list
and two elements in the second. I know that this should be easy, but I am a
little confused by the text functions.
Thanks in advance!
--
View this message in context: http://old.nabble.com/reading-tokens...
2011 Sep 26
2
findAssocs()
I am trying to find the math behind the "tm" package findAssocs()
?findAssocs does not say anything besides "association" and "correlate"
Usually entering "findAssocs" at the CLI gives the code for a R
function, but in this case I obtain:
function (x, term, corlimit)
UseMethod("findAssocs", x)
<environment: namespace:tm>
Any ideas?
2011 Aug 25
1
Combining a greek letter and a newline on the same label
Hi,
I would like to add a two-lines label consisting of:
"Delta word1\n
word2"
(with Delta being the actual greek letter).
However, when I try "expression(paste(Delta,"word1\nword2"))", I obtain the
following display:
"word1\n
Delta word2".
Do you have any idea on how to obtain the desired display?
Thank you for your help!
Sebastien
[...
2010 Dec 30
1
recursively count the words occurrence in the text files
I just can't google for it:
I'm searching for a "bash" "one liner" (awk, perl, or anything) for this:
there are text files, in several directories:
mkdir one
mkdir two
mkdir three
echo "word1 word2 word3" > one/asf.txt
echo "word2 word4, word5" > one/asfcxv saf.txt
echo "word1. word2" > one/dsgsdg.txt
echo "word6, word3!" > two/sdgsd dsf.txt
echo "word6" > two/ergd.txt
echo "asdf, word2" > three/werdf.txt
echo "wo...
2005 Nov 08
0
sorting during xtabs? sorting by "individual" order?
...ment-term
matrix -- however, in the same "term-order" to enable
similarity comparisons in a vector space of the
same format.
Let's make a (fake) example:
(1) support function
# directory 1 contains 2 files (F1 & F2):
F1 = c("word4", "word3", "word2")
F2 = c("word1", "word4", "word2")
# directory 2 contains also 2 files (F3 & F4):
F3 = c("word1", "word2", "bla")
F4 = c("word1", "word2", "word3")
# I file in the firs...
2008 Oct 26
1
Transferring results from R to MS Word2
Tom:
Supposse your Excel file is named "myExcelFile"
Open myExcelFile and SaveAs "myExcelFile.csv" or "myExcelFile.txt"
Then go to the RConsole and type:
#Option # 1
mydata <- read.csv("PathTomyExcelFile.csv",header=T)
mydata
# You should be able to see your dataset in R
# Option # 2
# If you saved your Excel file as text then you would do:
mydata
2010 Jul 21
2
Issues reshaping data
...haven't worked
much with the reshape function, I have found the reshape package useful and
intuitive for reshaping data from long to wide format. However, going the
other way has me stumped with this type of data. My data is set up, roughly,
as follows:
SUBJECT TRIAL WORD1 WORD1.RT WORD2 WORD2.RT ...
WORD25.RT
1 1 My 100 friend 200
... ...
1 2 John's 250 dog 320
... ...
1 3 His 120...
2009 Aug 16
1
Can a variable name include '_' or '.' in portable R code
...9;_' and '.' can be used in portable R code.
It very common that some variable name should be composed of two or
more English words. Can somebody let me know what is the widely
accepted practice to concatenate two English words into a variable
name? For example, shall I use "word1_word2" or "Word1Word2" or
"word1.word2"?
Regards,
Peng
2006 May 28
2
Word97 launches but still cannot save or load
...typed,
within this subdirectory: wine WINWORD.EXE. That launched Word97 at last.
But when I try to save a document, all I get is the first letter of the file
name. When I try to save, Word refuses to do so. Please see error messages:
http://www.websher.net/tmp/word1.png
http://www.websher.net/tmp/word2.png
Plus the data files:
http://www.websher.net/tmp/user.reg
http://www.websher.net/tmp/userdef.reg
http://www.websher.net/tmp/system.reg
I would be most grateful for your kind help.
Benjamin
2017 Jul 07
1
How does findAssocs() calculate the correlation value ??
...findAssocs().
I have found that someone had asked the question before, and have a good explanation by Rick.
?]http://r.789695.n4.nabble.com/findAssocs-td3845751.html?^
But I still don't understand how to calculate the correlation value between the two vectors.
For example:
# Correlation word2 with word3
cor(c(0,1,1,1,1),c(0,0,1,1,1))
[1] 0.6123724
How does it caculate cor(c(0,1,1,1,1),c(0,0,1,1,1)) to get 0.6123724 ?
Eva
[[alternative HTML version deleted]]
2007 Jul 07
2
Extending/Modifying QueryParser
...ow to do. The
idea is that I''d like to give the end user the choice on whether to
search for the synonym of a word or not. Preferably by extending the
query language to parse a construct similar to ''%word1'' and then have
the word turned into a or list (i.e., word1|word2|word3|...).
Currently, the query parser constantly calls SynonymTokenFilter to
get synonyms for each token. Is there a way I can go about achieving
this functionality?
Here''s an overview of what I''ve done so far:
My model classes in my rails app use acts_as_ferret with a ca...
2013 Nov 30
4
Full text search improvements
...text, it's pretty useless to index it. Then again, we do want to index all kinds of IDs that someone might want to search. This could be a bit difficult to implement well.
8. Index attachments separately, so it would be possible to search only attachments. (Should "SEARCH BODY word1 BODY word2" return matches if word1 and word2 are in different attachments?)
9. Attachments can be translated to indexable UTF-8 text already with fts_decoder setting by doing it via a conversion script. This could also support Apache Tika server directly.
10. It should be configurable which fields are...
2008 Feb 13
1
can a class have the association has_itself?
is it possible to implement a recursive foreign key association like
that?
i''m writing a vocab app, where i want the vocab table to have an id
and two foreign keys, word1 and word2.
the word table will have an id, a word-string, and a language
i''m trying to do this as follows:
class Vocab < ActiveRecord::Base
belongs_to :word
belongs_to :word
class Word < ActiveRecord::Base
has_many :vocabs
has_many :words, :through => :vocabs
but it doesn'...
2013 Apr 10
2
grup function
Hello,
How can i mattch blanks within words, when i have more phrases?
c("Shangh i", "Hello here i am","h llo")
> gsub(" ","a",c("Shangh i", "Hello here i am","h llo"))
[1] "Shanghai" "Helloahereaiaam" "hallo"
I would like to have [1] "Shanghai" "Hello here
2010 Apr 04
1
How to add a column to dtm showing a part from directory source?
Hello Experts,
I'm new with R and having troubles doing my graduation project.I have 20
subfolders including almost 20000 txt files.What i need to do is to create a
dtm and add a column to it showing a "class" information of the txt files.
My directory source is like "C:\\R\\20news-18828\\comp.graphics" for the
comp.graphic subfolder.I need to take only
2008 Jun 11
5
Similar words
Is there a way to get a list of similar words to the ones a user has
searched for?
So if they search for (in my case) transferaze which has no matches I
can get back an array like this [''transferase''] ?
I know I can just add ~ on the end to make it fuzzy, but what I''d like
is to be able to say "Sorry, no matches for ''transferaze''. Did you
2004 Dec 21
1
Search::Xapian add_database'd search results are odd?
...=======
my $db = Search::Xapian::Database->new($xapiandirbase . '-11' ||
die("Error.\n"));
my $db->add_database(Search::Xapian::Database->new($ARGV[0] . '-10' ||
die("Error.\n")));
my $query = Search::Xapian::Query->new(OP_AND, 'word', 'word2');
print "Query: " . $query . "\n";
my $enq = $db->enquire($query);
my @matches = $enq->matches(0,1000000);
foreach my $match ( @matches ) {
print $match->get_docid . ',';
}
===================================================================
If I don&...
2023 May 19
1
range() for Date and POSIXct could respect `finite = TRUE`
.... but then you did say "not sure" ..
>
> I'm still somewhat favoring this proposal,
> because it would be a bit more generally applicable
> but still very simple.
>
> Personally, I'd go for the shorter allowsInf() name,
> not adding another <word1>.<word2>() generic function,
> but that's less important and should not determine decisions I think.
>
> Martin
>
> >> > allows.infinite.Date <- function(x) {
> >> > TRUE
> >> > }
> >> > allows.infinite.POSIXct...