Displaying 20 results from an estimated 4000 matches similar to: "how to subset rows using regular expression patterns"
2008 Jan 26
3
An R clause to bind dataframes under certain contions
Hi netters,
Suppose I have two data frames X and Y. X has three colnames A, B and C. Y has three colnames A,B and D.
I want to combine them into one matrix, joining the rows having the same A and B values (X$A==Y$A and X$B = Y$B). So the resulting dataframe has four variables/columns: A,B,C and D.
I was wondering what's the best way to do it in R. Could anyone give me some advice?
Thanks!
2007 Jul 19
3
Error: evaluation nested too deeply when doing heatmap with binary distfunction
Hi netters,
I have a matrix X of the size (1000,100). The values are from -3 to +3.
When I tried
heatmap(X,
distfun=function(c),dist(c,method="bin"),hclustfun=function(m),hclust(m,method="average"))
I got the error message:
Error: evaluation nested too deeply: infinite recursion /
options(expressions=)?
However, if I used default parameters for distfunction:
2008 Mar 07
4
locate the rows in a dataframe with some criteria
Hi, netters,
This is probably a rookie question but I couldn't find the answer after hours of searching and trying.
Suppose there'a a dataframe M:
x y
10 A
13 B
8 A
11 A
I want to locate the rows where x >=10 and y="A". I know how to do it to vectors by using
which, but how to do it with the dataframe?
Thank you very much!
Zhihua Li
2008 Jan 26
3
Comparison of aggregate in R and group by in mysql
Hi, netters,
First of all, thanks a lot for all the prompt replies to my earlier question about "merging" data frames in R.
Actually that's an equivalence to the "join" clause in mysql.
Now I have another question. Suppose I have a data frame X with lots of columns/variables:
Name, Age,Group, Type, Salary.
I wanna do a subtotal of salaries:
aggregate(X$Salary,
2005 Jun 23
2
quotient and remainder
hi netters
Is there a function in R that can compute the quotient and remainder of a
division calculation? such that when 11 is given as the dividend and 5
the divider, the function returns 2(quotient) and 1(remainder).
Thanks a lot!
_________________________________________________________________
伱佲伔佈佅伮佋佖 MSN Explorer: http://explorer.msn.com/lccn/
2005 Dec 08
2
how to change a dataframe with characters to a numeric matrix?
hi netters,
i have a dataframe TEST like this:
Y1 Y2 Y3
X1 4 7 8
X2 6 2 Z
X3 8 0 1
i would like to change it to a numeric matrix, replacing "Z" with NA
Y1 Y2 Y3
X1 4 7 8
X2 6 2 NA
X3 8 0 1
i've tried the function data.matrix but it didn't work. is there any easy
way to do this?
thanks a lot!
2007 Jul 02
2
working with R graphics remotely
Hi netters,
Now I'm connecting from my local windows machine to a remote linux machine
and launch R out there using SSH. When I tried to create grahics, like
using plot or heatmap, I cannot see the output. Maybe a new R window
displaying the graphics has popped out in the remote machine? Or I need to
change some settings for the graphics to display? I don't know. I googled
it and
2005 May 13
1
manipulating dataframe according to the values of some columns
hi netters,
I'm a newbie to R and there are some very simple problems puzzeled me for
two days.
I've a dataframe here with several columns different in modes. Two of the
columns are special for me: column 1 has the mode "factor" and column 2 has
the mode "numeric vectors".
The values for column 1 are either "T" or "F". I wanna do two things:
2005 Jun 21
2
how to count "associated" factors?
hi netters
Suppose I have a factor X, with 10 elements and 3 levels: A B B C A C B A C
C .
It is easy to count the number of elements for each level:
tapply(X,X,length).
Now I have another factor Y, which formed a matrix with X:
X| A B B C A C B A C C
Y| B B C C C A A A B B
I wanna count the number of elements for each of these conditions: when X=A
and Y=A; when X=A and Y=B; when X=A and
2008 Apr 17
1
how to use a function in aggregate which accepts matrix and outputs matrix?
Dear netters, suppose I have a matrix X [1,] 'c1' 'r6' '150'[2,] 'c1' 'r4' '70'[3,] 'c1' 'r2' '20'[4,] 'c1' 'r5' '90'[5,] 'c2' 'r2' '20'[6,] 'c3' 'r1' '10'I want to apply some funciton to groups of rows by the first column.If the function is just to
2007 Jul 18
2
memory error with 64-bit R in linux
Hi netters,
I'm using the 64-bit R-2.5.0 on a x86-64 cpu, with an RAM of 2 GB. The
operating system is SUSE 10.
The system information is:
-uname -a
Linux someone 2.6.13-15.15-smp #1 SMP Mon Feb 26 14:11:33 UTC 2007 x86_64
x86_64 x86_64 GNU/Linux
I used heatmap to process a matrix of the dim [16000,100]. After 3 hours
of desperating waiting, R told me:
cannot allocate vector of size
2007 Jun 05
1
rJava installation under linux: configuration failed
Hi netter,
Recently I was trying to install rJava. The operating system is suse 10.0,
and the R versionis 2.5.0.
Following the instructions of R Wiki for rJava, I did configuration first:
R CMD javareconf
and then it showed a series of information, from what it seems that java is
in the system and the configuration succeeded.
Then I tried to install rJava:
2005 Dec 12
2
store and retrieve object names in a vector
hi netters,
suppose i have a series of objects X1, X2, B1,C1........... they all have
the same dimensions. i want to combine into one by using cbind:
y<-cbind(X1,X2,B1,C1.....)
but i don't want to type the names of these objects one by one. instead,
i've put their names into a vector: x<-c("X1","X2","B1","C1",....)
i used y<-cbind(x).
2005 May 30
3
how to "singlify" entries
hi netters
I have a rather simple question. I have a data frame with two variables X
and Y, both of which are factors. X has 100 levels while Y has 10 levels
only. The data frame has 100 rows in all, so for X the values are unique,
and Y has many replicate values. Now I wanna reduce the data frame into 10
rows only, according to the 10 levels of Y. I don't care which value of X
is in
2005 Jul 12
2
how to generate argument from a vector automatically
hi netters
i have a vector NAMES containing a series of variable names:
NAMES=c(x,r,z,m,st,qr,.....nn).
i wanna fit a regression tree by using the code:
my.tree<-tree(y~x+r+z+m+....nn,my.dataframe)
but i don't want to type out "x+r+z+m+....+nn" one by one, as there are so
many variables. besides, sometimes i wanna put the code in a function. so i
need to have the
2005 Aug 26
2
learning decision trees with one's own scoring functins
Hi netters,
I want to learn a decision tree from a series of instances (learning data).
The packages
tree or rpart can do this quite well, but the scoring functions (splitting
criteria) are
fixed in these packages, like gini or something. However, I'm going to use
another scoring
function.
At first I wanna modify the R code of tree or rpart and put my own scoring
function in. But it
2008 Sep 22
4
sort a data matrix by all the values and keep the names
Dear all,
If I have a data frame x<-data.frame(x1=c(1,7),x2=c(4,6),x3=c(8,2)):
x1 x2 x3
1 4 8
7 6 2
I want to sort the whole data and get this:
x1 1
x3 2
x2 4
x2 6
x1 7
x3 8
If I do sort(X), R reports:
Error in order(list(x1 = c(1, 7), x2 = c(4, 6), x3 = c(8, 2)), decreasing = FALSE) :
unimplemented type 'list' in 'orderVector1'
The only way
2010 Oct 25
2
Find index of a string inside a string?
Hi,
I am searching for the equivalent of the function Index from SAS.
In SAS: index("abcd", "bcd") will return 2 because bcd is located in the 2nd cell of the abcd string.
The equivalent in R should do this:
> myIndex <- foo("abcd", "bcd") #return 2.
What is the function that I am looking for?
I want to use the return value in substr, like I do
2005 Mar 26
0
learning networks with a large number of variables andpre-set parents.
I didn't go into details when I asked the question for feat that I would
overly specific and blur my real goals.
The links between variables are defined as conditional probability
distributions. So if the probability distribution of a variable X's value
is conditioned on the probability distribution of the values of Y and Z, we
say Y and Z are X's parents, and in the network,
2006 Oct 13
4
a correlation matrix subset where the subset avg is a maximum
Hello R group,
Given a correlation matrix, I would like to obtain the best subset of
pairs in the matrix of some size > n such that the mean of r for that
subset is a maximum compared to any other possible subset of size > n.
I've been looking at the deal and subselect packages but they don't seem
to do what I need. Does anyone have any suggestions?
Thanks in advance,
Ryan