thr3ads.net - similar to: "how to subset rows using regular expression patterns"

Displaying 20 results from an estimated 4000 matches similar to: "how to subset rows using regular expression patterns"

An R clause to bind dataframes under certain contions

2008 Jan 26

An R clause to bind dataframes under certain contions

Hi netters, Suppose I have two data frames X and Y. X has three colnames A, B and C. Y has three colnames A,B and D. I want to combine them into one matrix, joining the rows having the same A and B values (X$A==Y$A and X$B = Y$B). So the resulting dataframe has four variables/columns: A,B,C and D. I was wondering what's the best way to do it in R. Could anyone give me some advice? Thanks!

Error: evaluation nested too deeply when doing heatmap with binary distfunction

2007 Jul 19

Error: evaluation nested too deeply when doing heatmap with binary distfunction

Hi netters, I have a matrix X of the size (1000,100). The values are from -3 to +3. When I tried heatmap(X, distfun=function(c),dist(c,method="bin"),hclustfun=function(m),hclust(m,method="average")) I got the error message: Error: evaluation nested too deeply: infinite recursion / options(expressions=)? However, if I used default parameters for distfunction:

locate the rows in a dataframe with some criteria

2008 Mar 07

locate the rows in a dataframe with some criteria

Hi, netters, This is probably a rookie question but I couldn't find the answer after hours of searching and trying. Suppose there'a a dataframe M: x y 10 A 13 B 8 A 11 A I want to locate the rows where x >=10 and y="A". I know how to do it to vectors by using which, but how to do it with the dataframe? Thank you very much! Zhihua Li

Comparison of aggregate in R and group by in mysql

2008 Jan 26

Comparison of aggregate in R and group by in mysql

Hi, netters, First of all, thanks a lot for all the prompt replies to my earlier question about "merging" data frames in R. Actually that's an equivalence to the "join" clause in mysql. Now I have another question. Suppose I have a data frame X with lots of columns/variables: Name, Age,Group, Type, Salary. I wanna do a subtotal of salaries: aggregate(X$Salary,

quotient and remainder

2005 Jun 23

quotient and remainder

hi netters Is there a function in R that can compute the quotient and remainder of a division calculation? such that when 11 is given as the dividend and 5 the divider, the function returns 2(quotient) and 1(remainder). Thanks a lot! _________________________________________________________________ 伱佲伔佈佅伮佋佖 MSN Explorer: http://explorer.msn.com/lccn/

how to change a dataframe with characters to a numeric matrix?

2005 Dec 08

how to change a dataframe with characters to a numeric matrix?

hi netters, i have a dataframe TEST like this: Y1 Y2 Y3 X1 4 7 8 X2 6 2 Z X3 8 0 1 i would like to change it to a numeric matrix, replacing "Z" with NA Y1 Y2 Y3 X1 4 7 8 X2 6 2 NA X3 8 0 1 i've tried the function data.matrix but it didn't work. is there any easy way to do this? thanks a lot!

working with R graphics remotely

2007 Jul 02

working with R graphics remotely

Hi netters, Now I'm connecting from my local windows machine to a remote linux machine and launch R out there using SSH. When I tried to create grahics, like using plot or heatmap, I cannot see the output. Maybe a new R window displaying the graphics has popped out in the remote machine? Or I need to change some settings for the graphics to display? I don't know. I googled it and

manipulating dataframe according to the values of some columns

2005 May 13

manipulating dataframe according to the values of some columns

hi netters, I'm a newbie to R and there are some very simple problems puzzeled me for two days. I've a dataframe here with several columns different in modes. Two of the columns are special for me: column 1 has the mode "factor" and column 2 has the mode "numeric vectors". The values for column 1 are either "T" or "F". I wanna do two things:

how to count "associated" factors?

2005 Jun 21

how to count "associated" factors?

hi netters Suppose I have a factor X, with 10 elements and 3 levels: A B B C A C B A C C . It is easy to count the number of elements for each level: tapply(X,X,length). Now I have another factor Y, which formed a matrix with X: X| A B B C A C B A C C Y| B B C C C A A A B B I wanna count the number of elements for each of these conditions: when X=A and Y=A; when X=A and Y=B; when X=A and

how to use a function in aggregate which accepts matrix and outputs matrix?

2008 Apr 17

how to use a function in aggregate which accepts matrix and outputs matrix?

Dear netters, suppose I have a matrix X [1,] 'c1' 'r6' '150'[2,] 'c1' 'r4' '70'[3,] 'c1' 'r2' '20'[4,] 'c1' 'r5' '90'[5,] 'c2' 'r2' '20'[6,] 'c3' 'r1' '10'I want to apply some funciton to groups of rows by the first column.If the function is just to

memory error with 64-bit R in linux

2007 Jul 18

memory error with 64-bit R in linux

Hi netters, I'm using the 64-bit R-2.5.0 on a x86-64 cpu, with an RAM of 2 GB. The operating system is SUSE 10. The system information is: -uname -a Linux someone 2.6.13-15.15-smp #1 SMP Mon Feb 26 14:11:33 UTC 2007 x86_64 x86_64 x86_64 GNU/Linux I used heatmap to process a matrix of the dim [16000,100]. After 3 hours of desperating waiting, R told me: cannot allocate vector of size

rJava installation under linux: configuration failed

2007 Jun 05

rJava installation under linux: configuration failed

Hi netter, Recently I was trying to install rJava. The operating system is suse 10.0, and the R versionis 2.5.0. Following the instructions of R Wiki for rJava, I did configuration first: R CMD javareconf and then it showed a series of information, from what it seems that java is in the system and the configuration succeeded. Then I tried to install rJava:

store and retrieve object names in a vector

2005 Dec 12

store and retrieve object names in a vector

hi netters, suppose i have a series of objects X1, X2, B1,C1........... they all have the same dimensions. i want to combine into one by using cbind: y<-cbind(X1,X2,B1,C1.....) but i don't want to type the names of these objects one by one. instead, i've put their names into a vector: x<-c("X1","X2","B1","C1",....) i used y<-cbind(x).

how to "singlify" entries

2005 May 30

how to "singlify" entries

hi netters I have a rather simple question. I have a data frame with two variables X and Y, both of which are factors. X has 100 levels while Y has 10 levels only. The data frame has 100 rows in all, so for X the values are unique, and Y has many replicate values. Now I wanna reduce the data frame into 10 rows only, according to the 10 levels of Y. I don't care which value of X is in

how to generate argument from a vector automatically

2005 Jul 12

how to generate argument from a vector automatically

hi netters i have a vector NAMES containing a series of variable names: NAMES=c(x,r,z,m,st,qr,.....nn). i wanna fit a regression tree by using the code: my.tree<-tree(y~x+r+z+m+....nn,my.dataframe) but i don't want to type out "x+r+z+m+....+nn" one by one, as there are so many variables. besides, sometimes i wanna put the code in a function. so i need to have the

learning decision trees with one's own scoring functins

2005 Aug 26

learning decision trees with one's own scoring functins

Hi netters, I want to learn a decision tree from a series of instances (learning data). The packages tree or rpart can do this quite well, but the scoring functions (splitting criteria) are fixed in these packages, like gini or something. However, I'm going to use another scoring function. At first I wanna modify the R code of tree or rpart and put my own scoring function in. But it

sort a data matrix by all the values and keep the names

2008 Sep 22

sort a data matrix by all the values and keep the names

Dear all, If I have a data frame x<-data.frame(x1=c(1,7),x2=c(4,6),x3=c(8,2)): x1 x2 x3 1 4 8 7 6 2 I want to sort the whole data and get this: x1 1 x3 2 x2 4 x2 6 x1 7 x3 8 If I do sort(X), R reports: Error in order(list(x1 = c(1, 7), x2 = c(4, 6), x3 = c(8, 2)), decreasing = FALSE) : unimplemented type 'list' in 'orderVector1' The only way

Find index of a string inside a string?

2010 Oct 25

Find index of a string inside a string?

Hi, I am searching for the equivalent of the function Index from SAS. In SAS: index("abcd", "bcd") will return 2 because bcd is located in the 2nd cell of the abcd string. The equivalent in R should do this: > myIndex <- foo("abcd", "bcd") #return 2. What is the function that I am looking for? I want to use the return value in substr, like I do

learning networks with a large number of variables andpre-set parents.

2005 Mar 26

learning networks with a large number of variables andpre-set parents.

I didn't go into details when I asked the question for feat that I would overly specific and blur my real goals. The links between variables are defined as conditional probability distributions. So if the probability distribution of a variable X's value is conditioned on the probability distribution of the values of Y and Z, we say Y and Z are X's parents, and in the network,

a correlation matrix subset where the subset avg is a maximum

2006 Oct 13

a correlation matrix subset where the subset avg is a maximum

Hello R group, Given a correlation matrix, I would like to obtain the best subset of pairs in the matrix of some size > n such that the mean of r for that subset is a maximum compared to any other possible subset of size > n. I've been looking at the deal and subselect packages but they don't seem to do what I need. Does anyone have any suggestions? Thanks in advance, Ryan

similar to: how to subset rows using regular expression patterns