thr3ads.net - similar to: "Subsetting question"

Displaying 20 results from an estimated 9000 matches similar to: "Subsetting question"

redundant factor levels after subsetting a dataset

2009 Nov 12

redundant factor levels after subsetting a dataset

#I have a data frame with a numeric and a character variable. x=c(1,2,3,2,0,2,-1,-2,-4) md=c(rep("Miller",3), rep("Richard",3),rep("Smith",3)) data1=data.frame(x,md) #I subset this data.frame in a way such that one level of the character variable does not appear in the new dataset. data2=data1[x>0,] data3=subset(data1,x>0) #However, when I check the levels

Subsetting data by eliminating redundant variables

2011 Oct 19

Subsetting data by eliminating redundant variables

Dear All, I am new to R, I have one question which might be easy. I have a large data with more than 250 variable, i am reducing number of variables by redun function as in the example below, n <- 100 x1 <- runif(n) x2 <- runif(n) x3 <- x1 + x2 + runif(n)/10 x4 <- x1 + x2 + x3 + runif(n)/10 x5 <- factor(sample(c('a','b','c'),n,replace=TRUE)) x6 <-

transforming a .csv file column names as per a particular column rows using R code

2012 Oct 14

transforming a .csv file column names as per a particular column rows using R code

Hello all, I have a .csv file like below. Tool,Step_Number,Data1,Data2... etc up to 100 columns. A,1,0,1 A,2,3,1 A,3,2,1 . . B,1,3,2 B,2,1,2 B,3,3,2 . . ...... so on upto 50 rows where the column "*Tool*" has distinct steps in second column "*Step_Number*",but both have same entries in Step_Number column. I want the output like below.

Subsetting does not remove unwanted data in table

2011 Sep 07

Subsetting does not remove unwanted data in table

Dear all, This relatively routine analysis has left me frustrated and in a rut. I have a dataset (data1), which I subset in order to remove rows where HabitatDensity="Med". This dataset looks correct when I call it up, however, when I create a table out of the new subset (data2), my table continues to show the "Med" information as 0. This is a problem because I need a

Replace / swap values of subset of a data.frame

2009 May 19

Replace / swap values of subset of a data.frame

Dear R users, I have 1 data.frame of 1500x80 - data1. I found out that there are a few cells of data that I have misplace, and I need to fix the ordering of them. In an attempt trying to swap column 22 & 23 of the Subject with misplaced data, I did the following: > data2 <- data1 > subset(data1,(Subject==25 & Session==1))[,22] <- subset(data2,(Subject==25 &

combining collumns for data.frames

2010 Sep 06

combining collumns for data.frames

Hi This question is far less simple than the title suggests, please read carefully, thanks. I have 2 sets of data, both read into R >data1<-read.table ("1.txt", header=T, sep="\t") >data2<-read.table ("2.txt", header=T, sep="\t") >data1 Taxon stage1 stage2 stage3 stage4 T1 0 0 1 1 T2 0

using subset

2004 Dec 06

using subset

Hi, I have doubts in using subset command. I have a list of, lets say, 15 species and I want to make a subset with only 2 of them. I??m the command data2 <- subset(data1, species=="sp1"|species=="sp2") Nevertheless, when I ask for the summary (summary(data2)) the others species names still apearing. Also I tried data2<-data1(data1$species %in%

probem on merge data

2009 Nov 06

probem on merge data

Hi there, data1<-matrix(data=c(1,1.2,1.3,"3/23/2004",1,1.5,2.3,"3/22/2004",2,0.2,3.3,"4/23/2004",3,1.5,1.3,"5/22/2004"),nrow=4,ncol=4,byrow=TRUE) data1<-data.frame(data1) names(data1)<-c("areaid","x","y","date") data1 areaid x y date 1 1 1.2 1.3 3/23/2004 2 1 1.5 2.3 3/22/2004 3 2

subset in dataframes

2011 Oct 02

subset in dataframes

I need help in subseting a dataframe: data1<-data.frame(year=c(2001,2002,2003,2004,2001,2002,2003,2004, 2001,2002,2003,2004,2001,2002,2003,2004), firm=c(1,1,1,1,2,2,2,2,3,3,3,3,4,4,4,4),x=c(11,22,-32,25,-26,47,85,98, 101,14,87,56,12,43,67,54), y=c(110,220,302,250,260,470,850,980,1010,140,870,560,120,430,670,540)) data1 I want to keep the firms where all x>0 (where there are

plotting a double y axis when x and y lengths differ

2011 Nov 16

plotting a double y axis when x and y lengths differ

Hello All, Many thanks to the help I have received so far. Here is an example data set I hope to plot Data1 Year Data SE 1 2005 2 0.01 2 2006 4 0.01 3 2007 5 0.01 4 2008 2 0.01 5 2009 3 0.01 6 2010 6 0.01 Data2 Year Data SE 1 2006 32 1 2 2007 100 2 3 2008 60 4 4 2009 67 3 5 2010 8 1 Notice Data2 has one less years worth of data than Data1 (which is my

why the same values cannot be judged to be the same in R

2009 Nov 13

why the same values cannot be judged to be the same in R

Hi Rusers, I found sometimes that the same values cannot be judged to be the same in R. Anybody knows the probelm? I think i ignored some minor detail. Thanks. Here is the example. ############ data1<-matrix(data=c(1,1.2,1.3,"3/23/2004",1,1.5,2.3,"3/22/2004",2,0.2,3.3,"4/23/2004",3,1.5,1.3,"5/22/2004"),nrow=4,ncol=4,byrow=TRUE)

fuse_setlk_cbk error

2008 Nov 04

fuse_setlk_cbk error

I'm building a two node cluster to run vserver systems on. I've setup glusterfs with this config: # node a volume data-posix type storage/posix option directory /export/cluster end-volume volume data1 type features/posix-locks subvolumes data-posix end-volume volume data2 type protocol/client option transport-type tcp/client option remote-host

Passing options as lists

2002 Dec 05

Passing options as lists

Hi, I apologize if this has previously been posted. I've just subscribed to the R-help digest. I'm writing a plotting function that uses layout() to plot several different plots on the same device. This function uses plot(), image(), and a custom function that uses text(). Each cell of the layout needs different par() parameters, so what I'd like to do is pass them as lists:

problem merging two data sets ( one with a header and one without)

2008 Aug 21

problem merging two data sets ( one with a header and one without)

I have two set of data, Data1 and Data2 . Data1 has a header and Data2 does not. I would like to merge the two data sets after removing some columns from data2 . I am having a problem merging so I had to write and read final data and specify the ?header=F? so the merge can be done by?V1?. Is there a way to avoid this step. The problem is when I do cbind the FinalData has different column names

Merge dataframes

2011 Oct 07

Merge dataframes

Hello, I am having some problems to use the 'merge' function. I'm not sure if I got its working right. What I want to do is: 1) Suppose I have a dataframe like: height width 1 1.1 2.3 2 2.1 2.5 3 1.8 1.9 4 1.6 2.1 5 1.8 2.4 2) And I generate a second

Match 2 vectors

2010 May 28

Match 2 vectors

Hi, I have 2 dataframes of unequal length, and I would like to match a factor to them so that both dataframes will have the same number of rows. example: # create the 2 dataframes with unequal length data1 <- data.frame(letters, 1:26)[-c(5,10,19:21),] data2 <- data.frame(letters, 1:26)[-c(6,9,15:18),] data2a <- match(data1[,1], data2[,1]) data2b <- data2[data2a,] When I match

Problem with strptime

2012 May 27

Problem with strptime

Hello Forum, I have a problem with the strptime function. With the ''data1'' dataset below it works fine, but with the ''data2'' dataset something goes wrong (see final line below). Both data1 and data2 are in exactly the same original format, the only difference is that they span different dates. Please help, since it is driving me nuts! Many thanks. Best

Faster way to zero-pad a data frame...?

2006 May 30

Faster way to zero-pad a data frame...?

Hello List, I am working on creating periodograms from IP network traffic logs using the Fast Fourier Transform. The FFT requires all the data points to be evenly-spaced in the time domain (constant delta-T), so I have a step where I zero-pad the data. Lately I've been wondering if there is a faster way to do this. Here's what I've got: * data1 is a data frame consisting of

%in% operator - NOT IN

2011 May 08

%in% operator - NOT IN

Hello everyone, I am attempting to use the %in% operator with the ! to produce a NOT IN type of operation. Why does this not work? Suggestions? > data2[data1$char1 %in% c("string1","string2"),1]<-min(data1$x1) > data2[data1$char1 ! %in% c("string1","string2"),1]<-max(data1$x1)+1000 Error: unexpected '!' in "data2[data1$char1

ks.test "greater" and "less"

2006 Dec 15

ks.test "greater" and "less"

Hello r-group I have a question to the ks.test. I would expect different values for less and greater between data1 and data2. Does anybody could explain my point of misunderstanding the function? data1<-c(8,12,43,70) data2<- c(70,43,12,8) ks.test(data1,"pnorm") ks.test(data1,"pnorm",alternative ="less") #expected < 0.001

similar to: Subsetting question