similar to: dataframe subset

Displaying 20 results from an estimated 20000 matches similar to: "dataframe subset"

2003 Apr 15
4
graphics output produces corrupt/empty files
Hi there. as a newbie I figured out the basics of R but when it came to graphic output it was a bad surprise. All but the X11 device do not function properly. Most of them generate an empty file and pdf() produces non valid output (gs and acroread as well). Very annoying, because I have to write a report. I got the recent stable binary rpm (v1.6.2) from CRAN on a SuSE Linux 8.1. Any help? CU
2004 Oct 11
1
Bug#275946: Acknowledgement (newline not recognized when logcheck sends emails)
I upgraded to 1.2.28, same results. Here are the rules I added. ^\w{3} [ :0-9]{11} [._[:alnum:]-]+ perdition\[[0-9]+\]: Connect: ^\w{3} [ :0-9]{11} [._[:alnum:]-]+ pure-ftpd: [^[:space:]]+ \[NOTICE\] ^\w{3} [ :0-9]{11} [._[:alnum:]-]+ pure-ftpd: [^[:space:]]+ \[INFO\] ^\w{3} [ :0-9]{11} [._[:alnum:]-]+ exact\[[0-9]+\]: ^\w{3} [ :0-9]{11} [._[:alnum:]-]+ slapd\[[0-9]+\]: ^\w{3} [ :0-9]{11}
2013 Feb 17
2
How to findout the name of a dataframe
Let'say we have a dataframe mydata with column v1. If mydata$v1 is passed to a function, is there way, then, to extract the name of the dataframe? What I now do is passing the name of the dataframe to the funcion, so passing two parameters. Maybe with mydata$v1 it is not possible, but with mydata['v1'] or mydata[,'v1'] it is? Thanks Frans ------------------- Frans Marcelissen
2007 Apr 16
2
indexing a subset dataframe
Hello I am having problems indexing a subset dataframe, which was created as: > waspsNoGV<-subset(wasps,site!="GV") Fitting a linear model revealed some data points which had high leverage, so I attempted to redo the regression without these data points: >wasps.lm<-lm(r~Nt,data=waspsNoGV[-c(61,69,142),]) which resulted in a "subscript out of bounds" error. I'm
2011 Aug 13
2
How do I subset a dataframe
I have a dataframe zeespan. One of the columns has the name "customer". The data in the customer column is text. I would like to return a subset of the dataframe with all rows that DON'T begin with either "ibm" or "exxon", or "sears" in the customer column. I tried .... subset(zeespan, customer != c("ibm" | "exxon" |
2002 Jul 03
2
operating on a subset of a dataframe
Hi everyone, I've got a dataframe with columns of different types. A certain number of columns in the dataframe hold the results of a series of Likert-type items. I've got a function that will print a simple table of frequencies and I want to apply that function to those columns of the dataframe only. What's the best approach? -Tim -- Tim Wilson | Visit Sibley online: |
2010 Dec 03
2
subset of a dataframe
HI, I have a dataframe like this: name type A t1 B t2 C t1 D t1 E t3 F t2 how can I have a "sub dataframe" based with the column "type" like this: (for type = t1) name type A t1 C t1 D t1 (for type = t2) name type B t2 F t2 Thanks -- View this message
2003 Jan 23
1
subset dataframe based on rows
I want to subset the dataframe based on certain values in a row. for each row in my dataframe if ANY one value of a particular set of columns satisfies cond append a logical value true at the end of the row else append a false at the end of the row in the end I want to be able to subset the whole data based on the appended true or false value. I could literally code like this, but I think
2009 May 26
2
Problem accessing "row number" from subset on a dataframe
I would like to use the "row number" information returned from performing a subset command on a dataframe. For example, I would like to automatically delete some rows from a dataframe if they match a criteria. Here is my example below. data(airquality) names(airquality) subset(airquality, airquality$Month == 6) Now how do I delete the row numbers returned automatically? I
2011 Jun 30
1
Match strings across two differently sized dataframes and copy corresponding row to dataframe
Hello- Sorry, this is a bit of a noob question, but I can't seem to progress it any further. I have two dataframes which contain a series of strings which exactly match. The problem is one has more rows than the other (more cases have been added) and they have been sorted so that they are not in the same order. The smaller dataframe, though, contains in another column which has codes
2012 Apr 25
1
fill a dataframe with zeros where the rows are a smaller subset of a larger dataframe (species by site)
row <- c("a","b","c","d","e","f","g") #rows from larger data frame row.1 <- c("a","b","c","g") #rows of smaller data frame because d, e, and f don't contain any of the species, but the zeros are important x <- data.frame(sp1=rnorm(4), sp2=rnorm(4), sp3=rnorm(4),
2006 Feb 24
2
Sorting a dataframe by one column?
Given the following dataframe: A=1:10 B=(a-5)^2 DATAFRAME=data.frame(A,B) How can I sort DATAFRAME increasing (or decreasing, for that matter) by B without making reference to A, or any other column? I've read ?order and ?sort but cannot seem to figure this out. Thank you. Mark
2006 Apr 29
1
splitting and saving a large dataframe
Hi, I searched for this in the mailing list, but found no results. I have a large dataframe ( dim(mydata)= 1297059 16, object.size(mydata= 145280576) ) , and I want to perform some calculations which can be done by a factor's levels, say, mydata$myfactor. So what I want is to split this dataframe into nlevels(mydata$myfactor) = 80 levels. But I must do this efficiently, that is, I
2011 Mar 22
1
help need on working in subset within a dataframe
Dear R-experts Execuse me for an easy question, but I need help, sorry for that. >From days I have been working with a large dataset, where operations are needed within a component of dataset. Here is my question: I have big dataset where x1:.....x1000 or so. What I need to do is to work on 4 consequite variables to calculate a statistics and output. So far so good. There are more vector
2009 May 04
1
Please help me subset this dataframe, thanks...
Dear R-helpers, I have a dataframe called "trackpoints" with several columns including a column called "time", eg: > trackpoints ?? ? ? ?time 1 ? 12:00:00 2 ? 12:00:01 3 ? 12:00:02 . . . 298 12:04:57 299 12:04:58 300 12:04:59 I also have a dataframe called "data" that contains columns called "ID", "start" and "stop", eg: > data
2009 Sep 14
3
Eliminate cases in a subset of a dataframe
Hi folks, I created a subset of a dataframe (i.e., selected only men): subdata <- subset(data,data$gender==1) After a residual diagnostic of a regression analysis, I detected three outliers: linmod <- lm(y ~ x, data=subdata) plot(linmod) Say, the cases 11,22, and 33 were outliers. Here comes the problem: When I want to exclude these three cases in a further regression analysis, - for
2002 Jan 30
2
dataframe subset using levels as criteria
Hi I want to select a subset of a dataframe (frota) using the levels of a factor (matricula) in another dataframe (logbooks.eda). The command and output are bellow and it's not working ... > frota[frota$matricula==levels(logbooks.eda$matricula),] matricula compff tab potmaq 1 A-1420-C NA NA NA 2 A-1676-C NA 12070 36800 3 A-1954-C 3214 17565 44100 4 A-1961-C
2009 Apr 14
2
subset dataframe by rows using character vector?
Dear List, I'm stuck on what seems like a simple indexing problem, I'd be very grateful to anyone willing to help me out. I queried a dataframe which returns a character vector called "plot". I have another dataframe from which I want to subset or select only those rows that match "plot". I've tried subset, and also the "which" command. plot
2009 May 24
1
subset dataframe by number of rows of equal values
Hi R helpers! I have the following dataframe ?choose? choose<-data.frame(firm=c(1,1,2,2,2,2,3,3,4,4,4,4,4,4), year=c(2000,2001,2000,2001,2002,2003,2000,2003,2001,2002,2003,2004,2005,2006),code=c(10,10,11,11,11,11,12,12,13,13,13,13,13,13)) choose I want to subset it to obtain another one with those observations for which there more than 2 observations in the column ?code?. So I want a
2003 Apr 07
5
How to sort a dataframe?
Dear R-helpers, for the purpose of plotting a dataframe, i am trying to sort a dataframe by one column, for example tester <- data.frame(one=c(3,2,1), two=c(2,3,1)) #> tester # one two #1 3 2 #2 2 3 #3 1 1 # I want to sort "tester" by column "one", so that i get a dataframe # that looks like: #one two #1 1 #2 3 #3 2 I know of 'sort' but