thr3ads.net - similar to: "subsetting a dataframe"

Displaying 20 results from an estimated 4000 matches similar to: "subsetting a dataframe"

Maintaining factors when copying from one data frame to another

2005 Dec 07

Greetings all: OK, this is bugging the @#@%* out of me. I know the answer is simple and straightforward but for the life of me I cannot find it in the documentation, in the archives, or in my notes (because I know I've encountered this in the past). My problem is: I have a data frame with columns A, B, C, D, and E. A, B, and E are factors and C and D are numeric. I need a new data frame with

insert insertRow?

2006 Jul 21

Dear all, In the search for a command to insert a row between other rows in a data frame I found that there seems to be no such command in the base R package. There is however a very simple function insertRow in the micEcon package, that makes use of rbind. I wondered if it would not be possible to include the following micEcon functions in the base package: insertRow, insertCol. Since the

Help with parsing a data file

2008 Mar 06

Hi All, I need to parse data from a file, example shown below. The first two lines can be skipped, the third line contains the column names. The next 13 lines can be skipped. The next line "1991" is a year value, with the following 13 values data for that year. The file then repeats this format with (year, 13 lines of data for that year). I would ideally like to end up with an

eliminating a large subset of data from a frame

2003 Sep 05

I have a data frame with 155,000 rows. One of the columns represents the user id (of which about 10,000 are unique). I am able to isolate 1000 of these user ids (stored in a list) that I want to eliminate from the data set, but I don't know of an efficient way to do this. Certainly this would be slow: newdf<-df for(i in listofbadusers) { newdf<-subset(tmp,uid!=i) } is there a better
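
One common way to avoid the loop in the excerpt above is a single vectorised membership test with `%in%`. The following is a minimal sketch, not the answer given in the thread: the column name `uid` and the list `listofbadusers` come from the post, while the sample values are made up for illustration.

```r
# Hypothetical stand-in data; only the names uid and listofbadusers
# are taken from the post.
df <- data.frame(uid = c(1, 2, 3, 2, 4, 1), value = 1:6)
listofbadusers <- list(2, 4)

# %in% checks every row against the whole set of unwanted ids at once,
# so no loop over the ids is needed. unlist() turns the list into a vector.
newdf <- df[!(df$uid %in% unlist(listofbadusers)), ]
```

With real data the same line applies unchanged; only the construction of `df` and `listofbadusers` would differ.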

extract date time from a text file

2010 Jun 18

I have a text file where every line looks like this: "2007-12-03 13:50:17 Juan Perez" ("yy-mm-dd hh:mm:ss First Name Second Name"). I would like to make a data frame with two columns, one for the date and the other for the name. When I used read.delim, the file was read into a data frame with 4 columns. Bye, Sebastián.
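
Because the timestamp in the example line has a fixed width of 19 characters, one possible approach is to cut each line by position rather than split it on whitespace. This is a sketch under that assumption; the second sample line and the variable names are invented for illustration, and with a real file the lines would come from `readLines()`.

```r
# Sample lines; the first is from the post, the second is made up.
txt <- c("2007-12-03 13:50:17 Juan Perez",
         "2008-01-15 08:05:00 Ana Lopez")

# Characters 1-19 hold the timestamp, character 20 is the separating
# space, and everything from character 21 onward is the name.
stamp <- substr(txt, 1, 19)
name  <- substring(txt, 21)

df <- data.frame(
  date = as.POSIXct(stamp, format = "%Y-%m-%d %H:%M:%S", tz = "UTC"),
  name = name,
  stringsAsFactors = FALSE
)
```

The time zone is set to UTC here only to make the result reproducible; the post does not say which zone the timestamps are in.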

Recreate new dataframe based on condition

2006 Jul 14

Hi, How can I achieve this in R. Dataset is as follows: >df x 1 2 2 4 3 1 4 3 5 3 6 2 structure(list(x = c(2, 4, 1, 3, 3, 2)), .Names = "x", row.names = c("1", "2", "3", "4", "5", "6"), class = "data.frame") I want to recreate a new data frame whose rows are sum of (1&2, 3&4, 5&6)
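
The pairwise sums asked for above can be sketched with a grouping index and `rowsum()`. This is one possible approach, not necessarily the one suggested in the thread, and it assumes the number of rows is even, as in the posted data.

```r
# The data frame from the post.
df <- data.frame(x = c(2, 4, 1, 3, 3, 2))

# Label rows 1-2 as group 1, rows 3-4 as group 2, and so on.
pair <- (seq_len(nrow(df)) + 1) %/% 2

# rowsum() adds up the values within each group; as.vector() drops the
# one-column matrix structure it returns.
newdf <- data.frame(x = as.vector(rowsum(df$x, pair)))
```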

Normalizing grouped data in a data frame

2007 Nov 09

Hi I am a newbie to R but have tried a number of ways in R to do this and can't find a good solution. (I could do it out of R in perl or awk but would like to know how to do this in R). I have a large data frame 49 variables and 7000 observations however for simplicity I can express it in the following data frame Base, Image, LVEF, ES_Time A, 1, 4.32, 0.89 A, 2, 4.98, 0.67 A, 3, 3.7, 0.5

Unexpected behavior in factor level ordering

2012 Feb 25

Hello, Everybody: This may not be a "bug", but for me it is an unexpected outcome. A factor variable's levels do not retain their ordering after the levels function is used. I supply an example in which a factor with values "BC" "AD" (in that order) is unintentionally re-alphabetized by the levels function. To me, this is very bad behavior. Would you agree? #

One step way to create data frame with variable "variable names"?

2011 Nov 11

Suppose plotx <- "someName" modx <- "otherName" plotxRange <- c(10,20) modxVals <- c(1,2,3) It often happens I want to create a dataframe or object with plotx or modx as the variable names. But can't understand syntax to do that. I can get this done in 2 steps, creating the data frame and then assigning names, as in newdf <- data.frame( c(1, 2, 3, 4),

Error in names(x) <- value: 'names' attribute must be the same length as the vector

2012 May 14

Dear R-helpers, I am stuck on an error in R: When I run my code (below), I get this error back: Error in names(x) <- value : 'names' attribute must be the same length as the vector Then when I use traceback(), R gives me back this in return: `colnames<-`(`*tmp*`, value = c("Item", "Color", "Number", "Size")) I'm not exactly

predictOMatic for regression. Please try and advise me

2012 Apr 20

I'm pasting below a working R file featuring a function I'd like to polish up. I'm teaching regression this semester and every time I come to something that is very difficult to explain in class, I try to simplify it by writing an R function (eventually into my package "rockchalk"). Students have a difficult time with predict and newdata objects, so right now I'm

still problems with predict!

2004 Jul 16

Hi all, I still have problems with the predict function by setting up the values on which I want to predict ie: original df: p1 (193 obs) variates y x1 x2 rm(list=ls()) x1<-rnorm(193) x2<-runif(193,-5,5) y<-rnorm(193)+x1+x2 p1<-as.data.frame(cbind(y,x1,x2)) p1 y x1 x2 1 -0.6056448 -0.1113607 -0.5859728 2 -4.2841793 -1.0432688 -3.3116807 ...... 192

Selecting cases from matrices stored in lists

2011 Aug 22

Hi, I have two lists (c and h - see below) containing matrices with similar cases but different values. I want to split these matrices into multiple matrices based on the values in h. So, I did the following: years<-c(1997:1999) for (t in 1:length(years)) { year=as.character(years[t]) h[[year]]<-sapply(colnames(h[[year]]), function(var)

predict.glm returns different results for the same model

2018 Apr 27

Hi all, Very surprising (to me!) and mystifying result from predict.glm(): the predictions vary depending on whether or not I use ns() or splines::ns(). Reprex follows: library(splines) set.seed(12345) dat <- data.frame(claim = rbinom(1000, 1, 0.5)) mns <- c(3.4, 3.6) sds <- c(0.24, 0.35) dat$wind <- exp(rnorm(nrow(dat), mean = mns[dat$claim + 1], sd = sds[dat$claim + 1])) dat <-

Can't understand syntax

2012 Jul 14

OK, I need help!! I've been searching, but I don't understand the logic of some of this dataframe addressing syntax. What is this type of code called? test[["v3"]][is.na(test[["v2"]])] <- 10 #choose column v3 where column v2 is == 4 and replace with 10 and where is it documented? The code below works for what I want to do (find the non-missing value in a row),
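
The statement quoted in the excerpt above combines column extraction with `[[` and logical-index assignment; both operators are described on the `?Extract` help page. The sketch below shows what it does on a small made-up data frame. Only the column names `v2` and `v3` come from the post.

```r
# Hypothetical data; only the column names v2 and v3 are from the post.
test <- data.frame(v2 = c(1, NA, 3), v3 = c(5, 6, 7))

# is.na(test[["v2"]]) is a logical vector marking rows where v2 is missing.
# Indexing v3 with it and assigning replaces v3 only in those rows.
test[["v3"]][is.na(test[["v2"]])] <- 10
```

After the assignment, `v3` holds 5, 10, 7: only the row whose `v2` is `NA` was changed. The condition tests for missing values, not for `v2 == 4` as the comment in the post says.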

Arranging column data to create plots

2017 Jul 16

On Sat, 15 Jul 2017, Michael Reed via R-help wrote:
> Dear All,
>
> I need some help arranging data that was imported.

It would be helpful if you were to use dput to give us the sample data since you say you have already imported it.

> The imported data frame looks something like this (the actual file is
> huge, so this is example data)
>
> DF:
> IDKey X1 Y1 X2 Y2

Problem with predict and lines in plotting binomial glm

2011 Sep 21

Problems with predict and lines in plotting binomial glm Dear R-helpers, I have found quite a lot of tips on how to work with glm through this mailing list, but still have a problem that I can't solve. I have a data set in which the x-variable is count data and the y-variable is proportional data, and I want to know what the relationship between the variables is. The data was

Arranging column data to create plots

2017 Jul 16

Dear All, I need some help arranging data that was imported. The imported data frame looks something like this (the actual file is huge, so this is example data) DF: IDKey X1 Y1 X2 Y2 X3 Y3 X4 Y4 Name1 21 15 25 10 Name2 15 18 35 24 27 45 Name3 17 21 30 22 15 40 32 55 I would like to create a new data frame with the following NewDF: IDKey X Y Name1 21 15 Name1

Transforming 4x3 data frame into 2 column df in R

2013 Feb 01

I have the following data frame: > foo w x y z n 1.51550092 1.4337572 1.2791624 1.1771230 q 0.09977303 0.8173761 1.6123402 0.1510737 r 1.17083866 1.2469347 0.8712135 0.8488029 What I want to do is to change it into : > newdf 1 n w 1.51550092 2 q w 0.09977303 3 r w 1.17083866 4 n x 1.43375725 5 q x 0.81737606 6 r x
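
The row order visible in the truncated target above (n w, q w, r w, n x, ...) matches column-major order, so one way to sketch the reshaping is to convert the data frame to a matrix and then to a table. This is an illustrative approach, not the thread's answer; the output column names `row`, `col`, and `value` are made up.

```r
# The data frame from the post, with n, q, r as row names.
foo <- data.frame(
  w = c(1.51550092, 0.09977303, 1.17083866),
  x = c(1.4337572, 0.8173761, 1.2469347),
  y = c(1.2791624, 1.6123402, 0.8712135),
  z = c(1.1771230, 0.1510737, 0.8488029),
  row.names = c("n", "q", "r")
)

# as.table() keeps the row and column labels; as.data.frame() then emits
# one row per cell, walking down each column in turn.
newdf <- as.data.frame(as.table(as.matrix(foo)), stringsAsFactors = FALSE)
names(newdf) <- c("row", "col", "value")
```

This yields one row for each of the 12 cells, with the row label, the column label, and the value.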
