similar to: Merging data frames of different length

Displaying 20 results from an estimated 60000 matches similar to: "Merging data frames of different length"

2009 Apr 23
4
rbind data frames stored in a list
Hello everyone! I have a list X with 3 elements, each of which is a data frame, for example: a<-data.frame(a=1,b=2,c=3) b<-data.frame(a=c(4,7),b=c(5,8),c=c(6,9)) c<-data.frame(a=c(10,13,16),b=c(11,14,17),c=c(12,15,18)) X<-list() X[[1]]<-a X[[2]]<-b X[[3]]<-c (X) How can I most effectively transform X into a data frame with columns a, b, and c? I would love to find a generic
2009 Mar 26
2
Analogy for %in% for the whole columns (rather than individual values)
Hello! I have a matrix a with 2 variables (see below) that contain character strings. I need to create a 3rd variable that contains True if the value in column x is equal to the value in column y. The code below does it. a<-data.frame(x=c("john", "mary", "mary", "john"),y=c("mary","mary","john","john"))
2008 Sep 08
7
Question about multiple regression
Dear R-list, maybe some of you could point me in the right direction: Are you aware of any FREE Fortran or Java libraries/actual pieces of code that are VERY efficient (time-wise) in running the regular linear least-squares multiple regression? More specifically, I have to run small regression models (between 1 and 15 predictors) on samples of up to N=700 but thousands and thousands of them. I
2009 Feb 16
2
Printing out a graph using different graphics devices
Hello, everyone! The code below allows me to produce the graph I want (I know - the colors are strange, but it's just for the sake of an example). After you run the plot<- part and then do print(plot) - that's what I want. However, when I run the bits of code below (with graphics devices) - what they print is different from the original plot. In .png, .emf, and .tiff - my dots change
2009 Feb 11
2
changing settings on a barchart (lattice)
Hello! I apologize - I never used lattice before, so my question is probably very basic - but I just can't find the answer in the archive nor in the documentation: I have a named numeric vector p of 6 numbers (of the type 6 numbers with people's names to whom those numbers belong). I want a simple bar chart. I am doing: library(lattice) trellis.par.set(fontsize=list(text=12)) #
2009 Feb 13
2
tapply bug? - levels of a factor in a data frame after tapply are intermixed
Hello! I have encountered a really weird problem. Maybe you've encountered it before? I have a large data frame "importances". It has one factor ($A) with 3 levels: 3, 9, and 15. $B is a regular numeric variable. Below I am picking a really small sub-frame (just 3 rows) based on "indices". "indices" were chosen so that all 3 levels of A are present:
2009 Feb 18
1
interaction.plot - gridlines and formatting legend title...
Thank you for providing advice on this graphics question. I am building an interaction.plot. d=data.frame(xx=c(3,3,2,2,1,1),yy=c(4,3,4,3,4,3),zz=c(5.1,4.2,4.4,3.5,3.3,-1.1,-1.3) d[[1]]<-as.factor(d[[1]]) d[[2]]<-as.factor(d[[2]]) print(d) interaction.plot(d$xx, d$yy, d$zz, type="b", col=c("red","blue"), legend=F, lty=c(1,2), lwd=2, pch=c(18,24),
2009 Apr 28
4
Producing customized tickmarks when producing a graph using "curve"
Hello! I am using function "curve" to create a line graph. I was wondering, if it's possible to "turn off" the default tick marks and introduce those tick marks in specific locations. For example, currently in my X axis tick marks are (automatically) at 10, 11, 12, 13 but I want them to be in 5 specific locations, like 9.89, 10.34, etc. Any hint would be greatly
2009 Mar 04
2
adding value labels on Interaction Plot
Hello - and sorry for what might look like a simple graphics question. I am building an interaction plot for d: d=data.frame(xx=c(3,3,2,2,1,1),yy=c(4,3,4,3,4,3),zz=c(5.1,4.4,3.5,3.3,-1.1,-1.3)) d[[1]]<-as.factor(d[[1]]) d[[2]]<-as.factor(d[[2]]) print(d) interaction.plot(d$xx, d$yy, d$zz, type="b", col=c("red","blue"), legend=F, lty=c(1,2), lwd=2,
2009 Apr 23
2
transposing a matrix - row by row?
Hello, I have a matrix that is a product of tapply on a larger data set. Let's assume it looks like this: X<-matrix(c(10,20,30,40,50,60),2,3) dimnames(X)<-list(c("1","2"),c("1","2","3")) (X) 1 2 3 1 10 30 50 2 20 40 60 Is there an efficient way of transforming this matrix into the following matrix: rows columns entries 1
2009 May 18
1
Barchart in lattice - wrong order of groups, data labels on top of each other, and a legend question
Hello! I have a question about my lattice barchart that I am trying to build in Section 3 below. I can't figure out a couple of things: 1. When I look at the dataframe "test" that I am trying to plot, it looks right to me (the group "Total" is always the first out of 5). However, in the chart it is the last. Why? 2. How can I make sure the value labels (on y) are not
2009 Apr 10
1
Random Forests: Question about R^2
Dear Random Forests gurus, I have a question about R^2 provided by randomForest (for regression). I don't succeed in finding this information. In the help file for randomForest under "Value" it says: rsq: (regression only) - "pseudo R-squared'': 1 - mse / Var(y). Could someone please explain in somewhat more detail how exactly R^2 is calculated? Is "mse"
2009 Feb 10
1
Putting values and axis X labels on the charts based on allEffects
Dear everybody! Need help with graphics. I am runnig a simple lm and then using allEffects from 'effects' package: require(effects) model<-lm(Y~A+B, data=mydataframe) I am trying to build (for each predictor - A and then B) a plot of means on Y. I was successful doing it like this - in one swoop: ml.eff<-allEffects(ml1, se=F) plot(ml.eff,ylab="Title of Y") Is it
2009 Apr 20
7
Fitting linear models
I am not sure if this is an R-users question, but since most of you here are statisticians, I decided to give it a shot. I am using the lm() function in R to fit a dependent variable to a set of 3 to 5 independent variables. For this, I used the following commands: >model1<-lm(function=PBW~SO4+NO3+NH4) Coefficients: (Intercept) SO4 NO3 NH4 0.01323 0.01968
2010 Nov 09
2
Merging data frames one of which is NULL
Hello! I am running a loop. The result of each run of the loop is a data frame. I am merging all the data frames. For exampe: The dataframe from run 1: x<-data.frame(a=1,b=2,c=3) The dataframe from run 2: y<-data.frame(a=10,b=20,d=30) What I want to get is: merge(x,y,all.x=T,all.y=T) Then I want to merge it with the output of the 3rd run, etc. Unfortunately, I can't create the
2009 Mar 27
1
asking advice for Integer Programming packages
Dear everyone, I don't know much about Integer Programming but am afraid I am facing a problem that can only be solved via Integer Programming. I was wondering if those of you who have experience with it could recommend an R package. I found the following R packages: Rglpk glpk lpSolve lpSolveAPI Are there any others? Are some of them easier to use than others for a beginner? Any advice
2009 Feb 09
1
percentage of variance explained by factors
Hello! I've run a simple linear model: result<-lm(DV~A+B+C,data=Data) My Data$A,Data$B, and Data$C are factors. So, lm automatically recoded them into dummy variables. I have all the results I need but one. Question: Where could I see the variance explained by all A dummy variables together, then all B dummy variables together, and all C dummy variables together - when other predictors
2009 Apr 20
1
Random Forests: Predictor importance for Regression Trees
Hello! I think I am relatively clear on how predictor importance (the first one) is calculated by Random Forests for a Classification tree: Importance of predictor P1 when the response variable is categorical: 1. For out-of-bag (oob) cases, randomly permute their values on predictor P1 and then put them down the tree 2. For a given tree, subtract the number of votes for the correct class in the
2009 May 16
2
Question about barplot: gridlines & value labels
Hello! I promise I looked into help files before asking. Still cannot figure it out. I think it's because I am totally confused what packages use lettice, which use trellis, etc. Sections 1 and 2 below produce the data and the data to plot. My question is about barplot in Section 3. I am trying to: 1. add only horizontal gridlines and manipulate the type and color of that line. tck = 1 is not
2009 Apr 27
2
Problem with R 2.9.0 vs. R2.8.1 with either "survival" package or "predict" function
Just wanted to alert everyone regarding a problem that I only experience with R2.9.0 and not with R2.8.1 when I run both using exactly the same script and the same data. Here is what I do with my data "MyData", 2 predictors (V1 and V2) and my survival DV "Depvar": library(survival)