Displaying 20 results from an estimated 9000 matches similar to: "how to cite R data sets"
2009 May 26
Problem accessing "row number" from subset on a dataframe
I would like to use the "row number" information returned from performing a subset command on a dataframe.
For example, I would like to automatically delete some rows from a dataframe if they match a criterion. Here is my example below.
subset(airquality, airquality$Month == 6)
Now how do I delete the row numbers returned automatically?
2007 Apr 01
Non-unique column names in data frames
Dear r-devel members,
It's just been brought to my attention that R permits non-unique column
names in data frames -- e.g., via assignment to names() or colnames(). This
behaviour is consistent with the help files (as I discovered), but it's not
consistent with the behaviour of rownames() and row.names(). For example,
row.names(airquality) <- rep("a", nrow(airquality))
2009 Aug 04
Stacked plots with common x-axis and different y-axis
Is there a place that shows how to create two plots that are stacked on top of each other, where they share a common x-axis scale but have different y-axis scales?
Say I have the following data: airquality
Stack plot(airquality$Day, airquality$Wind) on top of plot(airquality$Day, airquality$Temp).
I am interested in stacking the two on top of each other with no seam, or plotting the two lines with
2012 Feb 23
Sexpr not getting expanded in Sweave
An Sweave file, 'test.Rnw':
\title{Sweave minimal}
We try Sweave:
x <- airquality[1, 1]
I try Sexpr: \Sexpr{x}
We plot:
<<2, fig=TRUE, echo=FALSE >>=
boxplot(Ozone ~ Month, data = airquality)
I check the
2013 Jul 23
cbind error with check.names
Here is an example where cbind fails with an error when check.names=TRUE is set.
airQualityBind = cbind(airquality, airquality, check.names = TRUE)
I understand that cbind is a call to data.frame and the following works:
airQualityBind = data.frame(airquality, airquality, check.names = TRUE)
but I would like to understand why cbind throws an error.
I asked this question on SO here:
2008 Dec 23
Using transform to add a date column to a dataframe
I would like to add a column to the airquality dataset that contains the date
1950-01-01 in each row. This method does not appear to work:
> attach(airquality)
> data1 <- transform(airquality,Date=as.Date("1950-01-01"))
Error in data.frame(list(Ozone = c(41L, 36L, 12L, 18L, NA, 28L, 23L, 19L, :
arguments imply differing number of rows: 153, 1
I can't decipher what
2008 Sep 11
periodicity validation
I have a data series containing time at a fixed step and energy
varying with time. How can I test its periodicity? In R, there seem to be
no direct tools: I have searched the R manual for "periodic" and
have not found any related topic.
Thanks a lot
2008 Oct 20
calculating mean for samples
Hi everyone,
> does anyone know how I can calculate the mean for different samples?
> i.e. I have data like this:
>    s1 s2 s3 s4
> 1   0  0  0  1
> 2   1  0  1  0
> 3   0  0  0  0
> 4   0  0  0  0
> 5   0  1  0  1
> 6   1  0  0  0
> 7   0  0  0  0
> 8   0  0  0  0
> 9   0  0  0  0
> 10  0  0  0  1
> I need to make 5 different sample with 5
2007 Sep 16
Identifying objects from a data set
Given a data set called airquality, I want to identify the nature of the objects in it, for example "Ozone". Would it be best to use an is.* command such as is.character(airquality$Ozone)? I tried attributes(airquality$Ozone) but it came up NULL. Would there be a better way to identify these objects?
Thanking you in advance for your
2008 Nov 04
Zoo seems to be running slow in R 2.8.0 windows
R version 2.8.0 (2008-10-20)
LC_COLLATE=English_United States.1252;LC_CTYPE=English_United
States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] StreamMetabolism_0.01 chron_2.3-24 zoo_1.5-4
2005 Jul 01
Simple indexing conundrum
My apologies in advance for my thickness but I can't seem to solve the
following, seemingly simple, data manipulation problem:
I have a data frame that contains multiple factors and multiple
continuous response variables, but duplicates of some factor
combinations. The duplicates contain bad data, so I would like to
eliminate the duplicates. I would like to retain the entire rows
2008 Mar 24
Newbie help with Sweave
I think I've gotten my Emacs/Sweave/R system set up correctly, thanks to
Vincent and Jim, but I haven't been successful getting my first document
produced. I'm trying to use one of Friedrich Leisch's examples,
http://www.ci.tuwien.ac.at/~leisch/Sweave/example-1.Snw. I cut and
pasted the text into a document sweaveexample.Rnw in Emacs. It seemed to
be processed successfully with R:
2008 Sep 20
fitting a hyperbole
I have a data set of Gross Primary Productivity ~ Total
Suspended Solids; it is a hyperbola just like:
How do I model this relationship so that I can get all of the neat
things that lm gives (residuals, etc.) and see whether my
eyeball model stands up? Thanks for any help, pointers, or good
things to read.
Stephen Sefick
Research Scientist
2008 Aug 28
abline of an lm fit not correct
mac osx 10.5.4
R 2.7.1
I have fit a model
with an R^2 of 0.963
but when I issue the command
the line is below where it ought to be. Looks like the right slope,
but not the right intercept.
Stephen Sefick
Research Scientist
Southeastern Natural Sciences Academy
Let's not spend our time and resources thinking about things that are
so little or so large
2008 Oct 23
Reversing xlim qplot
I would like to be able to reverse the xlim on qplot
this is the code that I am using
qplot(a[,"River.Mile"], a[,26]
,ylab=colnames(a)[26], xlab="RiverMile", xlim=rev(c(60,
2008 Oct 29
Macro stuff to work on up through august 2007
Title says it all. Remember cast() with sum as the aggregation function.
Stephen Sefick
Research Scientist
Southeastern Natural Sciences Academy
Let's not spend our time and resources thinking about things that are
so little or so large that all they really do for us is puff us up and
make us feel like gods. We are mammals, and have not exhausted the
annoying little problems of being
2008 Nov 09
maptools sunrise sunset function
## This is a function that I am trying to write to calculate sunrise
## and sunset. It works "mostly", but returns nonsensical values. What
## am I missing? Thanks in advance.
### remember to include maptools as a dependency ###
sunrise.set <- function(lat, long, date, timezone="UTC", num.days=1){
#this needs to be long lat#
lat.long <- matrix(c(long, lat),
2008 Oct 22
plotting matrix
a <- 1:25  # same length as b, e and f, so that data.frame() accepts them
b <- rnorm(25)
e <- rnorm(25)
f <- rnorm(25)
g <- data.frame(b,e, a,f)
I would like to plot a against all possibilities and then shoot it out
to a pdf one graph per page. I think it would be okay to have this as
a lattice plot or a ggplot with many graphs per page. I can figure
all of that out I think, but I need something like
r <- as.matrix(g)
plot(.~a, data=r)
2004 Sep 19
Nnet: Returning the response
Hello list.
Maybe this is a simple question but I can't find the answer anywhere.
With lm I use the parameter y=TRUE to have the response returned in $y. Of course, namely because of NA's in the data frame, this might not include all the values in the original column. For example:
> data(airquality)
> length(airquality$Day)
[1] 153
> my.lm <- lm(Day~.,data=airquality,y=TRUE)
2017 Dec 19
lm considers removed predictors when finding complete cases
Dear R-devel list,
I realized that removing a predictor in lm through the "-" operator in
formula() does not affect the complete cases that are considered. A minimal
example is:
summary(lm(Wind ~ ., data = airquality))
# 42 observations deleted due to missingness
summary(lm(Wind ~ . - Ozone, data = airquality))
# still 42 observations deleted due to missingness, even if only 7