similar to: how to cite R data sets

Displaying 20 results from an estimated 9000 matches similar to: "how to cite R data sets"

2009 May 26
2
Problem accessing "row number" from subset on a dataframe
I would like to use the "row number" information returned from performing a subset command on a data frame. For example, I would like to automatically delete some rows from a data frame if they match a criterion. Here is my example below. data(airquality) names(airquality) subset(airquality, airquality$Month == 6) Now how do I delete the row numbers returned automatically? I
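One possible way to approach the question above, as a minimal sketch: instead of recovering row numbers from subset(), either negate the condition or obtain the matching indices with which() and drop them by negative indexing. The names `keep`, `idx` and `dropped` are illustrative and not from the original post.

```r
data(airquality)

# Option 1: keep every row that does NOT match the criterion.
keep <- airquality[airquality$Month != 6, ]

# Option 2: find the matching row indices, then drop them.
# The length check guards against -integer(0), which would select no rows.
idx <- which(airquality$Month == 6)
dropped <- if (length(idx) > 0) airquality[-idx, ] else airquality

# Number of rows removed.
nrow(airquality) - nrow(keep)
```

Both options leave the original airquality data frame untouched and return a filtered copy.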
2007 Apr 01
1
Non-unique column names in data frames
Dear r-devel members, It's just been brought to my attention that R permits non-unique column names in data frames -- e.g., via assignment to names() or colnames(). This behaviour is consistent with the help files (as I discovered), but it's not consistent with the behaviour of rownames() and row.names(). For example, row.names(airquality) <- rep("a", nrow(airquality))
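The asymmetry described above can be reproduced with a small sketch. The data frame `df` is a made-up example; the error is captured with tryCatch() so the script keeps running.

```r
df <- data.frame(x = 1:3, y = 4:6)

# Assigning duplicate column names via names<- is accepted.
names(df) <- c("a", "a")
dup_cols <- anyDuplicated(names(df)) > 0

# Assigning duplicate row names is rejected with an error;
# capture it instead of stopping, and silence the accompanying warning.
row_result <- suppressWarnings(tryCatch({
  row.names(df) <- rep("a", nrow(df))
  "accepted"
}, error = function(e) "rejected"))

dup_cols
row_result
```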
2009 Aug 04
5
Stacked plots with common x-axis and different y-axis
Is there a place that shows how to create two plots that are stacked on top of each other where they share a common x-axis scale, but have different y-axis scales? Say I have the following data: airquality. Stack plot(airquality$Day, airquality$Wind) on top of plot(airquality$Day, airquality$Temp). I am interested in stacking the two on top of each other with no seam, or plotting the two lines with
2012 Feb 23
1
Sexpr not getting expanded in Sweave
An Sweave file, 'test.Rnw': \documentclass{article} \title{Sweave minimal} \author{MK} \begin{document} \maketitle We try Sweave: <<1>>= data(airquality) summary(airquality) x <- airquality[1, 1] @ I try Sexpr: \Sexpr{x} We plot: \begin{center} <<2, fig=TRUE, echo=FALSE >>= boxplot(Ozone ~ Month, data = airquality) @ \end{center} \end{document} I check the
2013 Jul 23
1
cbind error with check.names
Here is an example where cbind fails with an error when check.names=TRUE is set. data(airquality) airQualityBind = cbind(airquality, airquality, check.names = TRUE) I understand that cbind is a call to data.frame and the following works: airQualityBind = data.frame(airquality, airquality, check.names = TRUE) but I would like to understand why cbind throws an error. I asked this question on SO here:
2008 Dec 23
3
Using transform to add a date column to a dataframe
I would like to add a column to the airquality dataset that contains the date 1950-01-01 in each row. This method does not appear to work: > attach(airquality) > data1 <- transform(airquality,Date=as.Date("1950-01-01")) Error in data.frame(list(Ozone = c(41L, 36L, 12L, 18L, NA, 28L, 23L, 19L, : arguments imply differing number of rows: 153, 1 I can't decipher what
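A minimal sketch of one way to sidestep the "differing number of rows" error: repeat the date explicitly so that its length matches the number of rows. The copy `aq` is an illustrative name; whether transform() recycles a length-one value may depend on the R version, so this sketch avoids relying on it.

```r
data(airquality)
aq <- airquality  # work on a copy rather than the attached data set

# Repeat the date once per row so the lengths agree.
aq$Date <- rep(as.Date("1950-01-01"), nrow(aq))

str(aq$Date)
```

No attach() call is needed here, since the column is assigned directly on the data frame.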
2008 Sep 11
3
periodicity validation
I have a series of data containing time in fixed steps and an energy value that varies with time. How can I test its periodicity? In R, there seems to be no direct tool: I have searched the R manual for "periodic" and have not found any related topic. Thanks a lot
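As a hedged illustration of one common approach (not necessarily what the poster needs), the periodogram from the stats package can show whether a single frequency dominates. The series below is synthetic, with an assumed period of 20 time steps; `step` and `energy` are made-up names.

```r
set.seed(1)
step <- 1:200
# Synthetic series: a sine wave with period 20 plus a little noise.
energy <- sin(2 * pi * step / 20) + rnorm(length(step), sd = 0.2)

# Raw periodogram; frequencies are in cycles per time step.
pg <- spec.pgram(energy, taper = 0.1, detrend = TRUE, plot = FALSE)
dominant_period <- 1 / pg$freq[which.max(pg$spec)]

# The autocorrelation function gives a second view of the same structure.
ac <- acf(energy, lag.max = 40, plot = FALSE)

dominant_period
```

A sharp, isolated peak in pg$spec suggests periodic behaviour; a flat spectrum suggests none. Formal significance testing is outside the scope of this sketch.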
2007 Sep 16
1
Identifying objects from a data set
Hello, I have a data set called airquality. To identify the nature of the objects in the data set, for example "Ozone", would it be best to use the is.* commands, like is.character(airquality$Ozone)? I tried attributes(airquality$Ozone) but it came up NULL. Would there be a better way to identify these objects? Thanking you in advance for your
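A short sketch of the usual inspection functions, which may be more convenient than trying each is.* test in turn. attributes() returns NULL here because a plain vector carries no attributes. The name `col_classes` is illustrative.

```r
data(airquality)

class(airquality$Ozone)    # high-level class of one column
typeof(airquality$Ozone)   # underlying storage type

# Class of every column at once.
col_classes <- sapply(airquality, class)
col_classes

# Compact overview of the whole data frame.
str(airquality)
```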
2008 Oct 20
2
calculating mean for samples
Hi everyone, does anyone know how I can calculate the mean for different samples? I.e., I have data like this:

     s1 s2 s3 s4
 1    0  0  0  1
 2    1  0  1  0
 3    0  0  0  0
 4    0  0  0  0
 5    0  1  0  1
 6    1  0  0  0
 7    0  0  0  0
 8    0  0  0  0
 9    0  0  0  0
10    0  0  0  1

I need to make 5 different sample with 5
2008 Nov 04
2
Zoo seems to be running slow in R 2.8.0 windows
R version 2.8.0 (2008-10-20) i386-pc-mingw32 locale: LC_COLLATE=English_United States.1252;LC_CTYPE=English_United States.1252;LC_MONETARY=English_United States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] StreamMetabolism_0.01 chron_2.3-24 zoo_1.5-4 loaded
2005 Jul 01
2
Simple indexing conundrum
My apologies in advance for my thickness but I can't seem to solve the following, seemingly simple, data manipulation problem: I have a data frame that contains multiple factors and multiple continuous response variables, but duplicates of some factor combinations. The duplicates contain bad data, so I would like to eliminate the duplicates. I would like to retain the entire rows
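A minimal sketch using duplicated() on the factor columns. The post is cut off before it says which rows should be retained, so two variants are shown; the data frame `d` and its columns are hypothetical.

```r
# Hypothetical data: two factors, one response, and one repeated
# factor combination (site B, treatment x).
d <- data.frame(
  site = factor(c("A", "A", "B", "B", "B")),
  trt  = factor(c("x", "y", "x", "x", "y")),
  resp = c(1.2, 3.4, 5.6, 99, 7.8)
)

key <- d[, c("site", "trt")]

# Variant 1: keep the first row of each factor combination.
first_only <- d[!duplicated(key), ]

# Variant 2: drop every row that belongs to a duplicated combination.
is_dup <- duplicated(key) | duplicated(key, fromLast = TRUE)
no_dups <- d[!is_dup, ]

first_only
no_dups
```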
2008 Mar 24
2
Newbie help with Sweave
I think I've gotten my Emacs/Sweave/R system set up correctly, thanks to Vincent and Jim, but I haven't been successful getting my first document produced. I'm trying to use one of Friedrich Leisch's examples, http://www.ci.tuwien.ac.at/~leisch/Sweave/example-1.Snw. I cut and pasted the text into a document sweaveexample.Rnw in Emacs. It seemed to be processed successfully with R:
2008 Sep 20
1
fitting a hyperbola
I have a data set of Gross Primary Productivity ~ Total Suspended Solids. It is a hyperbola, just like: plot(1/c(1:1000)). How do I model this relationship so that I can get all of the neat things that lm gives (residuals etc.), so that I can see if my eyeball model stands up? Thanks for any help, pointers, or good things to read. -- Stephen Sefick Research Scientist Southeastern
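One hedged possibility, assuming the relationship has the form y = a + b / x: the model is linear in 1/x, so lm() can fit it directly and all the usual lm tools apply. The data below are synthetic and the names `tss` and `gpp` are stand-ins for the poster's variables.

```r
set.seed(42)
tss <- seq(1, 100, length.out = 200)
# Synthetic response following y = 2 + 5 / x plus noise.
gpp <- 2 + 5 / tss + rnorm(length(tss), sd = 0.05)

# I() protects the arithmetic so 1 / tss is used as the predictor.
fit <- lm(gpp ~ I(1 / tss))

coef(fit)
head(residuals(fit))
```

If the curve does not fit this form, nls() with an explicit nonlinear formula would be the alternative to consider.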
2008 Aug 28
1
abline of an lm fit not correct
mac osx 10.5.4 R 2.7.1 I have fit a model d<-lm(y~x) with an R^2 of 0.963 but when I issue the command abline(d) the line is below where it ought to be. Looks like the right slope, but not the right intercept. thanks -- Stephen Sefick Research Scientist Southeastern Natural Sciences Academy Let's not spend our time and resources thinking about things that are so little or so large
2008 Oct 23
1
Reversing xlim qplot
I would like to be able to reverse the xlim on qplot this is the code that I am using qplot(a[,"River.Mile"], a[,26] ,ylab=colnames(a)[26], xlab="RiverMile", xlim=rev(c(60, 216)))+geom_smooth()+scale_x_continuous(breaks=c(215,202,198,190,185,179,148,119,61),
2008 Oct 29
1
Macro stuff to work on up through august 2007
Title says it all remember cast() with sum as the aggregation function -- Stephen Sefick Research Scientist Southeastern Natural Sciences Academy Let's not spend our time and resources thinking about things that are so little or so large that all they really do for us is puff us up and make us feel like gods. We are mammals, and have not exhausted the annoying little problems of being
2008 Nov 09
1
maptools sunrise sunset function
##This is a function that I am trying to write to calculate sunrise and sunset and works "mostly", but returns nonsensical values. What am I #missing? Thanks in advance. ###remember to include maptools as dependence### library(maptools) sunrise.set <- function(lat, long, date, timezone="UTC", num.days=1){ #this needs to be long lat# lat.long <- matrix(c(long, lat),
2008 Oct 22
1
plotting matrix
a <- c(1:25) b <- rnorm(25) e <- rnorm(25) f <- rnorm(25) g <- data.frame(b, e, a, f) I would like to plot a against all possibilities and then shoot it out to a PDF, one graph per page. I think it would be okay to have this as a lattice plot or a ggplot with many graphs per page. I can figure all of that out I think, but I need something like r <- as.matrix(g) plot(.~a, data=r)
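A minimal base-graphics sketch of the "one graph per page" idea: loop over the other columns and call plot() once per column on a pdf device. It assumes `a` has the same length as the other columns (25 values) so that data.frame() accepts them; `out_file` is a hypothetical output location.

```r
set.seed(1)
g <- data.frame(a = 1:25, b = rnorm(25), e = rnorm(25), f = rnorm(25))

out_file <- tempfile(fileext = ".pdf")  # hypothetical output path
pdf(out_file)
for (nm in setdiff(names(g), "a")) {
  # Each high-level plot() call on a pdf device starts a new page.
  plot(g$a, g[[nm]], xlab = "a", ylab = nm, main = paste(nm, "vs a"))
}
invisible(dev.off())

out_file
```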
2004 Sep 19
1
Nnet: Returning the response
Hello list. Maybe this is a simple question but I can't find the answer anywhere. With lm I use the parameter y=TRUE to have the response returned in $y. Of course, namely because of NA's in the data frame, this might not include all the values in the original column. For example: > data(airquality) > length(airquality$Day) [1] 153 > my.lm <- lm(Day~.,data=airquality,y=TRUE)
2017 Dec 19
1
lm considers removed predictors when finding complete cases
Dear R-devel list, I realized that removing a predictor in lm through the "-" operator in formula() does not affect the complete cases that are considered. A minimal example is: summary(lm(Wind ~ ., data = airquality)) # 42 observations deleted due to missingness summary(lm(Wind ~ . - Ozone, data = airquality)) # still 42 observations deleted due to missingness, even if only 7
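The behaviour reported above can be checked with nobs(). As a sketch of one possible workaround (not a fix proposed in the post), the unwanted column can be dropped from the data before fitting, so it never enters the model frame. The names `fit_all`, `fit_minus` and `fit_drop` are illustrative.

```r
data(airquality)

fit_all   <- lm(Wind ~ ., data = airquality)
fit_minus <- lm(Wind ~ . - Ozone, data = airquality)

# Workaround sketch: remove the column from the data itself.
fit_drop <- lm(Wind ~ ., data = airquality[, names(airquality) != "Ozone"])

# Number of observations actually used by each fit.
c(all = nobs(fit_all), minus = nobs(fit_minus), drop = nobs(fit_drop))
```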