thr3ads.net - search: "data

Displaying 20 results from an estimated 29 matches for "data_frame".

parallel max, min, and median of dataframe columns

2008 Apr 27

parallel max, min, and median of dataframe columns

...umns. I want to show the maximum, minimum, and median with a vertical line and a central dot (I'd use a boxplot, but with only three data points, that's overkill; I can't just use points, because of overlap and some of the other data plotted on the graph). This works: > boxplot(data_frame, col="transparent", border="dark grey", xaxt="n", add=TRUE) But these don't: > st <- c(1:19) > segments(st, pmax(data_frame), st, pmin(data_frame)) (no, I haven't tried to put in the median points yet) I can plot them individually with > segmen...

predict nbinomial glm

2005 Aug 16

predict nbinomial glm

Dear R-helpers, let us assume, that I have the following dataset: a <- rnbinom(200, 1, 0.5) b <- (1:200) c <- (30:229) d <- rep(c("q", "r", "s", "t"), rep(50,4)) data_frame <- data.frame(a,b,c,d) In a first step I run a glm.nb (full code is given at the end of this mail) and want to predict my response variable a. In a second step, I would like to run a glm.nb based on a subset of the data_frame. As soon as I want to predict the response variable a, I get the foll...

need help with stat functions(like adaboost, random forests and glm)

2008 Aug 13

need help with stat functions(like adaboost, random forests and glm)

Ok, so basically I have a dataframe named data_frame data_frame contains: startdate startprice endpricethreshold1 endpricethreshold2 endpricethreshold3 all of these endpricethresholds are true/false binary vectors. They are true or false depending on whether the endprice was above or below whatever the endpricethreshold is. now I want to try to...

No scientific notation in format

2006 Jan 24

No scientific notation in format

...0.07919450,0.02808281,0, 0.06958082 08,0,0,1.05600e+05,0.3930612, 0.0,0,0, 8.955513,0.08800232,0.28571429,0, 0.06691399 09,0,0,0.00000e+00,0.0000000,12.5,0,0,13.513589,0.05575315,0.00000000,0, 0.04875261 10,0,0,4.75889e-01,0.0000000, 0.0,0,0,10.084557,0.10251731,0.00000000,0, 0.15396753 I am using data_frame <- format(data_frame, trim = TRUE, digits = 3, nsmall = 4, width = 5) for format. And data_frame <- formatC(data_frame, digits = 3, width = 4, format = "f", mode = NULL) leads to the following error: Error in as.integer.default(floor(log10(abs(x + ifelse(x == 0, 1, 0))))) :...

empty files created with trellis xyplot jpeg device

2012 Jan 01

empty files created with trellis xyplot jpeg device

...t; <> 22376 Dec 31 16:20 isoranp-z1.zone_cpu.jpg -rw-rw-r-- 1 <> <> 18910 Dec 31 16:20 isoranp-z1.zone_mem.jpg -rw-rw-r-- 1 <> <> 0 Dec 31 16:20 isoranp-z1.zone_util.jpg # here is the gen_zone_charts function: > gen_zone_charts function(zone,year,wk){ data_frame<-get_zonedata_url(year,wk,zone) # this results in a 0 length file # i have tried using jpeg(), and trellis.device() with the same results #jpeg(file=paste("charts/",zone,".zone_util.jpg",sep="")) trellis.device(jpeg,file=paste("charts/",z...

Testing for membership in an array of strings

2009 Jul 02

Testing for membership in an array of strings

...contains a column with text values. What I am trying to do is use the subset function on the data frame to select only data for "sue" or "jane" (for example.) But maybe I have not taken the correct approach? So obviously I could do something like the following. subset( data_frame, name = "sue" | name == "jane", select = c(name, age, birthdate)) However, my subset needs to be much more than 2 and being lazy I do not want to type "| name == "some text" for each one. Is there an other way? Neil

accessing factor levels from C

2002 Mar 02

accessing factor levels from C

Hi, I am trying to get information about factors from a C-program. As I see, the factors are basically integers with attribute ,,levels''. But unfortunately I am not been able to read the levels information. I am using: SEXP variable, levels; ... variable = VECTOR_ELT( data_frame, j); switch( TYPEOF( variable)) { case INTSXP: if( isFactor( VECTOR_ELT( data_frame, j))) { fvalue = INTEGER( variable)[i]; levels = STR_PTR( GET_LEVELS( variable)); .... but the variable levels is not an array of strings as I expected. I am probably doing something in the wr...

Ggplot error

2017 Nov 08

Ggplot error

...rallel_3.4.2 survival_2.41-3 yaml_2.1.14 [66] colorspace_1.3-2 rvest_0.3.2 bindr_0.1 haven_1.1.0 > conflicts(detail = TRUE) $.GlobalEnv [1] "iris" $`package:dplyr` [1] "%>%" "%>%" "add_row" "as_data_frame" "as_tibble" "data_frame" [7] "data_frame_" "frame_data" "glimpse" "lst" "lst_" "tbl_sum" [13] "tibble" "tribble" "trunc_mat" "type_sum&...

Ggplot error

2017 Nov 08

Ggplot error

...14 > [66] colorspace_1.3-2 rvest_0.3.2 bindr_0.1 haven_1.1.0 > > > > conflicts(detail = TRUE) > $.GlobalEnv > [1] "iris" > > $`package:dplyr` > [1] "%>%" "%>%" "add_row" "as_data_frame" > "as_tibble" "data_frame" > [7] "data_frame_" "frame_data" "glimpse" "lst" > "lst_" "tbl_sum" > [13] "tibble" "tribble" "trunc_mat"...

R strings from C

2001 Dec 19

R strings from C

Hi, I am trying to study R internal behaviour. So long, I have not succeeded to access the value of R strings from C. I use: void salvesta_tabel( SEXP data_frame, SEXP file ) { printf( "nimi %d\n", (R_CHAR)( file)); } and from the R side: salvesta.tabel <- function (x, file = "") { .Call( "salvesta_tabel", x, file) } When calling from R as salvesta.tabel( x, "file") R always crashes while...

Weird differing results when using the Wilcoxon-test

2010 Aug 17

Weird differing results when using the Wilcoxon-test

...ly, when ties are involved 2) wilcox_test{coin}, which does calculate the distribution _exactly_ even, with ties. I have the following scenario: #---BeginCode--- # big example size = 60 big1 = rnorm(size, 0, 1) big2 = rnorm(size, 0.5, 1 g1f = rep(1, size) g2f = rep(2, size) big = c(big1, big2) data_frame = data.frame(big, gr=as.factor(c(g1f, g2f))) wilcox_approx = wilcox.test(big1, big2) wilcox_exact = wilcox_test(big ~ gr, data=data_frame, distribution="exact") #---EndCode--- I found here http://www-stat.stanford.edu/~susan/courses/s141/hononpara.pdf that wilcox.test (at least for the...

Show representation of a data structure

2009 Jul 21

Show representation of a data structure

I'm currently working with some large complex data structures eg list of lists of data_frames containing lots more variables and lists etc. Sometimes, I'd like to be able to bring up a simple representation of the structure I'm working with, minus all of the values it contains (so simply printing the variable doesn't work as its too hard to see structure when there are 1000s of...

Ggplot error

2017 Nov 08

Ggplot error

...2.1.14 > [66] colorspace_1.3-2? rvest_0.3.2? ? ? ? bindr_0.1? ? ? ? ? haven_1.1.0 > > > > conflicts(detail = TRUE) > $.GlobalEnv > [1] "iris" > > $`package:dplyr` >? [1] "%>%"? ? ? ? ? "%>%"? ? ? ? ? "add_row"? ? ? "as_data_frame" > "as_tibble"? ? "data_frame" >? [7] "data_frame_"? "frame_data"? ? "glimpse"? ? ? "lst" >? "lst_"? ? ? ? ? "tbl_sum" > [13] "tibble"? ? ? ? "tribble"? ? ? "trunc_mat"? ?...

Refactor all factors in a data frame

2007 Jun 05

Refactor all factors in a data frame

...levels of the factors seem to be the same as in the original data frames, probably because subset() calls [.factor without drop = TRUE (that's what I gather from scanning the mailing lists). I wonder if there is a easy way to refactor all factors in the data frame at once. I noted that fix(data_frame) does the trick, however, this needs user interaction, which I'd like to avoid. Subsequent write.table / read.table would be another option but I'm not sure if R can guess the factor/char/numeric-type correctly when reading the table. So, is there any way in drop the unused factor level...

Ggplot error

2017 Nov 08

Ggplot error

...2.1.14 > [66] colorspace_1.3-2 rvest_0.3.2 bindr_0.1 haven_1.1.0 > > > > conflicts(detail = TRUE) > $.GlobalEnv > [1] "iris" > > $`package:dplyr` > [1] "%>%" "%>%" "add_row" "as_data_frame" > "as_tibble" "data_frame" > [7] "data_frame_" "frame_data" "glimpse" "lst" > "lst_" "tbl_sum" > [13] "tibble" "tribble" "trunc_mat"...

Adding predicted values as a new variable in a data frame

2006 Sep 14

Adding predicted values as a new variable in a data frame

I am running a regression: ols.reg1 <- lm(y ~ x1 + x2 + x3 + x4) on a data.frame and then generating fitted values: y.hat <- ols.reg1$fitted.values Then I would like to add these fitted values to the data.frame as a new variable. The problem is that when the values are predicted the resulting output has too few rows. for some reason certian observations do not get predicted values. So

Download data from NASA for multiple locations - RCurl

2017 Oct 16

Download data from NASA for multiple locations - RCurl

I have done the following using readLines directory <- "~/" files <- list.files(directory) data_frames <- vector("list", length(files)) for (i in seq_along(files)) { df <- readLines(file.path(directory, files[i])) df <- df[-(1:13)] df <- data.frame(year = substr(df,1,4), month = substr(df, 6,7), day = substr(df, 9, 10),...

Dataframe Manipulation

2017 Sep 04

Dataframe Manipulation

...imilar, and the logic is identical: > > library(readr) > library(dplyr) > library(stringr) > library(tidyr) > > data_help <- read_csv("data_help.csv") > cat_help <- read_csv("cat_help.csv") > > # Helper function to split the Items and create a data_frame > split_items <- function(items){ > x <- items$Items_purchased_on_Receipts %>% > str_split(pattern = ",") %>% > unlist(use.names = FALSE) > > data_frame(Item = x, Purchase_ID = items$Purchase_ID) > } > > data_help <- > data_help %...

Dataframe Manipulation

2017 Aug 31

Dataframe Manipulation

Hi Hemant, the solution is really quite similar, and the logic is identical: library(readr) library(dplyr) library(stringr) library(tidyr) data_help <- read_csv("data_help.csv") cat_help <- read_csv("cat_help.csv") # Helper function to split the Items and create a data_frame split_items <- function(items){ x <- items$Items_purchased_on_Receipts %>% str_split(pattern = ",") %>% unlist(use.names = FALSE) data_frame(Item = x, Purchase_ID = items$Purchase_ID) } data_help <- data_help %>% mutate(Purchase_ID = 1:n()) %>% gro...

Dataframe Manipulation

2017 Aug 30

Dataframe Manipulation

by using these two tables we have to create third table in this format where categories will be on the top and transaction will be in the rows, On 30 August 2017 at 16:42, Hemant Sain <hemantsain55 at gmail.com> wrote: > Hello Ulrik, > Can you please once check this code again on the following data set > because it doesn't giving same output to me due to absence of quantity,a

search for: data_frame