thr3ads.net - similar to: "How to 'extend' a data.frame based on given variable combinations ?"

Displaying 20 results from an estimated 10000 matches similar to: "How to 'extend' a data.frame based on given variable combinations ?"

How to apply a function to subsets of a data frame *and* obtain a data frame again?

2011 Aug 17

Dear all, First, let's create some data to play around:
    set.seed(1)
    (df <- data.frame(Group=rep(c("Group1","Group2","Group3"), each=10), Value=c(rexp(10, 1), rexp(10, 4), rexp(10, 10)))[sample(1:30,30),])
    ## Now we need the empirical distribution function:
    edf <- function(x) ecdf(x)(x) # empirical distribution function evaluated at x
    ##
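One possible way to get a data frame back is sketched here, reusing the data and the edf() function from the post. The split/lapply/rbind pattern and the column name Edf are assumptions for illustration, not necessarily what the thread settled on. The result is ordered by group rather than in the original row order.

```r
set.seed(1)
df <- data.frame(Group = rep(c("Group1", "Group2", "Group3"), each = 10),
                 Value = c(rexp(10, 1), rexp(10, 4), rexp(10, 10)))[sample(1:30, 30), ]
edf <- function(x) ecdf(x)(x)  # empirical distribution function evaluated at x

# Split the data frame by group, add the per-group result as a new column
# (the name 'Edf' is hypothetical), then bind the pieces back together.
pieces <- lapply(split(df, df$Group), function(d) {
  d$Edf <- edf(d$Value)
  d
})
res <- do.call(rbind, pieces)  # a data frame again, rows grouped by Group
```

If the original row order matters, ave(df$Value, df$Group, FUN = edf) returns the same values as a vector aligned with the rows of df.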

How to convert the output of tapply() so that it has the same order as the input?

2012 Sep 15

Hi, I am trying to apply a function to subsets of a data.frame. tapply() does the job, but as output I am looking for a vector (not an array/matrix) ordered in the same way as the original data, so I can simply cbind the result to the original data.frame. Below is a minimal example. I know that there are packages that can do these things more easily, but I'm looking for a fast solution not
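A minimal sketch of one base-R way to get a result aligned with the input rows. The toy data and the names df, Group and Value are assumptions for illustration, not the poster's example.

```r
# Toy data (hypothetical)
df <- data.frame(Group = c("b", "a", "b", "a", "c"),
                 Value = c(2, 4, 6, 8, 10))

# tapply() returns one value per group, ordered by group level
group_means <- tapply(df$Value, df$Group, mean)

# ave() applies FUN within each group and returns a vector
# in the same order as the input rows
df$GroupMean <- ave(df$Value, df$Group, FUN = mean)

# The same result from the tapply() output, by indexing it with the group names
df$GroupMean2 <- as.vector(group_means[as.character(df$Group)])
```

Both new columns line up with the original rows, so no reordering or merging is needed afterwards.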

bwplot [lattice]: how to get different y-axis scales for each row?

2011 Mar 26

Dear expeRts, How can I get ... (1) different y-axis scales for each row (2) while having the same y-axis scales for different columns? I couldn't manage to do this with relation="free" [which gives (1) but not (2)]. I also tried relation="sliced", but it did not give the same y-axis scales within each row (see the fourth row). Further, it "separates" the

coxph weirdness

2012 Jul 26

Hi all, I can't wrap my head around an error from the coxph function (package survival). Here's an example:
    library(survival)
    n = 100; set.seed(1); time = rexp(n); event = sample(c(0,1), n, replace = TRUE)
    covar = data.frame(z = rnorm(n)); model = coxph(Surv(time, event)~ . , data = covar)
R gives the following error:
    > model = coxph(Surv(time, event)~ . , data = covar)
    Error in

aggregate(), tapply(): Why is the order of the grouping variables not kept?

2013 Mar 11

Dear expeRts, The question is rather simple: Why does aggregate (or similarly tapply()) not keep the order of the grouping variable(s)? Here is an example:
    x <- data.frame(group = rep(LETTERS[1:2], each=10), year = rep(rep(2001:2005, each=2), 2), value = rep(1:10, each=2)) ## => sorted according to group, then year
    aggregate(value ~ group + year, data=x,

bwplot: how to get plotmath labels?

2011 Mar 26

Dear expeRts, How can I get plotmath-labels in the bwplot below? As you can see, I couldn't manage to pass the expressions through the dimnames argument. Cheers, Marius
    library(lattice)
    ## data
    dim <- c(100, 6, 2, 3)
    dimnames <- list(n=paste("n=", seq_len(100), sep=""), groups=paste("group=", seq_len(6), sep=""),

Seed in 'parallel' vignette

2015 Feb 03

Hi, This is most likely only a minor technicality, but I saw the following: On page 6 of the 'parallel' vignette (http://stat.ethz.ch/R-manual/R-devel/library/parallel/doc/parallel.pdf), the random-number generator "L'Ecuyer-CMRG" is said to have seed "(x_n, x_{n-1}, x_{n-2}, y_n, y_{n-1}, y_{n-2})". However, in L'Ecuyer et al. (2002), the seed is given with

Why is format(10000, big.mark = "\\,") not 10\,000?

2010 Dec 30

Hi, why does format(10000, big.mark = "\\,") not give me "10\,000"? How can I get this kind of "big.mark"? Cheers, Marius

Quiz: Who finds the nicest form of X_1^\prime?

2011 Apr 06

Dear expeRts, I would like to create a plotmath-label of the form X_1^\prime. Here is how to *not* do it [not nicely aligned symbols]:
    plot(0,0,main=expression(italic(X*minute[1])))
    plot(0,0,main=expression(italic(X[1]*minute)))
    plot(0,0,main=expression(italic(X)[1]*minute))
Any suggestions? Cheers, Marius

How to efficiently compare each row in a matrix with each row in another matrix?

2012 Dec 08

Dear expeRts, I have two matrices A and B. They have the same number of columns but possibly different numbers of rows. I would like to compare each row of A with each row of B and check whether all entries in a row of A are less than or equal to all entries in a row of B. Here is a minimal working example:
    A <- rbind(matrix(1:4, ncol=2, byrow=TRUE), c(6, 2)) # (3, 2) matrix
    B <-
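A rough sketch of one way to do the all-pairs comparison in base R. A is taken from the post; the post's definition of B is cut off, so the B used here is hypothetical. The comparison is read componentwise (A[i, k] <= B[j, k] for every column k), which is also an assumption.

```r
A <- rbind(matrix(1:4, ncol = 2, byrow = TRUE), c(6, 2))  # (3, 2) matrix, from the post
B <- matrix(c(2, 3, 5, 5), ncol = 2, byrow = TRUE)        # (2, 2) matrix, hypothetical

# leq[i, j] is TRUE if every entry of row i of A is <= the
# corresponding entry of row j of B
leq <- outer(seq_len(nrow(A)), seq_len(nrow(B)),
             Vectorize(function(i, j) all(A[i, ] <= B[j, ])))
```

The result is an nrow(A) by nrow(B) logical matrix. Vectorize() keeps the code short but still loops over all pairs, so it is a readable baseline rather than the fastest option for large matrices.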

R process killed when allocating too large matrix (Mac OS X)

2016 May 05

Hi Simon, thanks for your quick reply. 1) ... so you can reproduce this? 2) Do you know a way how this can be 'foreseen'? We allocate larger matrices in the copula package depending on the user's input dimension. It would be good to tell her/him "Your dimension is quite large. Be aware of killers in your neighborhood"... before the killer attacks. Thanks & cheers,

How to colorize the panel backgrounds of pairs()?

2012 Mar 01

Dear expeRts, I would like to colorize the backgrounds of a pairs plot according to the respective panel number. Here is what I tried (without success):
    count <- 0
    mypanel <- function(x, y, ...){
        count <<- count+1
        bg. <- if(count %in% c(1,4,9,12)) "#FDFF65" else NA
        points(x, y, cex=0.5, bg=bg)
    }
    U <- matrix(runif(4*500), ncol=4)
    pairs(U, panel=mypanel)
I

lattice splom: how to adjust space between tick marks and tick labels?

2010 Dec 26

Dear expeRts, how can I decrease the space between the tick marks and the corresponding labels in an splom? See here:
    library(lattice)
    U <- matrix(runif(4000), ncol = 8)
    splom(U, axis.text.cex = 0.2) # => space between the [small] tick labels and tick marks is/seems to be too large
I checked ?panel.pairs but could not find an option for that. Cheers, Marius

parallel::detectCores(TRUE) gives: Error in system(cmd, TRUE) : error in running command

2014 Aug 22

Hi, Both under the current R-devel (r66456) and a version from about 3 months ago, I experience the following behavior:
    > parallel::detectCores(TRUE)
    Error in system(cmd, TRUE) : error in running command
    > traceback()
    3: system(cmd, TRUE)
    2: gsub("^ +", "", system(cmd, TRUE)[1])
    1: parallel::detectCores(TRUE)
    >
This is on Ubuntu 14.04. Does anybody else see this? [I

How to construct a valid seed for l'Ecuyer's method with given .Random.seed?

2013 Jan 23

Dear expeRts, I struggle with the following problem using snow clusters for parallel computing: I would like to specify l'Ecuyer's random number generator. Base R creates a .Random.seed of length 7, the first value indicating the kind of random number generator. I would thus like to use the components 2 to 7 as the seed for l'Ecuyer's random number generator. By doing so, I
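For orientation, a small sketch of what the length-7 .Random.seed looks like in base R once L'Ecuyer's generator is selected, and how the 'parallel' package advances it to the next stream. The variable names are illustrative. Whether components 2 to 7 are accepted as a seed by a particular cluster setup function is not settled by this sketch.

```r
# Select L'Ecuyer's generator and seed it; .Random.seed then has length 7
RNGkind("L'Ecuyer-CMRG")
set.seed(1)
seed <- .Random.seed

length(seed)       # first entry encodes the RNG kind, the other six are the state
state <- seed[-1]  # components 2 to 7

# parallel::nextRNGStream() takes the full length-7 vector and
# returns the seed of the next independent stream
library(parallel)
next_seed <- nextRNGStream(seed)
```

Passing the whole length-7 vector to nextRNGStream() avoids having to assemble a seed by hand from the six state components.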

R process killed when allocating too large matrix (Mac OS X)

2016 May 05

Hi, Interesting "feature" in 10.11.4. I wonder if the process is killed before or after malloc() returns. If before, it seems very blunt: "You're asking too much and I don't like it so I kill you now". If after it doesn't look much better: "You're asking a lot and I don't like it but I give it to you anyway. I'll kill you quickly later". Why

Expressions from boxplot() passed to bxp()

2020 Mar 27

Hi, Is this expected behavior (R-3.6.0)?
    dat <- cbind(x = 1:10, y = 10:1)
    ylab <- substitute(X[t], list(t = 2))
    plot(dat, ylab = ylab) # works (correctly displays ylab)
    boxplot(dat, ylab = ylab) # fails
    boxplot(dat, ylab = as.expression(ylab)) # works
Thanks & cheers, M

How to set an argument such that a function treats it as missing?

2010 Nov 13

Dear expeRts, I would like to call a function f from a function g with or without an argument. I use missing() to check if the argument is given. If it is not given, can I set it to anything such that the following function call (to f) behaves as if the argument isn't given? It's probably best described by a minimal example (see below). The reason why I want to do this is, that I do

splom, plotmath: how to add three lines of information with alignment?

2011 Apr 18

Dear expeRts, I would like to create a scatter plot matrix with splom(). The lower panel should contain some additional information about the samples shown in the upper panel plot, see the splom() call below. Now two questions came up: (1) The lower panels show "tau" and "alpha" on top of each other. How can I plot *three* expressions on top of each other? I tried several

Best way/practice to create a new data frame from two given ones with last column computed from the two data frames?

2011 Aug 18

Dear expeRts, What is the best approach to create a third data frame from two given ones, when the new/third data frame has its last column computed from the last columns of the two given data frames?
    ## Okay, sounds complicated, so here is an example. Assume we have the two data frames:
    df1 <- data.frame(Year=rep(2001:2010, each=2), Group=c("Group 1","Group 2"), Value=1:20)
