Hello,
I?m using the plyr package to add a variable named "bin" to my
original
data frame "df" with a user-defined function "create_bins".
I'd like to get
similar results using dplyr instead, but failing to do so. set.seed(4)df <-
data.frame(pred = rnorm(100), models = gl(2, 50, 100, labels =
c("model1",
"model2")))
### Using plyr (works fine)create_bins <- function(x, nBins){ Breaks <-
unique(quantile(x$pred, probs = seq(0, 1, 1/nBins))) dfB <-
data.frame(pred = x$pred, bin = cut(x$pred, breaks Breaks,
include.lowest = TRUE)) dfB} nBins = 10res_plyr <- plyr::ddply(df,
plyr::.(models), create_bins, nBins)head(res_plyr) ### Using dplyr
(fails) by_group <- dplyr::group_by(df, models)res_dplyr <-
dplyr::summarize(by_group, create_bins, nBins)Error: not a vector Any help
would be much appreciated. Best,Axel.
[[alternative HTML version deleted]]