Udaya B. Kogalur
2019-Apr-22 18:53 UTC
[R] [R-pkgs] randomForestSRC 2.9.0 is now available
Dear useRs: It's been some time since we last sent out an announcement, so this one will cover more than just the last update. The latest release of randomForestSRC is now available on CRAN at: https://CRAN.R-project.org/package=randomForestSRC The GitHub repository, through which we prefer to receive bug reports, is at: https://github.com/kogalur/randomForestSRC If you do find issues, please use: https://github.com/kogalur/randomForestSRC/issues and take the time to post a minimal script (and data set if necessary) that isolates the error. Additional documentation can be found at: https://kogalur.github.io/randomForestSRC/ ------------------------------------------ Details are as follows: Ensembles in regression now support Greenwald-Khanna approximate quantile queries via rfsrc(), predict.rfsrc() and the new wrapper quantileReg.rfsrc(). Related to this, a new split rule "quantile.regr" has been added. Another new wrapper, imbalanced.rfsrc(), implements various solutions to the two-class imbalanced problem, including the newly proposed quantile-classifier approach of O'Brien and Ishwaran (2017). This also includes Breiman's balanced random forests under-sampling of the majority class. Performance is assessed using the G-mean, but misclassification error can be requested. Also, the new parameter get.tree in predict.rfsrc() allows users to extract the ensembles for a single tree or subset of trees over the forest. The default nodesize for survival and competing risk has been changed to 15. We've added new splitrules "auc" and "entropy" for classification. A new variable importance methodology called Holdout VIMP has been implemented. Here, we exclude a variable from a subset of trees and compare the error rates between those trees in which the variables was included against those in which it was excluded. The key point here is that no permutation of a variable is conducted. See holdout.vimp.rfsrc() and the associated Rd file for more information. Finally, some function names were changed as a general move towards name uniformity in the package. Sorry about that. ------------------------------------------ Additional side-notes: Unfortunately there has been no further work on the Spark build. However, the Java wrappers have been kept up to date, and the Hello World script is still functional. Instructions are provided here: https://kogalur.github.io/randomForestSRC/building.html For those requesting and still awaiting CPU performance enhancements, our continued apologies. Our focus has been methodological, but our intention is to address performance in the next build. We promise. Thank you! Udaya B. Kogalur, Ph.D. ubkogalur at gmail.com [[alternative HTML version deleted]] _______________________________________________ R-packages mailing list R-packages at r-project.org https://stat.ethz.ch/mailman/listinfo/r-packages
Apparently Analagous Threads
- randomForestSRC 2.9.0 is now available
- Multivariate random forests in R - how to obtain variance explained for multiple responses in randomForestSRC package - or other package
- randomForestSRC 2.5.1 and OpenMP
- Release 3.2.0 of randomSurvivalForest is now availablle
- Release 3.2.0 of randomSurvivalForest is now availablle