I am new to R and I'd like to use the randomForest package for my thesis (identifying important variables for more detailed analysis with other software). I have found extremely well written and helpful information on the usage of R. Unfortunately it seems to be very difficult to find similarly detailed tutorials for randomForest, and I just can't get it work with the information on http://rss.acs.unt.edu/Rdoc/library/randomForest/html/randomForest.html Can anybody give me a hint where to find a more detailed tutorial? I am especially interested in preparation of data to use with randomForest (how to structure the csv-file) and which information is mandatory for the package to work. Maybe someone has a simple copy-paste example to share and experiment? Thank you for your attention, Dirk -- --------------------------------------- Dirk Nemitz Zeppelinstr. 11a 37083 Göttingen Germany Tel: +49 (0)551 492 32 51 Mobil: +49 (0)175 709 31 92 --------------------------------------- [[alternative HTML version deleted]]
Dirk, Check out Andy Liaw/Mat Wiener's article in RNews 2002-3 www.r-project.org/doc/Rnews/Rnews_2002-3.pdf Also my decks at www.porzak.com/JimArchive : 1. http://www.porzak.com/JimArchive/JimPorzak_CIwithR_useR2006_tutorial.pdf 2nd part is on randomForest 2. http://www.porzak.com/JimArchive/JimPorzak_RFwithR_DMAAC_Jan07_webinar.pdf is all on randomForest The latter two focus on the use of randomForest in marketing & on the binary response problem For the variable selection problem you mention, check out Ramon Diaz-Uriate's package varSelRF on your local CRAN. HTH, Jim Porzak Responsys, Inc. San Francisco, CA http://www.linkedin.com/in/jimporzak On Tue, Jul 22, 2008 at 4:58 AM, Dirk Nemitz <dirk.nemitz at gmail.com> wrote:> > I am new to R and I'd like to use the randomForest package for my thesis > (identifying important variables for more detailed analysis with other > software). I have found extremely well written and helpful information on > the usage of R. > > Unfortunately it seems to be very difficult to find similarly detailed > tutorials for randomForest, and I just can't get it work with the > information on > http://rss.acs.unt.edu/Rdoc/library/randomForest/html/randomForest.html > > Can anybody give me a hint where to find a more detailed tutorial? I am > especially interested in preparation of data to use with randomForest (how > to structure the csv-file) and which information is mandatory for the > package to work. Maybe someone has a simple copy-paste example to share and > experiment? > > Thank you for your attention, > > Dirk > > -- > --------------------------------------- > Dirk Nemitz > Zeppelinstr. 11a > 37083 G?ttingen > Germany > > Tel: +49 (0)551 492 32 51 > Mobil: +49 (0)175 709 31 92 > --------------------------------------- > > [[alternative HTML version deleted]] > > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >
The help pages for 'randomForest' are quite clear in stating how to use the 'randomForest()' function. Input is a data frame (not a csv-file) and a formula involving attribute names. There is a kind of tutorial for classification and clustering with Random Forests on Leo Breiman's web page <http://www.stat.berkeley.edu/users/breiman/RandomForests/cc_home.htm>. It describes the older code on which the R package is built, not the implementation of RandomForest from Salford systems, so more or less what you want. And there is another implementation of random forests in the 'party' package, see the 'cforest()' function. (The package varSelRF has already been mentioned in a previous post.) Hans Werner Borchers ABB Corporate Research Dirk Nemitz wrote:> > I am new to R and I'd like to use the randomForest package for my thesis > (identifying important variables for more detailed analysis with other > software). I have found extremely well written and helpful information on > the usage of R. > > Unfortunately it seems to be very difficult to find similarly detailed > tutorials for randomForest, and I just can't get it work with the > information on > http://rss.acs.unt.edu/Rdoc/library/randomForest/html/randomForest.html > > Can anybody give me a hint where to find a more detailed tutorial? I am > especially interested in preparation of data to use with randomForest (how > to structure the csv-file) and which information is mandatory for the > package to work. Maybe someone has a simple copy-paste example to share > and > experiment? > > Thank you for your attention, > > Dirk > > --------------------------------------- > Dirk Nemitz > Zeppelinstr. 11a > 37083 G?ttingen > Germany > > Tel: +49 (0)551 492 32 51 > Mobil: +49 (0)175 709 31 92 > --------------------------------------- > >-- View this message in context: http://www.nabble.com/randomForest-Tutorial-tp18588688p18594872.html Sent from the R help mailing list archive at Nabble.com.