similar to: sums of absolute deviations about the median as split function in rpart

Displaying 20 results from an estimated 1000 matches similar to: "sums of absolute deviations about the median as split function in rpart"

2004 Jun 04
1
rpart
Hello everyone, I'm a newbie to R and to CART so I hope my questions don't seem too stupid. 1.) My first question concerns the rpart() method. Which method does rpart use in order to get the best split - entropy impurity, Bayes error (min. error) or Gini index? Is there a way to make it use the entropy impurity? The second and third question concern the output of the printcp() function.
2006 Mar 08
1
function gdist, dist and vegdist in mvpart
Dear R community, I am analyzing plant communities with the function mvpart, using a dissimilarit matrix as input. The matrix is calculated with the funtion gdist. fit <- mvpart(gdist (ba12[,18:29], meth="maximum", full=TRUE, sq=F) ~ beers + slope_dem + elev_dem+ plc_dem + pr_curv+ +curv+max_depth+doc_rocks+ abandon+land_use+ca_old, data=ba12, xv="p") This
2011 Jun 21
0
How does rpart computes "improve" for split="information"?? (which seems to be different then the "gini" case)
Hello dear R-help members, I would appreciate any help in understanding how the rpart function computes the "improve" (which is given in fit$split) when using the split='information' parameter. Thanks to Professor Atkinson help, I was able to find how this is done in the case that split='gini'. By following the explanation here:
2011 Jan 23
1
Offset - usersplits function package RPART
Hi, I would like write a split function to implement a new split method with the package RPART. I see that I can define my split function as specified in the example of usersplits function, but I don't understand how I can use the variable "offsets". What is the meaning of these variable? Thank's Michela
2009 May 14
0
Rpart - user defined split functions
Dear all, I'm writing my own method to be used in Rpart by defining the list of functions named init, split and eval. I'm following the example given in the file 'tests/usersplits.R' in the sources. By now I'm able to define the split function (and it works correctly in the tree construction) while I have some problems with the init and the eval function. The task I'm
2001 May 22
1
Surrogate splits for decision trees
Dear R, Short verse of the question: Is there R code which will calculate surrogate splits and/or delta impurity for decision trees at each node? Long Version: I have local, legacy code which I use to calculate my decision trees. I would like to switch to R, but as I understand it surrogate splits are not implemented. Surrogate splits and feature ranking are described in Breiman et al
2011 Jun 13
1
In rpart, how is "improve" calculated? (in the "class" case)
Hi all, I apologies in advance if I am missing something very simple here, but since I failed at resolving this myself, I'm sending this question to the list. I would appreciate any help in understanding how the rpart function is (exactly) computing the "improve" (which is given in fit$split), and how it differs when using the split='information' vs split='gini'
2004 Jul 05
1
how to personalize split function in rpart
Hallo! I am a student of the Politecnico di Milano (Milan, italy) and I'm working on CARTs. I'm trying to use the R rpart function with a personalized splitfunction... but I'm not able to do it! More precisely, I would like to know what is the meaning of the function 'init', 'split' and 'eval' named in the help page.I can't find any answer in
2008 Jul 03
1
cross-validation in rpart
Hello list, I'm having a problem with custom functions in rpart, and before I tear my hair out trying to fix it, I want to make sure it's actually a problem. It seems that, when you write custom functions for rpart (init, split and eval) then rpart no longer cross-validates the resulting tree to return errors. A simple test is to use the usersplits.R function to get a simple, custom
2009 Apr 09
0
Connecting to "heterogenous" cluster using makeSOCKcluster of SNOW-package
When trying to create (a very simple) cluster using makeSOCKcluster or makeCluster I get the following error message: > library(snow) > cl <- makeSOCKcluster("localhost") Fatal error: kann Datei '~/R/i686-suse-linux-gnu-library/2.8/snow/RSOCKnode.R' nicht ?ffnen: Datei oder Verzeichnis nicht gefunden (translation: could not open file [...]: File or directory not
2009 May 26
0
cross-validation in rpart
Dear R users, I know cross-validation does not work in rpart with user defined split functions. As Terry Therneau suggested, one can use the xpred.rpart function and then summarize the matrix of the predicted values into a single "goodness" value. I need only a confirmation: set for example xval=10, if I correctly understood a single column of the matrix obatined by xpred.rpart gives
2006 Nov 20
0
rpart
Dear r-help-list: I' got a question about the computation of the improve of a split. The following is an extract of an output of the summary of a tree: Node number 1: 600 observations, complexity param=0.007272727 predicted class=0 expected loss=0.1666667 class counts: 500 100 probabilities: 0.833 0.167 left son=2 (211 obs) right son=3 (389 obs) Primary splits: x4
2007 Jan 03
1
User defined split function in Rpart
Dear all, I'm trying to manage with user defined split function in rpart (file rpart\tests\usersplits.R in http://cran.r-project.org/src/contrib/rpart_3.1-34.tar.gz - see bottom of the email). Suppose to have the following data.frame (note that x's values are already sorted) > D y x 1 7 0.428 2 3 0.876 3 1 1.467 4 6 1.492 5 3 1.703 6 4 2.406 7 8 2.628 8 6 2.879 9 5 3.025 10 3 3.494
2004 Feb 17
0
New package -- mvpart
The package mvpart is now available. mvpart includes partitioning based on (1) multivariate numeric responses and (2) dissimilarity matrices. The package mvpart is a modification of rpart -- -- authors of original: Terry M Therneau and Beth Atkinson <atkinson at mayo.edu>, and R port of rpart Brian Ripley <ripley at stats.ox.ac.uk>. Includes some modified routines from vegan --
2004 Feb 17
0
New package -- mvpart
The package mvpart is now available. mvpart includes partitioning based on (1) multivariate numeric responses and (2) dissimilarity matrices. The package mvpart is a modification of rpart -- -- authors of original: Terry M Therneau and Beth Atkinson <atkinson at mayo.edu>, and R port of rpart Brian Ripley <ripley at stats.ox.ac.uk>. Includes some modified routines from vegan --
2009 Jun 16
0
Connecting to "heterogenous" cluster using makeSOCKcluster of SNOW-package
Thanks to Luke Tiernay and some experimenting I found out some issues. I don't claim this infomation is complete, but it may be helpful for anyone experimenting with SNOW on Linux: - environment variables PATH and R_SNOW_LIB need to be set on master and slaves. (manually or permanent in ~/.bashrc (adjust your pathnames accordingly). Don't use ~ for home directory, instead specify
2006 Jun 20
0
FW: multivariate splits
-----Original Message----- From: Vayssi?res, Marc Sent: Tuesday, June 20, 2006 9:35 AM To: 'r-help-bounces at stat.math.ethz.ch' Subject: RE: [R] multivariate splits Glen De'ath's package for R is on cran! It is called mvpart, see: http://cran.cnr.berkeley.edu/doc/packages/mvpart.pdf Cheers, Marc Vayssi?res -----Original Message----- From: r-help-bounces at stat.math.ethz.ch
2005 Aug 26
1
Help in Compliling user -defined functions in Rpart
I have been trying to write my own user defined function in Rpart.I imitated the anova splitting rule which is given as an example.In the work I am doing ,I am calculating the concentration index(ci) ,which is in between -1 and +1.So my deviance is given by abs(ci)*(1-abs(ci)).Now when I run rpart incorporating this user defined function i get the following error message: Error in
2009 Mar 20
1
CCA - manual selection
Hello, I am trying to obtain f-values for response (independent) variables from a CCA performed in vegan package, to see which ones of them have significative influence in my dependent variables (like the manual selection in canoco), but I can't find any function (or package) that do such a thing. The dependents variables are species data, and the independents are ambiental data. Than you.
2007 Sep 07
3
Fails to restart mongrel cluster after Capistrano rollback (PIDs not deleted)
Hi, I am deploying a Rails website using Capistrano to a VPS running the site through Apache and Mongrel. ''cap deploy'' works fine, and mongrels are restarted. When I execute ''cap deploy:rollback'', however, my script fails to restart the mongrels giving this error: !!! PID file tmp/pids/mongrel.5000.pid already exists. Mongrel could be running already.