similar to: rpart question on loss matrix

Displaying 20 results from an estimated 4000 matches similar to: "rpart question on loss matrix"

2010 Apr 11
0
plotting rpart objects, text.rpart, fancy option
I have created plots of rpart objects with the fancy option for text.rpart ("fancy" creates ellipses and rectangles and labels branches with splitting criteria). The ellipses and rectangles are supposed to "interrupt" the tree lines (as seen in Therneau and Atkinson 1997, page 48, Fig. 18, http://www.mayo.edu/hsr/techrpt/61.pdf), but they do not, even when I use Therneau and
2006 Sep 25
2
rpart
Dear r-help-list: If I use the rpart method like cfit<-rpart(y~.,data=data,...), what kind of tree is stored in cfit? Is it right that this tree is not pruned at all, that it is the full tree? If so, it's up to me to choose a subtree by using the printcp method. In the technical report from Atkinson and Therneau "An Introduction to recursive partitioning using the rpart
2004 Jul 05
1
how to personalize split function in rpart
Hallo! I am a student of the Politecnico di Milano (Milan, italy) and I'm working on CARTs. I'm trying to use the R rpart function with a personalized splitfunction... but I'm not able to do it! More precisely, I would like to know what is the meaning of the function 'init', 'split' and 'eval' named in the help page.I can't find any answer in
2004 Jun 04
1
rpart
Hello everyone, I'm a newbie to R and to CART so I hope my questions don't seem too stupid. 1.) My first question concerns the rpart() method. Which method does rpart use in order to get the best split - entropy impurity, Bayes error (min. error) or Gini index? Is there a way to make it use the entropy impurity? The second and third question concern the output of the printcp() function.
2003 Dec 19
1
Question re labels in r-part (continuation of a thread from a while back)
Hello again I have modeled a tree using rpart, with the DV being a log transformation of the variable I am really interested in (I transformed the DV due to extreme skewness). By default, text.rpart labels the nodes with the value of yval, which in this case is not what I want; I'd like the labels to be on the original metric, but label in text.rpart requires a "column name of
2003 Jul 15
1
Tree question
I was under the impression that the tree method (e.g. as implemented in rpart) was insensitive to monotonic transformations of the dependent variable. e.g. Breiman Olshen et al. Classification and Regression Trees state "In a standard data structure [a tree] is invariant under all monotone transformations of individual ordered varaibles" (p. 57) However, I get very different results
2009 May 14
0
Rpart - user defined split functions
Dear all, I'm writing my own method to be used in Rpart by defining the list of functions named init, split and eval. I'm following the example given in the file 'tests/usersplits.R' in the sources. By now I'm able to define the split function (and it works correctly in the tree construction) while I have some problems with the init and the eval function. The task I'm
2011 Jun 21
0
How does rpart computes "improve" for split="information"?? (which seems to be different then the "gini" case)
Hello dear R-help members, I would appreciate any help in understanding how the rpart function computes the "improve" (which is given in fit$split) when using the split='information' parameter. Thanks to Professor Atkinson help, I was able to find how this is done in the case that split='gini'. By following the explanation here:
2003 Apr 12
5
rpart vs. randomForest
Greetings. I'm trying to determine whether to use rpart or randomForest for a classification tree. Has anybody tested efficacy formally? I've run both and the confusion matrix for rf beats rpart. I've looking at the rf help page and am unable to figure out how to extract the tree. But more than that I'm looking for a more comprehensive user's guide for randomForest including
2004 May 07
0
rpart for CART with weights/priors
Hi, I have a technical question about rpart: according to Breiman et al. 1984, different costs for misclassification in CART can be modelled either by means of modifying the loss matrix or by means of using different prior probabilities for the classes, which again should have the same effect as using different weights for the response classes. What I tried was this: library(rpart)
2001 Nov 12
0
Additional Documentation for rpart?
Dear r-help, I am looking for additional documentation on the "adj" column in rpart's splits matrix. The help says: adj gives the adjusted concordance for surrogate splits I am looking info about "adjusted concordance". I cannot find this phrase in either Therneau & Atkinson original RPART documentation or the CART book. This question came up in the
2011 Jan 24
1
How to measure/rank ?variable importance when using rpart?
--- included message ---- Thus, my question is: *What common measures exists for ranking/measuring variable importance of participating variables in a CART model? And how can this be computed using R (for example, when using the rpart package)* ---end ---- Consider the following printout from rpart summary(rpart(time ~ age + ph.ecog + pat.karno, data=lung)) Node number 1: 228 observations,
2006 Feb 16
0
sums of absolute deviations about the median as split function in rpart
Dear R community, as stated in Breiman et.al. (1984) and De'Ath & Fabricius (2000) using sums of absolute deviations about the median as an impurity measure gives robust trees. I would like to use this method in rpart. Has somebody already tried this method in rpart? Is there maybe already a script available somewhere? I am aware of the possibility to define usersplits myself with
2001 Jul 12
2
rpart puzzle
I've been using the package rpart with R 1.3.0 for Windows to produce simple classification trees for some measurement data from paleontological specimens. Both the rpart documentation and the output confirm that the program produces splits on continuous data that leave "holes" in the data. It is probably of little practical importance, but is there a reason why the binary
2006 Oct 17
1
Some questions on Rpart algorithm
Hello: I am using rpart and would like more background on how the splits are made and how to interpret results - also how to properly use text(.rpart). I have looked through Venables and Ripley and through the rpart help and still have some questions. If there is a source (say, Breiman et al) on decision trees that would clear this all up, please let me know. The questions below pertain to a
2009 Apr 30
1
unexpected behavior of rpart 3.1-43, loss matrix
Hi, I just noticed that rpart behaves unexpectecly, when performing classification learning and specifying a loss matrix. if the response variable y is a factor and if not all levels of the factor occur in the observations, rpart exits with an error: > df=data.frame(attr=1:5,class=factor(c(2,3,1,5,3),levels=1:6)) > rpart(class~attr,df,parms=list(loss=matrix(0,6,6))) Error in
2004 Feb 17
0
New package -- mvpart
The package mvpart is now available. mvpart includes partitioning based on (1) multivariate numeric responses and (2) dissimilarity matrices. The package mvpart is a modification of rpart -- -- authors of original: Terry M Therneau and Beth Atkinson <atkinson at mayo.edu>, and R port of rpart Brian Ripley <ripley at stats.ox.ac.uk>. Includes some modified routines from vegan --
2004 Feb 17
0
New package -- mvpart
The package mvpart is now available. mvpart includes partitioning based on (1) multivariate numeric responses and (2) dissimilarity matrices. The package mvpart is a modification of rpart -- -- authors of original: Terry M Therneau and Beth Atkinson <atkinson at mayo.edu>, and R port of rpart Brian Ripley <ripley at stats.ox.ac.uk>. Includes some modified routines from vegan --
2010 Nov 22
1
using rpart with a tree misclassification condition
Hello I want to build a classification tree for a binary response variable while the condition for the final tree should be : The total misclassification for each group (zero or one) will be less then 10% . for example: if I have in the root 100 observations, 90 from group 0 and 10 from group 1, I want that in the final tree a maximum of 9 and 1 observations out of group 0 and 1, respectively,
2003 Jul 17
1
Rpart question - labeling nodes with something not in x$frame
I have a tree created with tr.hh.logcas <- rpart(log(YCASSX + 1)~AGE+DRUGUSEY+SEX+OBSXNUM +WINDLE, xval = 10) I would like to label the nodes with YCASSX rather than log(YCASSX + 1). But the help file for text in library rpart says that you can only use labels that are part of x$frame, which YCASSX is not. Is there a way to do what I want? Thanks in advance Peter Peter L. Flom, PhD