similar to: In rpart, how is "improve" calculated? (in the "class" case)

Displaying 20 results from an estimated 200 matches similar to: "In rpart, how is "improve" calculated? (in the "class" case)"

2011 Jun 21
0
How does rpart compute "improve" for split="information"? (which seems to be different from the "gini" case)
Hello dear R-help members, I would appreciate any help in understanding how the rpart function computes the "improve" value (which is given in fit$split) when using the split='information' parameter. Thanks to Professor Atkinson's help, I was able to find out how this is done in the case of split='gini'. By following the explanation here:
2010 Sep 03
7
Function Gini or Ineq
Hi listers, Is it necessary to install a package in order to use the GINI or INEQ functions? If I use the following commands, R tells me that it could not find the gini function. x<-c(541, 1463, 2445, 3438, 4437, 5401, 6392, 8304, 11904, 22261) G<-gini(x) Thanks in advance, Marcio
2016 Mar 30
2
Compute the Gini coefficient
Hello, I would like to build a Lorenz curve and calculate a Gini coefficient in order to find how many parasites the 20% most infected hosts support. Here is my data set: Number of parasites per host: parasites = c(0,1,2,3,4,5,6,7,8,9,10) Number of hosts associated with each number of parasites given above: hosts = c(18,20,28,19,16,10,3,1,0,0,0) To represent the Lorenz curve: I
2003 Apr 24
5
Fast R implementation of Gini mean difference
I have written the following function to calculate the weighted mean difference for univariate data (see http://www.xycoon.com/gini_mean_difference.htm for a related formula). Unsurprisingly, the function is slow (compared to sd or mad) for long vectors. I wonder if there's a way to make the function faster, short of creating an external C function. Thanks very much for your advice. gmd
2004 Jun 04
1
rpart
Hello everyone, I'm a newbie to R and to CART so I hope my questions don't seem too stupid. 1.) My first question concerns the rpart() method. Which method does rpart use in order to get the best split - entropy impurity, Bayes error (min. error) or Gini index? Is there a way to make it use the entropy impurity? The second and third questions concern the output of the printcp() function.
2005 Jun 24
2
Gini with frequencies
Hi there, I am trying to compute Gini coefficients for vectors containing income classes. The data I have look like this: yit <- c(135, 164, 234, 369) piit <- c(367, 884, 341, 74 ) where yit is the vector of income classes, and piit is the vector of associated frequencies. (These data are from Rustichini, Ichino and Checchi (Journal of Public Economics, 1999).) In the ineq package, Gini( )
2010 Sep 28
3
Calculating the variance of the Gini by bootstrap
Hello, I am passing along the little program I am looking at; what catches my attention is the part where the functions are defined. I first tried putting the statistic to be estimated inside boots by using gini(varible, pesos) directly, but it would not let me. I saw that the example in the boots manual always defines the function beforehand, so I tried defining a function first, doing grini<-function(x) {gini(variable,
2007 Jun 11
1
Gini coefficient in R
If I use the ineq library and the Gini function in this way: >Gini(c(100,0,0,0)) I obtain the result 0.75 instead of 1 (that is, perfect inequality). I think the Gini formula in ineq is based on a formula as reported here: http://mathworld.wolfram.com/GiniCoefficient.html but in the case of perfect inequality: x_1 = ... = x_{n-1} = 0, x_n > 0 these formulas are equal to 1 - 1/n, not to
2012 Feb 20
1
random numbers
Hello, Is there a way to create random numbers that fit a certain specified requirement other than distributional characteristics? In particular, I would like to create simulated income distributions with a certain Gini coefficient. Ryan -- Ryan Murphy 2012 B.A. Economics and Mathematics
2005 Feb 05
2
Std Err on Concentration measures
Hi, I'm using the ineq package to calculate some concentration measures (Gini, Herfindahl, ...) and I was wondering if there is also a function to calculate standard errors for these measures. If not, is anybody aware of where I can find a reference on this point? Thanks. -- Angelo Secchi
2012 Oct 26
1
Gini with bias correction
Hey there, I was wondering if someone could tell me if there's a package or command that allows me to compute a Gini coefficient using a vector of weights. Also, the coefficient should be bias corrected. Diego Rojas
2004 Jul 16
3
rpart and TREE, can be the same?
Hi, all, I am wondering if it is possible to set parameters of 'rpart' and 'tree' such that they will produce the exact same tree? Thanks. Auston Wei Statistical Analyst Department of Biostatistics and Applied Mathematics The University of Texas MD Anderson Cancer Center Tel: 713-563-4281 Email: wwei@mdanderson.org
2007 Jun 16
1
Use of the "by" command (clarification)
Well apparently this has nothing to do with the gini() command. I cannot get it to work for something as simple as sum() Here is the little example I am playing with, maybe someone can help me find my error: a<-c("A","B","C","A","B","C","A","A","C","B")
2009 Sep 28
1
how to visualize gini coefficient in each node in RF?
Dear all, I am working with the randomForest package and I am interested in examining the "Gini importance" measures that are used as a general indicator of feature relevance. Is it possible to get the Gini measure that is estimated in each tree from the output of the getTree() function? Thanks a lot, Chrysanthi
2010 Oct 19
1
Gini Coefficient
Dear List, I am unsure whether this is specifically an R question or a stats question. I thought I would ask here, and if I get no replies, that will answer it! I am trying to calculate Gini coefficients in R, based on a slight modification of the typical equation that I have seen in a paper.
2011 Sep 13
1
class weights with Random Forest
Hi All, I am looking for a reference that explains how the randomForest function in the randomForest package uses the classwt parameter. Here: http://tolstoy.newcastle.edu.au/R/e4/help/08/05/12088.html Andy Liaw suggests not using classwt. And according to: http://r.789695.n4.nabble.com/R-help-with-RandomForest-classwt-option-td817149.html it has "not been implemented" as of 2007.
2001 Jan 10
1
gini index
Hi. I have some data from a survey which include income data. I was not quite sure how to generate the Gini index on these data. Jose Ramon Albert Statistical Research and Training Center www.srtc.gov.ph
2009 May 21
1
Rpart - best split selection for class method and Gini splitting index
Dear R-users, I'm working with the rpart package and trying to understand how the procedure selects the best split when the method "class" and the splitting index "Gini" are used. In particular, I'd like to have a look at the source code that works out the best split for an unordered predictor. Can anyone suggest which functions in the sources I should
2008 Apr 04
1
random forest varimp
Friends, I have noticed that many publications that use RF report variable importance as a function of mean decrease in accuracy rather than mean decrease in gini. Am I correct that the mean decrease in accuracy is just the mean decrease in gini divided by 100? Thanks, Helen Mills Poulos Yale School of Forestry
2008 Sep 27
1
Variable Importance Measure in Package RandomForest
Hi, I have a question about the RandomForest package. The package allows the extraction of a variable importance measure. As far as I could see from the documentation, the computation is based on the Gini index. Do you know if this extraction can also be based on other criteria? In particular, I'm interested in the info gain criterion. Best regards, Chris