Displaying 20 results from an estimated 1000 matches similar to: "rpart"
2011 Jun 13
1
In rpart, how is "improve" calculated? (in the "class" case)
Hi all,
I apologies in advance if I am missing something very simple here, but since
I failed at resolving this myself, I'm sending this question to the list.
I would appreciate any help in understanding how the rpart function is
(exactly) computing the "improve" (which is given in fit$split), and how it
differs when using the split='information' vs split='gini'
2011 Jun 21
0
How does rpart computes "improve" for split="information"?? (which seems to be different then the "gini" case)
Hello dear R-help members,
I would appreciate any help in understanding how the rpart function computes
the "improve" (which is given in fit$split) when using the
split='information' parameter.
Thanks to Professor Atkinson help, I was able to find how this is done in
the case that split='gini'. By following the explanation here:
2009 May 31
1
Bug in truncgof package?
Dear R-helpers,
I was testing the truncgof CRAN package, found something that looked
like a bug, and did my job: contacted the maintainer. But he did not
reply, so I am resending my query here.
I installed package truncgof and run the example for function ad.test. I
got the following output:
set.seed(123)
treshold <- 10
xc <- rlnorm(100, 2, 2) # complete sample
xt <- xc[xc >=
2010 Apr 28
1
Question on: Random Forest Variable Importance for Regression Problems
I am trying to use the package RandomForest performing regression.
The variable importance estimates are given as: "%IncMSE" and
"IncNodePurity"
Can anyone explain me what these refer to and how they are calculated?
I found a lot of information on variable importance measures for
classification problems, but nothing on regression.
Thanks a lot.
Mareike
2006 Nov 20
0
rpart
Dear r-help-list:
I' got a question about the computation of the improve of a split. The following is an extract of an output of the summary of a tree:
Node number 1: 600 observations, complexity param=0.007272727
predicted class=0 expected loss=0.1666667
class counts: 500 100
probabilities: 0.833 0.167
left son=2 (211 obs) right son=3 (389 obs)
Primary splits:
x4
2005 Aug 08
1
vector vs array
Hi!
OK, I'm trying to select some "useful outliers" from
my dataset: I defined 11 "treshold" values (1 for each
level of a variable (sampling site) as follows:
tresholds<-function(x)
{
tapply(x,mm$NAME,FUN=mean ,simplify = T, na.rm=T)->med
tapply(x,mm$NAME,FUN=sd ,simplify = T,
na.rm=T)->standev
standev+med
}
tresholds(mm$chl)
Now I'd like to select
2004 Apr 20
2
polygon
Dear all
In order to clearly mark values wich are larger than a treshold value, I
would like to color the surface below the line given by plot (yy~xx). To
color is only the surface between abline (treshold) and yy if they are
larger than the specific limit. I guess I can use the function polygon,
but I can not find any valuable solution.
I'm grateful to you for an advice or an example.
2004 Feb 26
3
my own function given to lapply
Hi
It seems, I just miss something. I defined
treshold <- function(pred) {
if (pred < 0.5) pred <- 0 else pred <- 1
return(pred)
}
and want to use apply it on a vector
sapply(mylist[,,3],threshold)
but I get:
Error in match.fun(FUN) : Object "threshold" not found
thanks for help
cheers
chris
--
Christoph Lehmann <christoph.lehmann at gmx.ch>
2004 Oct 11
1
Linux freezes on large file transfers
I am running MD 10 (Community) as a file server on a Shuttle SB61G2. This
setup worked very well under Mandrake 9.2 however, everytime I try to copy
files larger than say <550 ~650MB using MD 10, my linux box freezes and must
be rebooted. I can FTP the same file(s) perfectly fine to other PC 's on my
home net. Small volumes of files work fine as well as ISO images, the box
seems to
2007 Aug 16
1
Regression tree: labels in the terminal nodes
Dear everybody,
I'm a new user of R 2.4.1 and I'm searching for information on improving
the output of regression tree graphs.
In the terminal nodes I am up to now able to indicate the number of
values (n) and the mean of all values in this terminal node by the command
> text(tree, use.n=T, xpd=T)
Yet I would like to indicate automatically in the output graph of the
tree some
2016 Apr 05
0
Problem with <= (less than or equal): not giving the expected result
You could use something like this
x <- abs(0.95 - 1)
treshold <- 0.05
x < treshold | abs(x - treshold) < 1e-6
ir. Thierry Onkelinx
Instituut voor natuur- en bosonderzoek / Research Institute for Nature and
Forest
team Biometrie & Kwaliteitszorg / team Biometrics & Quality Assurance
Kliniekstraat 25
1070 Anderlecht
Belgium
To call in the statistician after the experiment is
2001 May 22
1
Surrogate splits for decision trees
Dear R,
Short verse of the question:
Is there R code which will calculate surrogate splits
and/or delta impurity for decision trees at each node?
Long Version:
I have local, legacy code which I use to calculate my decision trees.
I would like to switch to R, but as I understand it surrogate splits
are not implemented.
Surrogate splits and feature ranking are described in Breiman et al
2010 Oct 13
5
Regular expression to find value between brackets
Hi,
this should be an easy one, but I can't figure it out.
I have a vector of tests, with their units between brackets (if they have
units).
eg tests <- c("pH", "Assay (%)", "Impurity A(%)", "content (mg/ml)")
Now I would like to hava a function where I use a test as input, and which
returns the units
like:
f <- function (x) sub("\\)",
2017 Apr 09
2
Splitting C/C++ code into pure and side-effecting code
Hi Suman,
I think you can ascertain pureness automatically leveraging the compiler
instead of manually tagging attribute to each method and call-site. It
would seem like impurity should be a transitive attribute. So this would
conflict with below.
__attribute__((annotate("pure")))
int add(uint32_t a, uint32_t b) { // impure by calling printf...
...
printf("%d + %d =
2006 Mar 02
5
Milliwatt Analyzer available
Hi,
some days ago we discused here the need for an analyzer
for the 1000 Hz tone, as opposite application to Milliwatt.
Here it is: Mwanalyze
http://planinternet.net/download/voip/asterisk/app_mwanalyze.c
It performs a Fourier analysis for a fixed frequency
and tells the amplitude.
The frequency is not limited to 1000 Hz, but can be passed
as argument. The periode duration must be a mulitple
2005 Aug 26
1
Help in Compliling user -defined functions in Rpart
I have been trying to write my own user defined function in Rpart.I
imitated the anova splitting rule which is given as an example.In the
work I am doing ,I am calculating the concentration index(ci) ,which
is in between -1 and +1.So my deviance is given by
abs(ci)*(1-abs(ci)).Now when I run rpart incorporating this user
defined function i get the following error message:
Error in
2006 Feb 16
0
sums of absolute deviations about the median as split function in rpart
Dear R community,
as stated in Breiman et.al. (1984) and De'Ath & Fabricius (2000) using
sums of absolute deviations about the median as an impurity measure
gives robust trees.
I would like to use this method in rpart.
Has somebody already tried this method in rpart? Is there maybe already
a script available somewhere?
I am aware of the possibility to define usersplits myself with
2008 Sep 12
5
cram-md5 and users maintaining their own passwords?
Is there any other mechanism than using passwd files with md5-hashed
passwords created by dovecotpw that will support cram-md5
authentication?
Has anyone created setups where the passwd databases reside in the
individual users home directories?
Is it possible to persuade dovecotpw to update the passwd databases
automatically. Having to use a text editor to paste in the passwords
sets a high
2016 Apr 05
1
Problem with <= (less than or equal): not giving the expected result
Thanks!
On 05 Apr 2016, at 16:07, Thierry Onkelinx <thierry.onkelinx at inbo.be<mailto:thierry.onkelinx at inbo.be>> wrote:
You could use something like this
x <- abs(0.95 - 1)
treshold <- 0.05
x < treshold | abs(x - treshold) < 1e-6
ir. Thierry Onkelinx
Instituut voor natuur- en bosonderzoek / Research Institute for Nature and Forest
team Biometrie &
2012 Nov 01
0
oblique.tree : the predict function asserts the dependent variable to be included in "newdata"
Dear R community,
I have recently discovered the package oblique.tree and I must admit that
it was a nice surprise for me,
since I have actually made my own version of a kind of a classifier which
uses the idea of oblique splits (splits by means of hyperplanes).
So I am now interested in comparing these two classifiers.
But what I do not seem to understand is why the function