thr3ads.net - R help - [R] rpart and randomforest results [Apr 2014]

If this information is useful, please help other people find it:
Share via:

Schillo, Sonja

2014-Apr-01 08:27 UTC

[R] rpart and randomforest results

Hi all,

I have a question on rpart and randomforest results:

We calculated a single regression tree using rpart and got a pseudo-r2 of
roundabout 10% (which is not too bad compared to a linear regression on this
data). Encouraged by this we grew a whole regression forest on the same data set
using randomforest. But we got  pretty bad pseudo-r2 values for the randomforest
(even sometimes negative values for some option settings).
We then thought that if we built only one single tree with the randomforest
routine we should get a result similar to that of rpart. So we set the options
for randomforest to only one single tree but the resulting pseudo-r2 value was
negative aswell.

Does anyone have a clue as to why the randomforest results are so bad whereas
the rpart result is quite ok?
Is our assumption that a single tree grown by randomforest should give similar
results as a tree grown by rpart wrong?
What am I missing here? 

Thanks a lot for your help!
Sonja

Mitchell Maltenfort

2014-Apr-01 11:31 UTC

head link

[R] rpart and randomforest results

Is it possible that the random forest is somehow adjusting for optimism or
overfitting?
On Apr 1, 2014 7:27 AM, "Schillo, Sonja"
<Sonja.Schillo@uni-due.de> wrote:
> Hi all,
>
> I have a question on rpart and randomforest results:
>
> We calculated a single regression tree using rpart and got a pseudo-r2 of
> roundabout 10% (which is not too bad compared to a linear regression on
> this data). Encouraged by this we grew a whole regression forest on the
> same data set using randomforest. But we got  pretty bad pseudo-r2 values
> for the randomforest (even sometimes negative values for some option
> settings).
> We then thought that if we built only one single tree with the
> randomforest routine we should get a result similar to that of rpart. So we
> set the options for randomforest to only one single tree but the resulting
> pseudo-r2 value was negative aswell.
>
> Does anyone have a clue as to why the randomforest results are so bad
> whereas the rpart result is quite ok?
> Is our assumption that a single tree grown by randomforest should give
> similar results as a tree grown by rpart wrong?
> What am I missing here?
>
> Thanks a lot for your help!
> Sonja
>
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
	[[alternative HTML version deleted]]

R help - Apr 2014 - rpart and randomforest results

[R] rpart and randomforest results

[R] rpart and randomforest results