Hi Everyone,
I have been using the "tree" package for a while with no problems
until now.
When I run predict(tree, newdata), I get an error with the message "Corrupt
tree" for about 50% of the trees that I generate with tree. For other
trees, the predict function completes with no errors.
I haven't identified a correlation between the corrupt trees and the working
tree. It is the same code with the same data set. The only difference is that
the training data is sampled randomly from the complete data set.
The "copy-pasted" code below illustrates:
> data <- load.data()
> mytree <- run.tree(data)
> mypred <- gen.predictions(mytree, data )
Error in pred1.tree(object, tree.matrix(newdata)) : corrupt tree
> mytree <- run.tree(data)
> mypred <- gen.predictions(mytree,data )
(no error here)>
The function run.tree() does the following:
1. Split the data into training and test
2. Call tree() to generate a tree object
3. Returns the indices of the training, test in the data as well as the tree
object
The function gen.predictions() does the following:
1. Call predict() with the tree object from run.tree() and the test data
2. Returns the yprob values and other information about the predictions
Any help would be greatly appreciated.
Osei