Displaying 1 result from an estimated 1 matches for "suncertain_at_gmail".
2007 May 10
3
how to control the sampling to make each sample unique
I have a dataset of 10000 records which I want to use to compare two
prediction models.
I split the records into test dataset (size = ntest) and training dataset
(size = ntrain). Then I run the two models.
Now I want to shuffle the data and rerun the models. I want many shuffles.
I know that the following command
sample ((1:10000), ntrain)
can pick ntrain numbers from 1 to 10000. Then I just