thr3ads.net - similar to: "Working with createFolds"

Displaying 20 results from an estimated 1100 matches similar to: "Working with createFolds"

2012 Nov 23

caret train and trainControl

I am used to packages like e1071, where you have a tune step and then pass your tunings to train. It seems that with caret, tuning and training are both handled by train. I am using train and trainControl to find my hyperparameters like so: MyTrainControl=trainControl( method = "cv", number=5, returnResamp = "all", classProbs = TRUE ) rbfSVM <- train(label~., data =

2011 May 05

[caret package] [trainControl] supplying predefined partitions to train with cross validation

Hi all, I run R 2.11.1 under Ubuntu 10.10 and caret version 2.88. I use the caret package to compare different models on a dataset. In order to compare their performance I would like to use the same data partitions for every model. I understand that using a LGOCV or a boot type re-sampling method along with the "index" argument of the trainControl function, one is able to
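The idea raised in this snippet, reusing the same partitions for every model through the "index" argument of trainControl, can be sketched as follows. This is only an illustration, not the poster's code: it assumes a reasonably recent caret release (not the 2.88 version mentioned above), uses the built-in iris data as a stand-in dataset, and the names shared_folds, ctrl, fit_knn and fit_rpart are hypothetical.

```r
library(caret)  # assumes caret (and rpart, which ships with R) is installed

set.seed(42)    # arbitrary seed, only so the sketch is repeatable

# Training-row indices for 5 folds; returnTrain = TRUE returns the rows to
# fit on in each fold, which is the form the index argument expects.
shared_folds <- createFolds(iris$Species, k = 5, returnTrain = TRUE)

# One control object carrying the predefined partitions.
ctrl <- trainControl(method = "cv", number = 5, index = shared_folds)

# Both models are resampled on exactly the same rows.
fit_knn   <- train(Species ~ ., data = iris, method = "knn",   trControl = ctrl)
fit_rpart <- train(Species ~ ., data = iris, method = "rpart", trControl = ctrl)
```

Because both calls receive the same ctrl object, the per-fold results of the two fits refer to identical hold-out sets and can be compared fold by fold.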

2011 Oct 02

difference between createPartition and createfold functions

Hello, I'm trying to separate my dataset into 4 parts, with the 4th one as the test dataset and the other three to fit a model. I've been searching for the difference between these 2 functions in the caret package, but the most I can find is this: A series of test/training partitions are created using createDataPartition while createResample creates one or more bootstrap samples.
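As a rough illustration of the distinction quoted in this snippet, the sketch below calls the three caret splitting functions side by side. It is not taken from the thread: the iris data, the 75% split, the seed, and all variable names are assumptions chosen for the example.

```r
library(caret)  # assumes the caret package is installed

set.seed(1)     # arbitrary seed, only so the sketch is repeatable
y <- iris$Species

# createDataPartition: one stratified train/test split (about 75% for training).
train_idx <- createDataPartition(y, p = 0.75, list = FALSE)
train_set <- iris[train_idx, ]
test_set  <- iris[-train_idx, ]

# createFolds: four disjoint folds; each element holds the rows of one fold.
folds <- createFolds(y, k = 4, list = TRUE, returnTrain = FALSE)

# createResample: bootstrap samples, rows drawn with replacement.
boots <- createResample(y, times = 3, list = TRUE)
```

For the "4 parts, 4th as test" layout described above, one option under these assumptions is to treat folds[[4]] as the test rows and fit on the rows in the other three folds.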

2013 Feb 19

CARET. Relationship between data splitting trainControl

I have carefully read the CARET documentation at http://caret.r-forge.r-project.org/training.html and the vignettes, and everything is quite clear (the examples on the website help a lot!), but I am still confused about the relationship between two arguments to trainControl, "method" and "index", and the interplay between trainControl and the data splitting functions in caret

2012 May 15

caret: Error when using rpart and CV != LOOCV

Hi, I got the following problem when trying to build an rpart model using anything but LOOCV. Originally, I wanted to use k-fold partitioning, but every partitioning except LOOCV throws the following warning: ---- Warning message: In nominalTrainWorkflow(dat = trainData, info = trainInfo, method = method, : There were missing values in resampled performance measures. ----- Below are some

2011 Aug 28

Trying to extract probabilities in CARET (caret) package with a glmStepAIC model

Dear developers, I have just started working with caret and all the nice features it offers. But I just encountered a problem: I am working with a dataset that includes 4 predictor variables in Descr and a two-category outcome in Categ (coded as a factor). Everything was working fine and I got the results, confusion matrix, etc. BUT for obtaining the AUC and predicted probabilities I had to add

2012 Jul 12

Caret: Use timingSamps leads to error

I want to use the caret package and found out about the timingSamps option for obtaining the time needed to predict results. But as soon as I set a value for this option, the whole model generation fails. Check this example: ------------------------- library(caret) tc=trainControl(method='LGOCV', timingSamps=10) tcWithout=trainControl(method='LGOCV')

2009 Jan 15

problems with extractPrediction in package caret

Hi list, I'm working on a predictive modeling task using the caret package. I found the best model parameters using the train() and trainControl() commands. Now I want to evaluate my model and make predictions on a test dataset. I tried to follow the instructions in the manual and the vignettes, but unfortunately I'm getting an error message I can't figure out. Here is my code: rfControl <-

2008 Sep 18

caret package: arguments passed to the classification or regression routine

Hi, I am having problems passing arguments to method="gbm" using the train() function. I would like to train gbm using the laplace distribution or the quantile distribution. Here is the code I used and the error: gbm.test <- train(x.enet, y.matrix[,7], method="gbm", distribution=list(name="quantile",alpha=0.5), verbose=FALSE,

2012 May 30

caret() train based on cross validation - split dataset to keep sites together?

Hello all, I have searched and have not yet identified a solution, so now I am sending this message. In short, I need to split my data into training, validation, and testing subsets that keep all observations from the same sites together, preferably as part of a cross validation procedure. Now for the longer version. And I must confess that although my R skills are improving, they are not so

2009 Jun 08

caret package

Hi all, I am using the caret package and having difficulty obtaining results for a regression. I used glmnet to fit the model and am trying to get the coefficients and the model parameters. I am trying to use extractPrediction to obtain a confusion matrix, and it seems to be giving me errors. x<-read.csv("x.csv", header=TRUE); y<-read.csv("y.csv", header=TRUE);

2011 Jan 24

Train error:: subscript out of bonds

Hi, I am trying to construct a svmpoly model using the "caret" package (please see code below). Using the same data, without changing any setting, I am just changing the seed value. Sometimes it constructs the model successfully, and sometimes I get an "Error in indexes[[j]] : subscript out of bounds". For example, when I set the seed to 357, the following code produced results only for 8

2012 Feb 10

Custom caret metric based on prob-predictions/rankings

I'm dealing with classification problems, and I'm trying to specify a custom scoring metric (recall at p, ROC, etc.) that depends on not just the class output but the probability estimates, so that caret::train can choose the optimal tuning parameters based on this metric. However, when I supply a trainControl summaryFunction, the data given to it contains only class predictions, so the

2018 May 31

Predictions on the OOB of randomForest

Thanks, Carlos. I don't use caret, but I will look into it. Quoting Carlos Ortega <cof en qualityexcellence.es>: > Hello, > > I think that if you use "caret" and define "oob" as the randomization criterion in the "trainControl()" function, you can then retrieve the individual predictions from the model object... > > Regards, > Carlos Ortega >

2013 Mar 02

caret pls model statistics

Greetings, I have been exploring the use of the caret package to conduct some plsda modeling. Previously, I have come across methods that result in an R2 and Q2 for the model. Using the 'iris' data set, I wanted to see if I could accomplish this with the caret package. I use the following code: library(caret) data(iris) #needed to convert to numeric in order to do regression #I

2013 Nov 06

R help-classification accuracy of DFA and RF using caret

Hi, I am a graduate student applying published R scripts to compare the classification accuracy of 2 predictive models, one built using discriminant function analysis and one using random forests (webpage link for these scripts is provided below). The purpose of these models is to predict the biotic integrity of streams. Specifically, I am trying to compare the classification accuracy (i.e.,

2013 Mar 23

LOOCV over SVM,KNN

Good afternoon. I would like to know if there is any function in R to do LOOCV with these classifiers: 1) SVM 2) Neural Networks 3) C4.5 (J48) 4) KNN. Thanks a lot!

2013 Apr 04

Extract the accuracy of 10-CV

Hello guys! I am working with some classifiers (SVM, C4.5, RNA, etc.) using 10-fold CV. Once I have the model for each one, I validate these models on one dataset. Then, with my model and the dataset, I extract a confusion matrix to assess the predictive capacity of the model. And finally, I extract the accuracy of this prediction based on the diagonal of the confusion matrix. The

2013 Apr 08

Applying bagging in classifiers

Hello! Does anyone know how to apply bagging to SVM (for example)? I am using the adabag package to execute bagging, but its "bagging" method works with classification trees. I would like to apply bagging to other classifiers such as SVM, RNA or KNN. Has anyone done it? Thanks!!

2009 Dec 21

Help，Suggest me some methods to identify training set and test set!!!

I want to split my whole dataset into a training set and a test set, building the model on the training set and validating it on the test set. Now, how can I split my dataset reasonably? Please give me a hand; it would be better to give me some R code. I have seen some approaches, like using SOM to project all the independent variables to 2 dimensions and picking some to be the training set and others the test set, like below. I