similar to: Efficient way to subset rows in R for dataset with 10^7 columns

Displaying 20 results from an estimated 200 matches similar to: "Efficient way to subset rows in R for dataset with 10^7 columns"

2018 Apr 14
0
Efficient way to subset rows in R for dataset with 10^7 columns
Oh, there are ways, but the constraining issue here is moving data (memory bandwidth), and data table is probably already the fastest mechanism for doing that. If you have a computer with four or more real cores you can try setting up a subset of the columns in each task and cbind the results afterward, but it will be hard to accomplish without making extra copies of the data. You are already
2011 Nov 30
1
glm step() fkt
Hi volks, i have a question about the step() fkt. Is there a possibility to save the last model generated from this method. I have a loop and so i generate 100 different models with the step fkt and i want to know which model is the most common. CODE: ... missStep -> numeric(100) for (j in 1:100) { trainindex <- sample(c(1:462),300) train <- data[trainindex,] test <-
2018 Apr 21
2
Removing columns from big.matrix which have only one value
I have a very large binary matrix, stored as a big.matrix to conserve memory (it is over 2 gb otherwise - 5 million columns and 100 rows). r <- 100 c <- 10000 m4 <- matrix(sample(0:1,r*c, replace=TRUE),r,c) m4 <- cbind(m4, 1) m4 <- as.big.matrix(m4) I need to remove every column which has only one unique value (in this case, only 0s or only 1s). Because of the number of columns, I
2023 Apr 14
2
Package Caret
This is the error then: error in prettyseq(1:ncol(out)) : could not find function "prettyseq" Eric Berger <ericjberger at gmail.com> schrieb am Fr., 14. Apr. 2023, 09:06: > What happens if you do the following? > > library(caret) > > ?caret::createDataPartition > > i.e. to confirm that caret is loaded, seek help on this function > > > On Fri, Apr
2023 Apr 14
1
Package Caret
What happens if you do the following? > library(caret) > ?caret::createDataPartition i.e. to confirm that caret is loaded, seek help on this function On Fri, Apr 14, 2023 at 9:56?AM G?bor Malomsoki <gmalomsoki1980 at gmail.com> wrote: > Eric, > > Yes, i know, i am calling the package, but this is not working. > I saw a similar question in Stackoverflow, advising install
2023 Apr 14
1
Package Caret
Or is there any similar function to split the dataframe to trainDF and testDF? G?bor Malomsoki <gmalomsoki1980 at gmail.com> schrieb am Fr., 14. Apr. 2023, 09:45: > > This is the error then: > error in prettyseq(1:ncol(out)) : could not find function "prettyseq" > > > > > Eric Berger <ericjberger at gmail.com> schrieb am Fr., 14. Apr. 2023, 09:06:
2023 Apr 14
1
Package Caret
Eric, Yes, i know, i am calling the package, but this is not working. I saw a similar question in Stackoverflow, advising install package Rcpp, but this one i am unable to install. Eric Berger <ericjberger at gmail.com> schrieb am Fr., 14. Apr. 2023, 08:47: > You first have to load the package using the library command. > > library(caret) > > Then you can call
2023 Apr 16
1
Package Caret
I have newly installed R, R-tools, RStudio, but still not working: library(caret)Lade n?tiges Paket: latticeError: Laden von Paket oder Namensraum f?r ?caret? in loadNamespace(i, c(lib.loc, .libPaths()), versionCheck = vI[[i]]): fehlgeschlagen Namensraum ?vctrs? 0.5.2 ist bereits geladen, aber >= 0.6.0 wird gefordert Error in createDataPartition(hypotezis_df$X, p = 0.75, list = FALSE, times
2023 Apr 14
1
Package Caret
You should post the result of running sessionInfo(), and tell us how you installed R. Your installation seems very broken. Duncan Murdoch On 14/04/2023 3:45 a.m., G?bor Malomsoki wrote: > This is the error then: > error in prettyseq(1:ncol(out)) : could not find function "prettyseq" > > > > > Eric Berger <ericjberger at gmail.com> schrieb am Fr., 14.
2023 Apr 16
1
Package Caret
So update the vctrs package to the latest version first before loading R-tools (or the caret package, specifically)? -- Bert On Sun, Apr 16, 2023 at 1:57?PM G?bor Malomsoki <gmalomsoki1980 at gmail.com> wrote: > > I have newly installed R, R-tools, RStudio, but still not working: > > library(caret)Lade n?tiges Paket: latticeError: Laden von Paket oder > Namensraum f?r
2023 Apr 14
1
Package Caret
You first have to load the package using the library command. > library(caret) Then you can call createDataPartition. e.g. > data(oil) > createDataPartition(oilType,2) HTH, Eric On Fri, Apr 14, 2023 at 7:52?AM G?bor Malomsoki <gmalomsoki1980 at gmail.com> wrote: > Dear all, > When i try to use createDataPartition after calling package 'caret', i get > the
2023 Apr 16
2
Package Caret
Many thanks Bert, now is ok, i did not know that "Namensraum" should mean a package Am So., 16. Apr. 2023 um 23:44 Uhr schrieb Bert Gunter < bgunter.4567 at gmail.com>: > So update the vctrs package to the latest version first before loading > R-tools (or the caret package, specifically)? > > -- Bert > > On Sun, Apr 16, 2023 at 1:57?PM G?bor Malomsoki >
2010 Sep 09
1
createDataPartition
Dear all, does anyone know how to define the structure of the required samples using function createDataPartition, meaning proportions of different types of variable in the partition? Smth like this for iris data: createDataPartition(y = c(setosa = .5, virginica = .3, versicolor = .2), times = 10, p = .7, list = FALSE) Thanks a lot for your help. Regards, Trafim [[alternative HTML version
2023 Apr 14
1
Package Caret
Dear all, When i try to use createDataPartition after calling package 'caret', i get the message: "could not find function createDataPartition" I use: R-4.2.3 for Windows RStudio-2023.03.0-386 Do you have any experience with this failure? Thank you! Best regards Gabor [[alternative HTML version deleted]]
2011 Oct 02
1
difference between createPartition and createfold functions
Hello, I'm trying to separate my dataset into 4 parts with the 4th one as the test dataset, and the other three to fit a model. I've been searching for the difference between these 2 functions in Caret package, but the most I can get is this-- A series of test/training partitions are created using createDataPartition while createResample creates one or more bootstrap samples.
2012 Nov 04
1
sample equal number of cases per class
Dear community I have a dataframe and want to split it into a learn and a test partition. However the learnset should be balanced, i.e. each class should have the same number of cases. I tried and searched a lot, without success so far. Maybe you can help? Some example code *# generate example data df <- data.frame(class = as.factor(sample(1:3, 20, replace = T)), var1 = rnorm(20,3), var2 =
2013 Feb 19
0
CARET. Relationship between data splitting trainControl
I have carefully read the CARET documentation at: http://caret.r-forge.r-project.org/training.html, the vignettes, and everything is quite clear (the examples on the website help a lot!), but I am still a confused about the relationship between two arguments to trainControl: "method" "index" and the interplay between trainControl and the data splitting functions in caret
2013 Jan 08
0
bagging SVM Ensemble
Dear Sir, I got a problem with my program. I would like to classify my data using bagging support vector machine ensemble. I split my data into training data and test data. For a given data sets TR(X), K replicated training data sets are first randomly generated by bootstrapping technique with replacement. Next, Support Vector Mechine (SVM) is applied for each bootstrap data sets. Finally, the
2013 Mar 02
2
caret pls model statistics
Greetings, I have been exploring the use of the caret package to conduct some plsda modeling. Previously, I have come across methods that result in a R2 and Q2 for the model. Using the 'iris' data set, I wanted to see if I could accomplish this with the caret package. I use the following code: library(caret) data(iris) #needed to convert to numeric in order to do regression #I
2023 May 09
1
RandomForest tuning the parameters
Hi Sacha, On second thought, perhaps this is more the direction that you want ... X2 = cbind(X_train,y_train) colnames(X2)[3] = "y" regr2<-randomForest(y~x1+x2, data=X2,maxnodes=10, ntree=10) regr regr2 #Make prediction predictions= predict(regr, X_test) predictions2= predict(regr2, X_test) HTH, Eric On Tue, May 9, 2023 at 6:40?AM Eric Berger <ericjberger at gmail.com>