Displaying 20 results from an estimated 200 matches similar to: "Efficient way to subset rows in R for dataset with 10^7 columns"
2018 Apr 14
0
Efficient way to subset rows in R for dataset with 10^7 columns
Oh, there are ways, but the constraining issue here is moving data (memory bandwidth), and data table is probably already the fastest mechanism for doing that. If you have a computer with four or more real cores you can try setting up a subset of the columns in each task and cbind the results afterward, but it will be hard to accomplish without making extra copies of the data. You are already
2011 Nov 30
1
glm step() fkt
Hi volks,
i have a question about the step() fkt. Is there a possibility to save the
last model generated from this method. I have a loop and so i generate 100
different models with the step fkt and i want to know which model is the
most common.
CODE:
...
missStep -> numeric(100)
for (j in 1:100) {
trainindex <- sample(c(1:462),300)
train <- data[trainindex,]
test <-
2018 Apr 21
2
Removing columns from big.matrix which have only one value
I have a very large binary matrix, stored as a big.matrix to conserve
memory (it is over 2 gb otherwise - 5 million columns and 100 rows).
r <- 100
c <- 10000
m4 <- matrix(sample(0:1,r*c, replace=TRUE),r,c)
m4 <- cbind(m4, 1)
m4 <- as.big.matrix(m4)
I need to remove every column which has only one unique value (in this
case, only 0s or only 1s). Because of the number of columns, I
2023 Apr 14
2
Package Caret
This is the error then:
error in prettyseq(1:ncol(out)) : could not find function "prettyseq"
Eric Berger <ericjberger at gmail.com> schrieb am Fr., 14. Apr. 2023, 09:06:
> What happens if you do the following?
> > library(caret)
> > ?caret::createDataPartition
>
> i.e. to confirm that caret is loaded, seek help on this function
>
>
> On Fri, Apr
2023 Apr 14
1
Package Caret
What happens if you do the following?
> library(caret)
> ?caret::createDataPartition
i.e. to confirm that caret is loaded, seek help on this function
On Fri, Apr 14, 2023 at 9:56?AM G?bor Malomsoki <gmalomsoki1980 at gmail.com>
wrote:
> Eric,
>
> Yes, i know, i am calling the package, but this is not working.
> I saw a similar question in Stackoverflow, advising install
2023 Apr 14
1
Package Caret
Or is there any similar function to split the dataframe to trainDF and
testDF?
G?bor Malomsoki <gmalomsoki1980 at gmail.com> schrieb am Fr., 14. Apr. 2023,
09:45:
>
> This is the error then:
> error in prettyseq(1:ncol(out)) : could not find function "prettyseq"
>
>
>
>
> Eric Berger <ericjberger at gmail.com> schrieb am Fr., 14. Apr. 2023, 09:06:
2023 Apr 14
1
Package Caret
Eric,
Yes, i know, i am calling the package, but this is not working.
I saw a similar question in Stackoverflow, advising install package Rcpp,
but this one i am unable to install.
Eric Berger <ericjberger at gmail.com> schrieb am Fr., 14. Apr. 2023, 08:47:
> You first have to load the package using the library command.
> > library(caret)
>
> Then you can call
2023 Apr 16
1
Package Caret
I have newly installed R, R-tools, RStudio, but still not working:
library(caret)Lade n?tiges Paket: latticeError: Laden von Paket oder
Namensraum f?r ?caret? in loadNamespace(i, c(lib.loc, .libPaths()),
versionCheck = vI[[i]]): fehlgeschlagen
Namensraum ?vctrs? 0.5.2 ist bereits geladen, aber >= 0.6.0 wird gefordert
Error in createDataPartition(hypotezis_df$X, p = 0.75, list = FALSE,
times
2023 Apr 14
1
Package Caret
You should post the result of running sessionInfo(), and tell us how you
installed R. Your installation seems very broken.
Duncan Murdoch
On 14/04/2023 3:45 a.m., G?bor Malomsoki wrote:
> This is the error then:
> error in prettyseq(1:ncol(out)) : could not find function "prettyseq"
>
>
>
>
> Eric Berger <ericjberger at gmail.com> schrieb am Fr., 14.
2023 Apr 16
1
Package Caret
So update the vctrs package to the latest version first before loading
R-tools (or the caret package, specifically)?
-- Bert
On Sun, Apr 16, 2023 at 1:57?PM G?bor Malomsoki
<gmalomsoki1980 at gmail.com> wrote:
>
> I have newly installed R, R-tools, RStudio, but still not working:
>
> library(caret)Lade n?tiges Paket: latticeError: Laden von Paket oder
> Namensraum f?r
2023 Apr 14
1
Package Caret
You first have to load the package using the library command.
> library(caret)
Then you can call createDataPartition. e.g.
> data(oil)
> createDataPartition(oilType,2)
HTH,
Eric
On Fri, Apr 14, 2023 at 7:52?AM G?bor Malomsoki <gmalomsoki1980 at gmail.com>
wrote:
> Dear all,
> When i try to use createDataPartition after calling package 'caret', i get
> the
2023 Apr 16
2
Package Caret
Many thanks Bert, now is ok, i did not know that "Namensraum" should mean
a package
Am So., 16. Apr. 2023 um 23:44 Uhr schrieb Bert Gunter <
bgunter.4567 at gmail.com>:
> So update the vctrs package to the latest version first before loading
> R-tools (or the caret package, specifically)?
>
> -- Bert
>
> On Sun, Apr 16, 2023 at 1:57?PM G?bor Malomsoki
>
2010 Sep 09
1
createDataPartition
Dear all,
does anyone know how to define the structure of the required samples using
function createDataPartition, meaning proportions of different types of
variable in the partition?
Smth like this for iris data:
createDataPartition(y = c(setosa = .5, virginica = .3, versicolor = .2),
times = 10, p = .7, list = FALSE)
Thanks a lot for your help.
Regards,
Trafim
[[alternative HTML version
2023 Apr 14
1
Package Caret
Dear all,
When i try to use createDataPartition after calling package 'caret', i get
the message:
"could not find function createDataPartition"
I use:
R-4.2.3 for Windows
RStudio-2023.03.0-386
Do you have any experience with this failure?
Thank you!
Best regards
Gabor
[[alternative HTML version deleted]]
2011 Oct 02
1
difference between createPartition and createfold functions
Hello,
I'm trying to separate my dataset into 4 parts with the 4th one as the
test dataset, and the other three to fit a model.
I've been searching for the difference between these 2 functions in
Caret package, but the most I can get is this--
A series of test/training partitions are created using
createDataPartition while createResample creates one or more bootstrap
samples.
2012 Nov 04
1
sample equal number of cases per class
Dear community
I have a dataframe and want to split it into a learn and a test partition.
However the learnset should be balanced, i.e. each class should have the
same number of cases. I tried and searched a lot, without success so far.
Maybe you can help?
Some example code
*# generate example data
df <- data.frame(class = as.factor(sample(1:3, 20, replace = T)), var1 =
rnorm(20,3), var2 =
2013 Feb 19
0
CARET. Relationship between data splitting trainControl
I have carefully read the CARET documentation at:
http://caret.r-forge.r-project.org/training.html, the vignettes, and
everything is quite clear (the examples on the website help a lot!), but I
am still a confused about the relationship between two arguments to
trainControl:
"method"
"index"
and the interplay between trainControl and the data splitting functions in
caret
2013 Jan 08
0
bagging SVM Ensemble
Dear Sir,
I got a problem with my program. I would like to classify my data using
bagging support vector machine ensemble. I split my data into training data
and test data. For a given data sets TR(X), K replicated training data sets
are first randomly generated by bootstrapping technique with replacement.
Next, Support Vector Mechine (SVM) is applied for each bootstrap data sets.
Finally, the
2013 Mar 02
2
caret pls model statistics
Greetings,
I have been exploring the use of the caret package to conduct some plsda
modeling. Previously, I have come across methods that result in a R2 and
Q2 for the model. Using the 'iris' data set, I wanted to see if I could
accomplish this with the caret package. I use the following code:
library(caret)
data(iris)
#needed to convert to numeric in order to do regression
#I
2023 May 09
1
RandomForest tuning the parameters
Hi Sacha,
On second thought, perhaps this is more the direction that you want ...
X2 = cbind(X_train,y_train)
colnames(X2)[3] = "y"
regr2<-randomForest(y~x1+x2, data=X2,maxnodes=10, ntree=10)
regr
regr2
#Make prediction
predictions= predict(regr, X_test)
predictions2= predict(regr2, X_test)
HTH,
Eric
On Tue, May 9, 2023 at 6:40?AM Eric Berger <ericjberger at gmail.com>