Displaying 2 results from an estimated 2 matches for "outertrain".
Did you mean:
outertrack
2018 Apr 14
2
Efficient way to subset rows in R for dataset with 10^7 columns
I have a data.table with dimensions 100 by 10^7.
When I do
trainIndex <-
caret::createDataPartition(
df$status,
p = .9,
list = FALSE,
times = 1
)
outerTrain <- df[trainIndex]
outerTest <- df[-trainIndex]
Subsetting the rows of df takes over 20 minutes.
What is the best way to efficiently subset this?
Thanks!
[[alternative HTML version deleted]]
2018 Apr 14
0
Efficient way to subset rows in R for dataset with 10^7 columns
...7.
>> >
>> >When I do
>> >
>> > trainIndex <-
>> > caret::createDataPartition(
>> > df$status,
>> > p = .9,
>> > list = FALSE,
>> > times = 1
>> > )
>> > outerTrain <- df[trainIndex]
>> > outerTest <- df[-trainIndex]
>> >
>> >Subsetting the rows of df takes over 20 minutes.
>> >
>> >What is the best way to efficiently subset this?
>> >
>> >Thanks!
>> >
>> > [[alter...