similar to: randomForest predictions with new data

Displaying 20 results from an estimated 6000 matches similar to: "randomForest predictions with new data"

2011 Jan 20
1
randomForest: too many elements specified?
I am getting "Error in matrix(0, n, n) : too many elements specified" while building a randomForest model, which looks like a memory allocation error. Software versions are: randomForest 4.5-25, R version 2.7.1. The dataset is big (~90K rows, ~200 columns), but this is on a big machine (~120G RAM) and I call randomForest like this: randomForest(x,y) i.e. in supervised mode and not requesting
2007 Jun 06
0
Question on RandomForest in unsupervised mode
Hi, I attempted to run the randomForest() function on a dataset without predefined classes. According to the manual, running randomForest without a response variable/class labels should result in the function assuming you are running in unsupervised mode. In this case, I understand that my data is all assigned to one class whereas a second synthetic class is made up, which is assigned
2006 Mar 08
1
Unsupervised RandomForest
Dear all, I am trying to calculate the proximity matrix for a data set with 16 variables and 6804 observations using random forests. I have a Pentium 4, 3.00GHz processor with 1 GB of RAM. When I use the command randomForest(data.scale,proximity=T) I get the warning message Error: cannot allocate vector of size 361675 kb Is this because I have reached the limit of what my computer is
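A rough sketch of the arithmetic behind that message, assuming the proximity matrix is stored as a dense n x n matrix of 8-byte doubles (an assumption about the storage, not something stated in the post; the helper name prox_kb is made up for illustration):

```r
# Estimated size, in kilobytes, of a dense n x n matrix of doubles.
# Assumes 8 bytes per cell and ignores R's small per-object overhead.
prox_kb <- function(n) as.numeric(n)^2 * 8 / 1024

prox_kb(6804)   # 6804 observations, as in the post above
# about 361675 kb, the size reported in the error message
```

Under that assumption the requested vector is the proximity matrix itself, so the size grows with the square of the number of observations and not with the number of variables.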
2010 Sep 07
1
RandomForests Limitations? Work Arounds?
Greetings, I want to inquire about the memory limitations of the randomForest package. I am attempting to perform clustering analysis using RF but I keep getting the message that RF cannot allocate a vector of a given size. I am currently using the 32-bit version of R to run this analysis, are there fewer memory issues when using the 64-bit version of R? Mainly I want to be able to run RF on
2023 May 09
1
RandomForest tuning the parameters
Hi Sacha, On second thought, perhaps this is more the direction that you want ... X2 = cbind(X_train,y_train) colnames(X2)[3] = "y" regr2<-randomForest(y~x1+x2, data=X2,maxnodes=10, ntree=10) regr regr2 #Make prediction predictions= predict(regr, X_test) predictions2= predict(regr2, X_test) HTH, Eric On Tue, May 9, 2023 at 6:40 AM Eric Berger <ericjberger at gmail.com>
2004 Apr 18
2
outliers using Random Forest
Hello, Does anybody know if the outscale option of randomForest yields the standardized version of the outlier measure for each case, or are the results only the raw values? Also, I have noticed that this measure presents very high variability. I mean if I repeat the experiment I get very different values for this measure and it is hard to flag the outliers. This does not happen with two other
2008 Dec 26
2
about randomForest
hello, I want to use randomForest to classify a matrix which is 331030×42; the last column is the class signal. I use: Memebers.rf<-randomForest(class~.,data=Memebers,proximity=TRUE,mtry=6,ntree=200) which told me "the error is matrix(0,n,n) set too elements" then I use: Memebers.rf<-randomForest(class~.,data=Memebers,importance=TRUE,proximity=TRUE) which told me "the error is
2006 Jan 03
1
randomForest - classifier switch
Hi I am trying to use randomForest for classification. I am using this code: > set.seed(71) > rf.model <- randomForest(similarity ~ ., data=set1[1:100,], importance=TRUE, proximity=TRUE) Warning message: The response has five or fewer unique values. Are you sure you want to do regression? in: randomForest.default(m, y, ...) > rf.model Call: randomForest(x = similarity ~ .,
2005 Jan 17
0
randomForest: too many element specified?
> From: luk > > When I run randomForest with a 169453x5 matrix, I got the > following message. > > Error in matrix(0, n, n) : matrix: too many elements specified > > Can you please advise me how to solve this problem? > > Thanks, > > Lu 1. When asking new questions, please don't reply to other posts. 2. When asking questions like these, please
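One way to see why matrix(0, n, n) can fail here, sketched under two assumptions that the post does not confirm: that the call allocates an n x n matrix with n equal to the number of rows, and that the R build in use caps vectors at 2^31 - 1 elements (the limit in R before 3.0.0). The helper name exceeds_vector_limit is hypothetical.

```r
# Assumption: matrix(0, n, n) needs n * n cells, and this R build
# cannot hold more than 2^31 - 1 elements in a single vector.
exceeds_vector_limit <- function(n) as.numeric(n)^2 > .Machine$integer.max

exceeds_vector_limit(169453)  # row count from the post above: TRUE
exceeds_vector_limit(40000)   # hypothetical smaller data set: FALSE
```

If those assumptions hold, the error depends on the element count alone, so adding RAM would not help; only a smaller n or skipping the n x n matrix would.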
2012 Mar 23
1
Memory limits for MDSplot in randomForest package
Hello, I am struggling to produce an MDS plot using the randomForest package with a moderately large data set. My data set has one categorical response variable, 7 predictor variables and just under 19000 observations. That means my proximity matrix is approximately 133000 by 133000 which is quite large. To train a random forest on this large a dataset I have to use my institution's high
2010 Jul 14
1
randomForest outlier return NA
Dear R-users, I have a problem with randomForest{outlier}. After running the following code ( that produces a silly data set and builds a model with randomForest ): ####################### library(randomForest) set.seed(0) ## build data set X <- rbind( matrix( runif(n=400,min=-1,max=1), ncol = 10 ) , rep(1,times= 10 ) ) Y <- matrix( nrow = nrow(X), ncol = 1) for( i in (1:nrow(X))){
2004 Mar 31
3
help with the usage of "randomForest"
Dear all, Can anybody give me some hint on the following error msg I got with using randomForest? I have two-class classification problem. The data file "sample" is: ---------------------------------------------------------- udomain.edu udomain.hcs hpclass 1 1.0000 1 not 2 NA 2 not 3 NA 0.8 not 4 NA 0.2 hp 5 NA 0.9 hp ------------------------------------------------------------ The
2003 Apr 21
2
randomForest crash?
I am attempting to use randomForests to look for interesting genes in microarray data with 216 genes, 2 classes and 52 samples. My data.frame is 52x217 with the last column, V217, being the class (1 or 2). When I try lung.rf <- randomForest(V217 ~ ., data=tlSA216cda, importance= TRUE, proximity = TRUE) the GUI crashes. I am running R-1.6.2 under windo$e98, and most
2010 Oct 21
1
RandomForest Proximity Matrix
Greetings R Users! I am posting to inquire about the proximity matrix in the randomForest R-package. I am having difficulty pushing very large data through the algorithm and it appears to hang on the building of the prox matrix. I have read on Dr. Breiman's website that in the original code a choice can be made between using an N x N matrix OR to increase the ability to compute large
2006 Jul 24
2
RandomForest vs. bayes & svm classification performance
Hi This is a question regarding classification performance using different methods. So far I've tried NaiveBayes (klaR package), svm (e1071) package and randomForest (randomForest). What has puzzled me is that randomForest seems to perform far better (32% classification error) than svm and NaiveBayes, which have similar classification errors (45%, 48% respectively). A similar difference in
2008 Jul 02
1
randomForest training error
While trying to train randomForest with my dataset, I am ending up with the following error Error in randomForest.default(datatrain, classtrain) : length of response must be the same as predictors My data looks like: A,B,C,D,Class 1,2,1,2,cl1 1,2,1,2,cl1 3,2,1,2,cl2 3,2,1,2,cl2 3,2,1,2,cl2 3,2,1,2,cl2 3,2,1,2,cl2 3,2,1,2,cl2 3,2,1,2,cl2 3,2,12,3,cl2 3,2,1,2,cl2 Actual dataset has around 4000
2003 Apr 12
5
rpart vs. randomForest
Greetings. I'm trying to determine whether to use rpart or randomForest for a classification tree. Has anybody tested efficacy formally? I've run both and the confusion matrix for rf beats rpart. I've looked at the rf help page and am unable to figure out how to extract the tree. But more than that I'm looking for a more comprehensive user's guide for randomForest including
2008 Jun 15
1
randomForest, 'No forest component...' error while calling Predict()
Dear R-users, While making a prediction using the randomForest function (package randomForest) I'm getting the following error message: "Error in predict.randomForest(model, newdata = CV) : No forest component in the object" Here's my complete code. For reproducing this task, please find my 2 data sets attached ( http://www.nabble.com/file/p17855119/data.rar data.rar ).
2011 Sep 14
1
substitute games with randomForest::partialPlot
I'm having trouble calling randomForest::partialPlot programmatically. It tries to use name of the (R) variable as the data column name. Example: library(randomForest) iris.rf <- randomForest(Species ~ ., data=iris, importance=TRUE, proximity=TRUE) partialPlot(iris.rf, iris, Sepal.Width) # works partialPlot(iris.rf, iris, "Sepal.Width") # works (function(var.name)
2005 Mar 23
0
Question on class 1, 2 output for RandomForest
The `1' and `2' columns are the error rates within those classes. E.g., the last row of the `1' column should correspond to the class.error for "-", and the last row of the `2' column to the class.error for "+". (I would have thought that that should be fairly obvious, but I guess not. It mimics what Breiman and Cutler's Fortran code does.) I suspect