similar to: installing contributed programs

Displaying 20 results from an estimated 10000 matches similar to: "installing contributed programs"

2004 Dec 10
1
predict.randomForest
I have a data.frame with a series of variables tagged to a binary response ('present'/'absent'). I am trying to use randomForest to predict present/absent in a second dataset. After a lot a fiddling (using two data frames, making sure data types are the same, lots of testing with data that works such as data(iris)) I've settled on combining all my data into one data.frame
2003 Apr 21
2
randomForest crash?
I am attempting to use randomForests to look for interesting genes in microarray data with 216genes, 2 classes and 52 samples. My data.frame is 52x217 with the last column, V217 being the class(1 or 2). When I try lung.rf <- randomForest(V217 ~ ., data=tlSA216cda, importance= TRUE, proximity = TRUE) the GUI crashes. I am running R-1.6.2 under windo$e98, and most
2008 Feb 25
1
To get more digits in precision of predict function of randomForests
Hi, I am using randomForests for a classification problem. The predict function in the randomForest library, when asked to return the probabilities, has precision of two digits after the decimal. I need at least four digits of precision for the predicted probabilities. How do I achieve this? Thank you, Nagu
2008 Mar 09
1
sampsize in Random Forests
Hi all, I have a dataset where each point is assigned to a class A, B, C, or D. Each point is also assigned to a study site. Each study site is coded with a number ranging between 1-100. This information is stored in the vector studySites. I want to run randomForests using stratified sampling, so I chose the option strata = factor(studySites) But I am not sure how to control the number of
2011 Sep 22
3
How make a x,y dataset from a formula based entry
Hello all, So I am using the (formula entry) method for randomForests: randomForest(y~x1+x2+...+x39+x40,data=xxx,...) but the issue is that some of the items in that package dont take a formula entry - you have to explicitly state the y and x vector: randomForest(x=xxx[,c('x1','x2',...,'x40')],y=xxx[,'y'],...) Now my question is whether there is a function/way
2010 Mar 16
1
Regarding variable importance in the randomForest package
For anyone who is knowledgeable about the randomForest package in R, I have a question: When I look at the variable importance for data, I see that my response variable is included along with my predictor variables. That is, I am getting a MeanDecreaseGini for my response variable, and therefore it seems as though it is being treated as a predictor variable. my code (just in case it helps) :
2010 May 04
1
randomforests - how to classify
Hi, I'm experimenting with random forests and want to perform a binary classification task. I've tried some of the sample codes in the help files and things run, but I get a message to the effect 'you don't have very many unique values in the target - are you sure you want to do regression?' (sorry, don't know exact message but r is busy now so can't check). In
2011 Feb 15
1
[slightly OT] predict.randomForest and type=”prob”
Dear all , I would like to use the function randomForest to predict the probability of relocation failure of a GPS collar as a function of several environmental variables x (both factor and numeric: slope, vegetation, etc.) on a given area. The response variable y is thus success (0)/failure(1) of the relocation, and the sampling unit is the pixel of a raster map. My aim is to build a map
2005 Mar 22
2
Error: Can not handle categorical predictors with more than 32 categories.
Hi All, My question is in regards to an error generated when using randomForest in R. Is there a special way to format the data in order to avoid this error, or am I completely confused on what the error implies? "Error in randomForest.default(m, y, ...) : Can not handle categorical predictors with more than 32 categories." This is generated from the command line: >
2012 Mar 23
1
Memory limits for MDSplot in randomForest package
Hello, I am struggling to produce an MDS plot using the randomForest package with a moderately large data set. My data set has one categorical response variables, 7 predictor variables and just under 19000 observations. That means my proximity matrix is approximately 133000 by 133000 which is quite large. To train a random forest on this large a dataset I have to use my institutions high
2003 Apr 02
4
randomForests predict problem
Hello everybody, I'm testing the randomForest package in order to do some simulations and I get some trouble with the prediction of new values. The random forest computation is fine but each time I try to predict values with the newly created object, I get an error message. I thought I was because NA values in the dataframe, but I cleaned them and still got the same error. What am I
2008 Jul 22
2
randomForest Tutorial
I am new to R and I'd like to use the randomForest package for my thesis (identifying important variables for more detailed analysis with other software). I have found extremely well written and helpful information on the usage of R. Unfortunately it seems to be very difficult to find similarly detailed tutorials for randomForest, and I just can't get it work with the information on
2006 Jul 27
2
memory problems when combining randomForests [Broadcast]
You need to give us more details, like how you call randomForest, versions of the package and R itself, etc. Also, see if this helps you: http://finzi.psych.upenn.edu/R/Rhelp02a/archive/32918.html Andy From: Eleni Rapsomaniki > > Dear all, > > I am trying to train a randomForest using all my control data > (12,000 cases, ~ 20 explanatory variables, 2 classes). > Because
2008 Feb 25
1
Running randomForests on large datasets
Hi, I am trying to run randomForests on a datasets of size 500000X650 and R pops up memory allocation error. Are there any better ways to deal with large datasets in R, for example, Splus had something like bigData library. Thank you, Nagu
2006 Jul 26
3
memory problems when combining randomForests
Dear all, I am trying to train a randomForest using all my control data (12,000 cases, ~ 20 explanatory variables, 2 classes). Because of memory constraints, I have split my data into 7 subsets and trained a randomForest for each, hoping that using combine() afterwards would solve the memory issue. Unfortunately, combine() still runs out of memory. Is there anything else I can do? (I am not using
2009 Apr 28
1
Problem with Random Forest predict
I am trying to run a partialPlot with Random Forest (as I have done many times before). First I run my forest... Cell is a 6 level factor that is the dependent variable - all other variables are predictors, most of these are factors as well. predCell<-randomForest(x=tempdata[-match("Cell",names(tempdata))],y=tempdata$Cell,importance=T) Then I try my partial plot to look at the
2002 Aug 19
4
question about Rpvm, SNOW, etc.
Dear R-devel, Inspired by Michael Li's talk at JSM, I decided to try rpvm and snow on our two linux boxes. It only took me a couple of hours of screwing around to get it working (sooner if I had RTFM). Our setup is: 2 dual PIII-866 Xeons, one with 2GB RDRAM, the other with 1.28GB RDRAM. The first machine is acting as the NIS/NFS server. both /usr and /home are exported to the second
2010 Sep 07
1
RandomForests Limitations? Work Arounds?
Greetings, I want to inquire about the memory limitations of the randomForest package. I am attempting to perform clustering analysis using RF but I keep getting the message that RF cannot allocate a vector of a given size. I am currently using the 32-bit version of R to run this analysis, are there fewer memory issues when using the 64-bit version of R? Mainly I want to be able to run RF on
2008 May 05
1
Count data in random Forest
Hello R-user! I am running R 2.7.0 on a Power Book (Tiger). (I am still R and statistics beginner) I try to find the most important variables to divide my dataset as given in a categorical variable using randomForest. Is randomForest() able to deal with count data? Or is there no difference because only the ranks are used in the trees? Thanks in advance Birgit Birgit Lemcke Institut f?r
2007 Jan 04
3
randomForest and missing data
Does anyone know a reason why, in principle, a call to randomForest cannot accept a data frame with missing predictor values? If each individual tree is built using CART, then it seems like this should be possible. (I understand that one may impute missing values using rfImpute or some other method, but I would like to avoid doing that.) If this functionality were available, then when the trees