Displaying 20 results from an estimated 2000 matches similar to: "Question on RandomForest in unsupervised mode"
2006 Mar 08
1
Unsupervised RandomForest
Dear all,
I am trying to calculate the proximity matrix for a data set with 16 variables
and 6804 observations using random forests. I have a Pentium 4, 3.00GHz
processor with 1 GB of RAM. When I use the command
randomForest(data.scale,proximity=T)
I get the warning message
Error: cannot allocate vector of size 361675 kb
Is this because I have reached the limit of what my computer is
2010 Apr 25
1
randomForest predictions with new data
Hi
I am new to R, randomForest and I have read about how to use it in your old
mails. I have also run the predictions examples from CRAN. But I still don't
understand how to use it right.
The thing that I don't understand is how to run the result from the
randomForest on one line (post) with newdata to get a good guess. What I
mean is if I put in a new observation of iris how do I
2011 Jan 20
1
randomForest: too many elements specified?
I getting "Error in matrix(0, n, n) : too many elements specified"
while building randomForest model, which looks like memory allocation
error.
Software versions are: randomForest 4.5-25, R version 2.7.1
Dataset is big (~90K rows, ~200 columns), but this is on a big machine (
~120G RAM)
and I call randomForest like this: randomForest(x,y)
i.e. in supervised mode and not requesting
2008 Jul 02
1
randomForest training error
While trying to train randomForest with my dataset, I am ending up with the
following error
Error in randomForest.default(datatrain, classtrain) :
length of response must be the same as predictors
My data looks like:
A,B,C,D,Class
1,2,1,2,cl1
1,2,1,2,cl1
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,12,3,cl2
3,2,1,2,cl2
Actual dataset has around 4000
2005 Mar 23
0
Question on class 1, 2 output for RandomForest
The `1' and `2' columns are the error rates within those classes. E.g., the
last row of the `1' column should correspond to the class.error for "-", and
the last row of the `2' column to the class.error for "+". (I would
have thought that that should be fairly obvious, but I guess not. It mimics
what Breiman and Cutler's Fortran code does.) I suspect
2002 Sep 27
0
RE: new patched version of randomForest
The link from
http://cran.r-project.org/src/contrib/PACKAGES.html#randomForest seems to be
broken. To get the file try
http://cran.r-project.org/src/contrib/randomForest_3.3-4.tar.gz
instead.
> -----Original Message-----
> From: Liaw, Andy [mailto:andy_liaw at merck.com]
> Sent: Thursday, September 26, 2002 8:08 PM
> To: 'r-announce at stat.math.ethz.ch'
> Subject:
2005 Jan 17
0
randomForest: too many element specified?
> From: luk
>
> When I run randonForest with a 169453x5 matrix, I got the
> following message.
>
> Error in matrix(0, n, n) : matrix: too many elements specified
>
> Can you please advise me how to solve this problem?
>
> Thanks,
>
> Lu
1. When asking new questions, please don't reply to other posts.
2. When asking questions like these, please
2011 Feb 15
1
[slightly OT] predict.randomForest and type=”prob”
Dear all ,
I would like to use the function randomForest to predict the probability
of relocation failure of a GPS collar as a function of several
environmental variables x (both factor and numeric: slope, vegetation,
etc.) on a given area. The response variable y is thus success
(0)/failure(1) of the relocation, and the sampling unit is the pixel of
a raster map. My aim is to build a map
2004 Apr 05
3
Can't seem to finish a randomForest.... Just goes and goe s!
When you have fairly large data, _do not use the formula interface_, as a
couple of copies of the data would be made. Try simply:
Myforest.rf <- randomForest(Mydata[, -46], Mydata[,46],
ntrees=100, mtry=7)
[Note that you don't need to set proximity (not proximities) or importance
to FALSE, as that's the default already.]
You might also want to use
2012 Aug 07
0
predicting test dataset response from training dataset with randomForest
Hi
I am new to R so I apologize if this is trivial.
I am trying to predict the resistance or susceptibility of my
sequences to a certain drug with a randomForest function from a file
with amino acids on each of the positions in the protein. I ran the
following:
> library(randomForest)
>
> path <- "C:\\..."
> path2 <- "..."
> name <-
2004 Jan 12
0
new version of randomForest (4.0-7)
Dear R users,
I've just released a new version of randomForest (available on CRAN now).
This version contained quite a number of new features and bug fixes,
compared to version prior to 4.0-x (and few more since 4.0-1).
For those not familiar with randomForest, it's an ensemble
classifier/regression tool. Please see
http://www.math.usu.edu/~adele/forests/ for more detailed information,
2004 Jan 12
0
new version of randomForest (4.0-7)
Dear R users,
I've just released a new version of randomForest (available on CRAN now).
This version contained quite a number of new features and bug fixes,
compared to version prior to 4.0-x (and few more since 4.0-1).
For those not familiar with randomForest, it's an ensemble
classifier/regression tool. Please see
http://www.math.usu.edu/~adele/forests/ for more detailed information,
2010 Dec 11
1
randomForest: help with combine() function
I've built two RF objects (RF1 and RF2) and have tried to combine
them, but I get the following error:
Error in rf$votes + ifelse(is.na(rflist[[i]]$votes), 0, rflist[[i]]$votes) :
non-conformable arrays
In addition: Warning message:
In rf$oob.times + rflist[[i]]$oob.times :
longer object length is not a multiple of shorter object length
Both RF models use the same variables, although
2003 Aug 05
1
na.action in randomForest --- Summary
A few days ago I asked whether there were options other than
na.action=na.fail for the R port of Breiman?s randomForest; the function?s
help page did not say anything about other options.
I have since discovered that a pdf document called ?The randomForest
Package? and made available by Andy Liaw (who made the tool available in
R---thank you) does discuss an option. It is an implementation of
2008 Jul 20
1
confusion matrix in randomForest
I have a question on the output generated by randomForest in classification
mode, specifically, the confusion matrix. The confusion matrix lists the
various classes and how the forest classified each one, plus the
classification error. Are these numbers essentially averages over all the
trees in the forest? If so, is there a way I can get the standard deviation
values out of the randomForest,
2008 Jun 18
2
randomForest outlier
I try to use ?randomForest to find variables that are the most important to
divide my dataset (continuous, categorical variables) in two given groups.
But when I plot the outliers:
plot(outlier(FemMalSex_NAavoid88.rf33, cls=FemMalSex_NAavoid88$Sex),
type="h",col=c("red","green")[as.numeric(FemMalSex_NAavoid88$Sex)])
it seems to me that all my values appear as
2010 Oct 14
0
AMD/Supermicro machine - AS-2022G-URF
Sorry for the long post but I know trying to decide on hardware often want to
see details about what people are using.
I have the following AS-2022G-URF machine running OpenGaryIndiana[1] that I am
starting to use.
I successfully transferred a deduped zpool with 1.x TB of files and 60 or so
zfs filesystems using mbuffer from an old 134 system with 6 drives - it ran at
about 50MB/s or
2007 Sep 05
1
ecological meaning of randomForest vegetation classification?
Hi, everyone,
I haven't found anything similar in the forum, so here's my problem (I'm no
expert in R nor statistics):
I have a data set of 59.000 cases with 9 variables each (fractional
coverage of 9 different plant types, such as deciduous broad-leaved
temperate trees or evergreen tropical trees etc.), which was generated by a
vegetation model.
In order to evaluate the quality of
2012 Feb 01
1
randomForest: proximity for new objects using an existing rf
Ein eingebundener Text mit undefiniertem Zeichensatz wurde abgetrennt.
Name: nicht verf?gbar
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20120201/cc22025d/attachment.pl>
2011 Nov 26
3
Question about randomForest
I've been using the R package randomForest but there is an aspect I
cannot work out the meaning of. After calling the randomForest
function, the returned object contains an element called prediction,
which is the prediction obtained using all the trees (at least that's
my understanding). I've checked that this prediction set has the error
rate as reported by err.rate.
However, if I