Displaying 20 results from an estimated 3000 matches similar to: "Random Forests theoretical question"
2005 Oct 27
1
Repost: Examples of "classwt", "strata", and "sampsize" in randomForest?
"classwt" in the current version of the randomForest package doesn't work
too well. (It's what was in version 3.x of the original Fortran code by
Breiman and Cutler, not the one in the new Fortran code.) I'd advise
against using it.
"sampsize" and "strata" can be used in conjunction. If "strata" is not
specified, the class labels will be used.
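A minimal sketch of how the two arguments might be combined for an unbalanced two-class problem. The data are simulated purely for illustration, and the names `x`, `y`, `n_min` and `fit` are hypothetical. It assumes the randomForest package is installed and skips the fit otherwise.

```r
# Simulated, illustrative data: 450 cases of class "0" and 50 of class "1".
set.seed(1)
y <- factor(rep(c("0", "1"), times = c(450, 50)))
x <- data.frame(
  a = rnorm(500, mean = ifelse(y == "1", 1, 0)),  # weakly informative predictor
  b = rnorm(500)                                  # pure noise predictor
)

# Size of the smaller class, used as the per-stratum draw for each tree.
n_min <- min(table(y))

if (requireNamespace("randomForest", quietly = TRUE)) {
  fit <- randomForest::randomForest(
    x, y,
    strata   = y,                # explicit here; class labels are used if omitted
    sampsize = c(n_min, n_min),  # cases drawn from each stratum, in level order
    ntree    = 200
  )
  print(fit$confusion)           # out-of-bag confusion matrix
}
```

Drawing the same number of cases from each class for every tree is one common way to keep the majority class from dominating. Whether it actually improves predictions for the minority class depends on the data.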
2006 Feb 09
0
How to calculate the generalization error of random forests?
Hi,
Perhaps this is not the proper place to ask this
question but I am out of options, therefore I
apologize in advance.
I want to know how the (upper bound?) generalization
error of the random forest is determined using the
out-of-bag estimate. I read in Breiman's paper that s
and p determine the generalization error:
p(1-s^2)/s^2.
Does s stand for the strength of the individual tree
or of
2002 Apr 02
2
random forests for R
Hi all,
There is now a package available on CRAN that provides an R interface to Leo
Breiman's random forest classifier.
Basically, random forest does the following:
1. Select ntree, the number of trees to grow, and mtry, a number no larger
than the number of variables.
2. For i = 1 to ntree:
3. Draw a bootstrap sample from the data. Call those not in the bootstrap
sample the
2004 Jan 12
0
new version of randomForest (4.0-7)
Dear R users,
I've just released a new version of randomForest (available on CRAN now).
This version contains quite a number of new features and bug fixes,
compared to versions prior to 4.0-x (and a few more since 4.0-1).
For those not familiar with randomForest, it's an ensemble
classifier/regression tool. Please see
http://www.math.usu.edu/~adele/forests/ for more detailed information,
2006 Oct 30
0
how to combine imputed data-sets from mice for classification
Dear R users
I want to combine multiply imputed data-sets generated from mice to do
classification.
However, I have various questions regarding the use of mice library.
For example suppose I want to predict the class in this data.frame:
data(nhanes)
mydf=nhanes
mydf$class="pos"
mydf$class[sample(1:nrow(mydf), size=0.5*nrow(mydf))]="neg"
mydf$class=factor(mydf$class)
First I
2005 Oct 27
1
Repost: Examples of "classwt", "strata", and "sampsize" in randomForest?
Sorry for the repost, but I've really been looking, and can't find any
syntax direction on this issue...
Just browsing the documentation, and searching the list came up short... I
have some unbalanced data and was wondering if, in a "0" v "1"
classification forest, some combo of these options might yield better
predictions when the proportion of one class is low (less
2005 Nov 07
4
R seems to "stall" after several hours on a long series of analyses... where to start?
Not sure where to even start on this.... I'm hoping there's some debugging I
can do...
I have a loop that cycles through several different data sets (same
structure, different info), performing randomForest growth and
predictions... saving out the predictions for later study...
I get about 5 hours in (9%... of the planned iterations.. yikes!) and R just
freezes.
This happens in
2006 Aug 14
0
Random Survival Forest 1.0.0 is now available.
Dear useRs,
Release 1.0.0 of the new R package 'randomSurvivalForest' is now available
on CRAN and its mirrors. The package implements Ishwaran and Kogalur's
Random Survival Forests algorithm for right censored survival data. The
algorithm is closely patterned after Breiman's random forests, but suitably
modified for the survival setting. Some key features are:
o An ensemble
2005 Aug 10
2
Creating new columns inside a loop
Ok, I know R isn't an optimal environment for looping (or so I've heard) but
I have a need to loop through columns of data and create new columns of data
based on calculations within rows...
I'm sure there's a help file, but I'm not sure what search terms to use to
find it! The problem is that these new columns need to have names that I can
later access... Like NewVar1,
2005 Feb 04
2
genetic algorithm
Hi,
I am doing some research on feature selection for classification
problem using genetic algorithm in a wrapper approach. I am wondering
if there is some package which is already built for this purpose. I
was advised before about dprep package but I don't think it used GA
there (if I am wrong, please correct me!)
Thanks,
Ed
2007 Mar 15
2
replacing all NA's in a dataframe with zeros...
I've seen how to replace the NA's in a single column within a data frame
> mydata$ncigs[is.na(mydata$ncigs)]<-0
But this is just one column... I have thousands of columns (!) that I need
to do this, and I can't figure out a way, outside of the dreaded loop, to
replace all NA's in an entire data frame (all vars) without naming each var
separately. Yikes.
I'm racking my
2007 Mar 23
1
memory, speed, and assigning results into new v. existing variable
I have a very large data frame, and I'm doing a conversion of all columns
into factors. Takes a while (thanks to folks here though, for making
faster!), but am wondering about optimization from a memory perspective...
Internally, am I better off assigning into a new data frame, or doing one of
these:
dataframe<-someoperation(dataframe)
It would seem that re-assigning into the same data
2007 Mar 25
3
mythbackend dvb cards...need some guidance...
Trying to get a myth backend running in fedora core 6 and xend 3.0.3 but
having some issues with the DVB ATSC card, I have removed it from Dom0 and
presented it to my vm, but I am unable to scan for channels...this is what
lspci looks like for the card:
lspci -v
00:00.0 Network controller: Techsan Electronics Co Ltd B2C2 FlexCopII DVB
chip / Technisat SkyStar2 DVB card (rev 02)
Subsystem:
2006 Jan 24
0
An Appreciation of Leo Breiman (1928-2005)
We are organizing three projects to honor Leo Breiman this year. First,
Salford Systems' Data Mining Conference will open with a special session
devoted to his scientific contributions. Second, we would like to
construct Leo's scientific "family tree," identifying Leo's students,
collaborators, and others whose work has been influenced in an important
way by Leo's
2005 Nov 07
1
R seems to "stall" after several hours on a long series of analyses... where to start?
You can test if the problem is accumulation in memory registers, which is
certainly what this sounds like. Just do a loop over a reasonably small
number of iterations and store or print the time between each iteration. If
it is memory accumulation, it will run optimally for the first few iterations,
after which the time will increase noticeably (essentially exponentially,
hence ultimately freezes up). If
2011 Feb 15
1
[slightly OT] predict.randomForest and type="prob"
Dear all ,
I would like to use the function randomForest to predict the probability
of relocation failure of a GPS collar as a function of several
environmental variables x (both factor and numeric: slope, vegetation,
etc.) on a given area. The response variable y is thus success
(0)/failure(1) of the relocation, and the sampling unit is the pixel of
a raster map. My aim is to build a map
2008 Aug 05
2
Driver installation for DVB-Hardware & DVBviewer
Hi
I wanted to install the excellent DVBviewer application for viewing DVB-T television under fedora9 / wine.
Under WinXP I need to do the following 3 steps:
1) installing the driver of my technisat DVB-hardware via the Windows control panel
2) installing the software of my technisat DVB-hardware
3) installing the DVBviewer application.
So under fedora9 / wine I don't know how to do the first step
2020 Oct 17
0
[RFC] treewide: cleanup unreachable breaks
On Sat, 2020-10-17 at 09:09 -0700, trix at redhat.com wrote:
> From: Tom Rix <trix at redhat.com>
>
> This is an upcoming change to clean up a new warning treewide.
> I am wondering if the change could be one mega patch (see below) or
> normal patch per file about 100 patches or somewhere half way by collecting
> early acks.
>
> clang has a number of useful, new