Displaying 20 results from an estimated 10000 matches similar to: "random forest and vegetation data"
2010 Oct 22
2
Random Forest AUC
Guys,
I used Random Forest with a couple of data sets I had to predict for binary
response. In all the cases, the AUC of the training set is coming to be 1.
Is this always the case with random forests? Can someone please clarify
this?
I have given a simple example, first using logistic regression and then
using random forests to explain the problem. AUC of the random forest is
coming out to be
2008 Oct 15
2
apply model predictions over larger area with predict()
Dear all,
I have built glm models based on presences/absences and a number of
predictor maps and would like to compute habitat suitability based on
the modelled coefficients.
I thought this is pretty straight forward and wanted to use predict()
and supply the new data in a data frame, with one column for each
predictor.
However, I do get an error msg warning me that the number of rows for
2007 Jul 06
5
Clustering nested data
Hi all,
I am interested in performing a cluster analysis on ecological data from
forests in Pennsylvania. I would like to develop definitions for forest
types (red maple forests, upland oak forests, etc.(AH AR in attached table))
based on measured attributes in each forest type. To do this, I would like
to 'draw clusters' around forest types based on information from various
tree
2003 Nov 24
2
Questions on Random Forest
Hi, everyone,
I am a newbie on R. Now I want to do image pixel classification by random
forest. But I has not a clear understanding on random forest. Here is some
question:
As for an image, for example its size is 512x512 and has only one variable
-- gray level. The histogram of the image looks like mixture Gaussian Model,
say Gauss distribution (u1,sigma1), (u2,sigma2),(u3,sigma3). And a
2009 Oct 19
1
Random Forest - partial dependence plot
Hi everybody,
I used random forest regression to explain the patterns of species
richness and a bunch of climate variables (e.g. Temperature,
precipitation, etc.) All are continuos variables. My results are
really interesting and my model explained 96,7% of the variance.
Now I am trying to take advantage of the importance variable
function and depicts the observed patterns using partial
2009 Feb 26
1
Random Forest confusion matrix
Dear R users,
I have a question on the confusion matrix generated by function randomForest.
I used the entire data
set to generate the forest, for example:
> print(iris.rf)
Call:
randomForest(formula = Species ~ ., data = iris, importance = TRUE,
keep.forest = TRUE)
confusion
setosa versicolor virginica class.error
setosa 50 0 0 0.00
2009 Jun 24
1
Random Forest Variable Importance Interpretation
Hi
I am trying to explore the use of random forests for regression to
identify the important environmental/microclimate variables involved in
predicting the abundance of a species in different habitats, there are
approx 40 variable and between 200 and 500 data points depending on the
dataset. I have successfully used the randomForest package to conduct
the analysis and looked at the %IncMSE
2005 Jul 12
3
adding a factor column based on levels of another factor
Hi R users
Does anyone out there have a better/quicker way of adding a factor column
to a data frame based on levels of another factor?
I have a (large) data frame consisting of records for individual plants,
each represented by a unique ID number. The species of each plant is
indicated in the column "species", which is a factor column with many
different levels (species). There are
2007 Sep 05
1
ecological meaning of randomForest vegetation classification?
Hi, everyone,
I haven't found anything similar in the forum, so here's my problem (I'm no
expert in R nor statistics):
I have a data set of 59.000 cases with 9 variables each (fractional
coverage of 9 different plant types, such as deciduous broad-leaved
temperate trees or evergreen tropical trees etc.), which was generated by a
vegetation model.
In order to evaluate the quality of
2005 Sep 08
2
Re-evaluating the tree in the random forest
Dear mailinglist members,
I was wondering if there was a way to re-evaluate the
instances of a tree (in the forest) again after I have
manually changed a splitpoint (or split variable) of a
decision node. Here's an illustration:
library("randomForest")
forest.rf <- randomForest(formula = Species ~ ., data
= iris, do.trace = TRUE, ntree = 3, mtry = 2,
norm.votes = FALSE)
# I am
2010 Jul 20
1
logistic regression with repeated measures for species-habitat associations
Hi,
I have three years of species presence/absence data for a set of ~100
ponds, with a list of associated habitat characteristics (park, vegetation,
hydroperiod, etc.). The datasets differ slightly by year because some ponds
were dry in some years and not in others. My goal is to look at habitat
associations for a couple focal species.
For each year, I have a data table that looks like
2011 Nov 21
2
errors with lme4
Dear list,
i'm a new R user, so I apologize if the topic is already being addressed
by some other user.
I'm trying to determine if the reproductive success of a species of bird
is related to a list of covariates.
These are the covariates:
? elev: elevation of nest (meters)
? seadist: distance from the sea (meters)
? meanterranova: records of temperature
? minpengS1: records
2010 Dec 01
1
Poisson GLM warning message
Hi,
I receive the following warning message when I run a poisson GLM in R:
"glm.fit: fitted rates numerically 0 occurred"
The model summary is shown below. The variable 'Species' consists of
counts of different species ranging from 0 to 4. I suspect this may
have something to do with the warning message but I'm not sure. Can
anybody help?
Thank you!
Anna
Call:
2011 Feb 25
4
means, SD's and tapply
I'm trying to use tapply to output means and SD or SE for my data but
seem to be limited by how many times I can subset it. Here's a snippet
of my data
> stems353[1:10,]
Time DataSource Plot Elevation Aspect Slope Type Species
SizeClass Stems
1 Modern Cameron 70F221 1730 ESE 20 Conifer ABCO
Class1 3
2 Modern Cameron 70F221 1730
2002 Feb 26
1
Cross-tabulation of data from database
I am quite new to R, so please bear over with me if I have problems with the
R terminology.
I want to (try to) use R for some analyses within vegetation ecology, using
the vegan package. I have my data in a postgresql database, and I manage to
get them into R as a dataframe with columns for respectively: Name of the
analysed m2, Name of the species, coverage of species in the square in %. I
2010 Mar 01
1
Random Forest prediction questions
Hi,
I need help with the randomForest prediction. i run the folowing code:
> iris.rf <- randomForest(Species ~ ., data=iris,
> importance=TRUE,keep.forest=TRUE, proximity=TRUE)
> pr<-predict(iris.rf,iris,predict.all=T)
> iris.rf$votes[53,]
setosa versicolor virginica
0.0000000 0.8074866 0.1925134
> table(pr$individual[53,])/500
versicolor virginica
0.928
2008 Jan 25
4
Function for translation of a list into a matrix as used by ordination?
Hello.
Does anyone know of an existing function that takes a list in the form of:
Plot1 Species1 Abundance1
Plot1 Species2 Abundance2
Plot2 Species1 Abundance1
Plot2 Species3 Abundance3
.
.
.
PlotN SpeciesN AbundanceN
and translates into a matrix in the form of
Species1 Species2.... SpeciesN
Plot1 Abundance1 Abundance2... AbundanceN
Plot2 Abundance1
2011 May 24
1
Random Forest
I'm analyzing data using Random Forest Regression. For some of the
species I am analyzing, the percent variation explained is negative.
Could you please explain to me what that means? If you need more
information, please let me know. Thank you.
Sincerely,
Rachel Unger
[[alternative HTML version deleted]]
2011 Nov 24
4
I cannot get species scores to plot with site scores in MDS when I use a distance matrix as input. Problems with NA's?
Hi, First I should note I am relatively new to R so I would appreciate answers that take this into account.
I am trying to perform an MDS ordination using the function ?metaMDS? of the ?vegan? package. I want to ordinate species according to a set of functional traits. ?Species? here refers to ?sites? in traditional vegetation analyses while ?traits? here correspond to ?species? in such
2011 Feb 08
1
Grouping by factors in R
I'm having a hard time figuring out how to group results by certain
factors in R. I have data with the following headings:
[1] "Time" "Plot" "LatCat" "Elevation" "ElevCat"
"Aspect" "AspCat" "Slope"
[9] "SlopeCat" "Species" "SizeClass" "Stems"
and