similar to: rpart minimum sample size

Displaying 20 results from an estimated 3000 matches similar to: "rpart minimum sample size"

2013 May 01
1
Multiple Paired T test from large Data Set with multiple pairs
Hi, Assuming that your dataset is similar to the one below: set.seed(25) dat1<- data.frame(Algae.Mass=sample(40:50,10,replace=TRUE),Seagrass.Mass=sample(30:70,10,replace=TRUE),Terrestrial.Mass=sample(80:100,10,replace=TRUE),Other.Mass=sample(40:60,10,replace=TRUE),Site.X.Treatment=rep(c("ALA1A","ALA1U"),each=5),stringsAsFactors=FALSE) library(reshape2)
2003 Apr 02
4
randomForests predict problem
Hello everybody, I'm testing the randomForest package in order to do some simulations and I get some trouble with the prediction of new values. The random forest computation is fine but each time I try to predict values with the newly created object, I get an error message. I thought I was because NA values in the dataframe, but I cleaned them and still got the same error. What am I
2016 Aug 04
2
curl build system is broken and so is mock
On Wed, August 3, 2016 22:53, Alice Wonder wrote: > > I didn't realize ldd was recursive. I may have known that at one > point (been using linux since MK Linux DR3 and building RPMs since > 1999), but have a head injury results in memory problems with > pieces of knowledge I don't frequently use. Most of us have that problem; head injuries or not. -- *** e-Mail
2016 Apr 15
1
Heteroscedasticity in a percent-cover dataset
Hi, I am currently trying to do a GLMM on a dataset with percent cover of seagrass (dep. var) and a suite of explanatory variables including algal (AC) and epiphyte cover (EC), rainfall, temperature and sunshine hours. M2=glmer(SG~AC+EC+TP+SS+RF+(1|Location/fSi/fTr), family=binomial,data=data,nAGQ=1) As the dependent variable is percent cover, I used a binomial error structure. I also have a
2011 Sep 13
1
mvpart analyses with covariables
Hi all, I am fairly new to R and I am trying to run mvpart and create a MRT using explanatory variables and covariables. I've been following the procedures in Numerical Ecoogy with R. The command (no covariables) which works fine - ABUNDTMRT <- mvpart(abundance ~ .,factors,margin=0.08,cp=0,xv="1se",xval=nrow(abundance),xvmult=100,which=4) where abundance is 4th root
2009 May 18
1
Predicting complicated GAMMs on response scale
Hi, I am using GAMMs to show a relationship of temperature differential over time with a model that looks like this:- gamm(Diff~s(DaysPT)+AirToC,method="REML") where DaysPT is time in days since injury and Diff is repeat measures of temperature differentials with regards to injury sites compared to non-injured sites in individuals over the course of 0-24 days. I use the following
2012 Sep 18
2
Formula in a data-frame
Hello all, I'm new in R, and I have a data-frame like this (dput information below): Specie Fooditem Occurrence Volume 1 Schizodon vegetal 1 0.05 2 Schizodon sediment 1 0.60 3 Schizodon vegetal 1 0.15 4 Schizodon alga 1 0.05 5 Schizodon sediment 1 0.90 6 Schizodon
2010 Jun 29
3
merging/intersecting 2 data frames
Dear R People: I have two data frames, a.df and b.df as seen here: > a.df[1:10,] DATE GENDER PATIENT_ID AGE SYNDROME 1 4/16/2009 F 23686 45 RASH ON BODY 2 4/16/2009 F 13840 35 CANT URINATE 3 4/16/2009 M 12895 30 BLURRED VISION 4 4/16/2009 M 18375 33 UNABLE TO VOID 5 4/16/2009 M 2237 44
2006 Jun 20
4
Prototype Array bug??
function CreateTOC(){ var $aTOC = document.getElementsByClassName(''tocitem''); var $temp = ''''; if(isArray($aTOC)){ for($t in $aTOC){ $temp += ''t='' + $t + '', '' + $aTOC[$t].name + ''<BR>''; } $(''test1'').innerHTML = $temp; } } When I run this code
2007 Jan 29
3
comparing random forests and classification trees
Hi, I have done an analysis using 'rpart' to construct a Classification Tree. I am wanting to retain the output in tree form so that it is easily interpretable. However, I am wanting to compare the 'accuracy' of the tree to a Random Forest to estimate how much predictive ability is lost by using one simple tree. My understanding is that the error automatically displayed by the two
2004 Nov 05
3
Resources for optimizing code
I want to eliminate certain observations in a large dataframe (21000x100). I have written code which does this using a binary vector (0=delete obs, 1=keep), but it uses for loops, and so it's slow and in the extreme it causes R to hang for indefinite time periods. I'm looking for one of two things: 1. A document which discusses how to avoid for loops and situations in which it's
2006 Apr 24
5
radrails gone mad
Hi *, I have been using radrails as an IDE for awhile now and we were getting along just fine - but recently it has just gone totally postal?!?? I created a new rails app - when I click to expand the folders there is no framework in place, just the root folder of the app. To add insult to injury two weeks of work also just disapeered - I kid you not just gone without any deletion on my part.
2012 Feb 12
1
how to extract p values in svyglm
summary(result) Call: svyglm(Injury ~ seat, sD, family = quasibinomial(link = "logit")) Survey design: svydesign(~1, prob = NULL, strata = Data[, 1], weights = Data[, 4], data = Data, fpc = ~fPc) Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) -4.256875 0.001421 -2996.7 <2e-16 *** seatbad 0.681504 0.001689 403.4 <2e-16 *** ---
2008 Jul 18
1
Kerberos check logic incorrect
In configure.in, there is a check that does `krb5-config --version | grep -v 1\.2`, making sure there is no "1.2" in the version of Kerberos in use. This is to prevent compiling against MIT Kerberos version 1.2, which is too old for Dovecot. Unfortunately for this idea, Heimdal 1.2.1 is out. And Dovecot builds fine against it. Whoever committed this check also appears to have removed
2007 Apr 27
0
Logistic Regression Question: Risk Threshold
Hi, I am working on problem 2 of Chapter 8 in Data Analysis and Graphics Using R and don't know how to approach the second half of the question: In the data set (an artificial one of 3121 patients, that is similar to a subset of the data analyzed in Stiell et al., 2001) head.injury, obtain a logistic regression model relating clinically.important.brain.injury to other variables. Patients
2004 Oct 06
2
REGISTER timeout problem with Broadvoice
Hi all, We seem to be having an interesting issue maintaining our registrations with Broadvoice. It seems to be related to the fact that Asterisk does not normally include authentication with its first REGISTER attempt. Normally, Broadvoice doesn't care, responds with a 401 Proxy Auth Required, and asterisk answers accordingly. However, if the account is already registered, Broadvoice
2016 Aug 04
1
curl build system is broken and so is mock
On 08/03/2016 06:57 PM, Chris Adams wrote: > Once upon a time, Alice Wonder <alice at domblogger.net> said: >> So when building curl, it links curl against the libcurl in the >> buildroot and not against the libcurl it just compiled? >> >> No other packages I know of do that. > > No, that is not what it does. If you posted the full ldd output like I >
2013 Jan 16
1
Mean calculation by two variables
Hello All, I have a data frame (dput information below) with food item weight for fish species. I need to calculate the Mean proportion by weight of each food item for each specie, as show in solution data frame (dput information below). I use the ddply function (plyr package) in two steps. First calculate the proportion of weight for each individual: step1 = ddply (example, .(ID), transform,
2009 Aug 04
3
matrix
Hi I have dataset that consists of two columns AB 0.102 AC -0.002 BA -0.102 BC 0.270 CA 0.002 CB -0.270 I wish to create a matrix so that I can eventually plot the data. A B C A 1 0.102 -0.002 B -0.102 1 0.27 C 0.002 -0.27 1 Any help or guidance would be greatly
2007 Feb 09
1
Using variable names in for loops - Generating plots semi-automatically from a series of variables Partly solved
Hi, This code is trying to produce a series of graphics files, with plots of male and female disease rates by age, one plot per disease. The dataframe contains a variable 'Age' and a set of variables called 'Male_CVD, Female_CVD,Male_RTA,Female_RTA, and so on. For each disease, I want to pull out the column of data containing the word 'Male' and plot this against age, and then