thr3ads.net - similar to: "datasets"

Displaying 20 results from an estimated 200 matches similar to: "datasets"

2011 Oct 09

sapply(pred,cor,y=resp)

Hello. I am wondering why I am getting NA for all in cors=sapply(pred,cor,y=resp). I suppose that each column in pred has NAs in them. Is there some way to fix this? Thanks > str(pred) 'data.frame': 200 obs. of 13 variables: $ mnO2: num 9.8 8 11.4 4.8 9 13.1 10.3 10.6 3.4 9.9 ... $ Cl : num 60.8 57.8 40 77.4 55.4 ... $ NO3 : num 6.24 1.29 5.33 2.3 10.42 ... $ NH4 : num 578

How to re-combine values based on an index?

2012 Dec 02

How to re-combine values based on an index?

I am able to split my df into two like so: dataset <- trainset index <- 1:nrow(dataset) testindex <- sample(index, trunc(length(index)*30/100)) trainset <- dataset[-testindex,] testset <- dataset[testindex,-1] So I have the index information, how could I re-combine the data using that back into a single df? I tried what I thought might work, but failed with:

question: corCAR1 in lme

2005 May 25

question: corCAR1 in lme

Hello all, I am trying to use lme to examine how a response variable (Chla) changes over time in different treatments (2 Temp & 2 Light levels). Within each treatment combination, there are two replicate tanks (each with unique TankID) with coral fragments in them. All tanks are subject to the same environment until Time=0, when treatments are imposed, and Chla is measured for each

conversion from SAS

2005 Jul 28

conversion from SAS

Hi, I wonder if anybody could help me in converting this easy SAS program into R. (I'm still trying to do that!) PROC IMPORT OUT= WORK.CHLA_italian DATAFILE= "C:\Documents and Settings\carleal\My Documents\REBECCA\stat\sas\All&nutrients.xls" DBMS=EXCEL2000 REPLACE; GETNAMES=YES; RUN; data chla_italian; set chla_italian;

bagging SVM Ensemble

2013 Jan 08

bagging SVM Ensemble

Dear Sir, I got a problem with my program. I would like to classify my data using bagging support vector machine ensemble. I split my data into training data and test data. For a given data sets TR(X), K replicated training data sets are first randomly generated by bootstrapping technique with replacement. Next, Support Vector Mechine (SVM) is applied for each bootstrap data sets. Finally, the

Error term in aov

2004 Mar 29

Error term in aov

Hi, I'm trying to analyse a hierachical design and am running into some trouble. Clearly I don't fully understand "Error" and I was hoping someone could set me straight. We measure percentage algal cover in each of 5 quadrats from each of 16 patches where 4 treatments are randomly allocated to a patch. First suppose patches are coded 1 to 16. then the following gives the

Multiple Paired T test from large Data Set with multiple pairs

2013 May 01

Multiple Paired T test from large Data Set with multiple pairs

Hi, Assuming that your dataset is similar to the one below: set.seed(25) dat1<- data.frame(Algae.Mass=sample(40:50,10,replace=TRUE),Seagrass.Mass=sample(30:70,10,replace=TRUE),Terrestrial.Mass=sample(80:100,10,replace=TRUE),Other.Mass=sample(40:60,10,replace=TRUE),Site.X.Treatment=rep(c("ALA1A","ALA1U"),each=5),stringsAsFactors=FALSE) library(reshape2)

SVM accuracy question

2011 Sep 26

SVM accuracy question

Hi, I'm working with support vector machine for the classification purpose, and I have a problem about the accuracy of prediction. I divided my data set in train (1/3 of enteire data set) and test (2/3 of data set) using the "sample" function. Each time I perform the svm model I obtain different result, according with the result of the "sample" function. I would like

Help with this error "kernlab class probability calculations failed; returning NAs"

2012 Nov 29

Help with this error "kernlab class probability calculations failed; returning NAs"

I have never been able to get class probabilities to work and I am relatively new to using these tools, and I am looking for some insight as to what may be wrong. I am using caret with kernlab/ksvm. I will simplify my problem to a basic data set which produces the same problem. I have read the caret vignettes as well as documentation for ?train. I appreciate any direction you can give. I

svm(e1071) and scaling of weights

2011 Feb 23

svm(e1071) and scaling of weights

I expected, that I will get the same prediction, if I multiply the weights for all classes with a constant factor, but I got different results. Please look for the following code. > library(e1071) > data(Glass, package = "mlbench") > index <- 1:nrow(Glass) > testindex <- sample(index, trunc(length(index)/5)) > testset <- Glass[testindex, ] > trainset <-

R classification

2011 Oct 19

R classification

hello, i am so glad to write you. i am dealing now with writing my M.Sc in Applied Statistics thesis, titled " Data Mining Classifiers and Predictive Models Validation and Evaluation". I am planning to compare several DM classifiers like "NN, kNN, SVM, Dtree, and Naïve Bayes" according to their Predicting accuracy, interpretability, scalability, and time consuming etc. I have

Multivariate regression tree: problems with surrogate splits

2007 Oct 23

Multivariate regression tree: problems with surrogate splits

R helpers, I am working with the R program performing multivariate regression trees (MRT). I have a matrix with species and environmental variables saved as a CSV file (sprot_matrix.csv), I have 42 species and 8 environmental variables (SECCHI+PH+TA+PTOT+NTOT+CHLA+AREA+ MEANDEP) for 104 samples Title SpA SpB SpC SpD Varible1 Variable2 Variable3 Sample1 Sample 2

data after write() is off by 1 ?

2012 Nov 20

data after write() is off by 1 ?

I am new to R, so I am sure I am making a simple mistake. I am including complete information in hopes someone can help me. Basically my data in R looks good, I write it to a file, and every value is off by 1. Here is my flow: > str(prediction) Factor w/ 10 levels "0","1","2","3",..: 3 1 10 10 4 8 1 4 1 4 ... - attr(*, "names")= chr

Stepwise Multiple Regression

2008 Mar 03

Stepwise Multiple Regression

Dear R, I am using stepwise multiple glm to select a subset of variables using the step command. My question is how do you calculate the percentage variance for each parameter in the reduced model. An example of my code thus far is, fit<-glm(Abalone~offset(Area)+Sessile invertebrates+Bare rock+ NCA + Encrusting red algae, family = poisson ()) step(fit) fit1<-glm(Abalone~offset(Area)+Bare

lme model with replicates within a random factor

2007 Nov 02

lme model with replicates within a random factor

Dear all, I wonder if anyone can help me with specifying a right model for my analysis. I am a beginner to lme methods. I was unfortunately not able to find a solution to my problem on my own. Data structure: I have sampled monthly 6 basins during two hydrological cycles, and I have taken several (2 to 4) samples (“replicate”) for each basin and month. I’m trying to relate Shannon diversity

partial correlation

2008 Dec 08

partial correlation

Hej! I have the following problem: I would like to do partial correlations on non-parametric data. I checked "pcor" (Computes the partial correlation between two variables given a set of other variables) but I do not know how to change to a Spearman Rank Correlation method [pcor(c("BCDNA","ImProd","A365"),var(PCor))] Here''s a glimpse of

log transforming predictor variables in a binomial GAM?

2013 Dec 16

log transforming predictor variables in a binomial GAM?

Hi all, I am applying a Presence/absence Generalized additive model to model the distribution of marine algae species in R. I have found that log transforming the environmental variables improves the explained deviance of the model considerably. While log transforming is common practice in GLM, I have been unable to find any papers where this is performed in a GAM. Im wondering whether this

strucchange: breakpoints in inequally spaced data

2004 Jul 16

strucchange: breakpoints in inequally spaced data

Hello, we want to identify breakpoints (different phases) in environmental data, algae cell counts of three years with intervals between 7 and 30 days (N=40). We found that breakpoints(cells ~1) works great and identifies 5 very good breaks, however we are uncertain about these, because the data are unequally spaced. Is there a way to include the information about the measurement intervals,

Mixed models quantile regression

2008 Feb 06

Mixed models quantile regression

Dear R, I have a question concerning quantile regression models. I am using the quantile regression model to test the relationship between abalone and the percentage cover of algae etc at different sites and depths. An example is fit<-rq(abalone~%coversessileinvertebrates+factor(Depth)+factor(Site),tau=0.7) In this model depth is fixed and site is random. My question is, is it possible

Linear mixed-effect models and model selection

2012 Jul 24

Linear mixed-effect models and model selection

Hi, I am looking at the effect of allelochemicals produced by two freshwater macrophyte species on two different algal species at different days. I am comparing the effect of each macrophyte on each algae at each day. I received help from someone doing the LMEM (Linear mixed-effect models) and he told me to do ANOVA to analyse the LMEM. However, I received these feedback from my examinor; 1. An

similar to: datasets