similar to: datasets

Displaying 20 results from an estimated 200 matches similar to: "datasets"

2011 Oct 09
1
sapply(pred,cor,y=resp)
Hello. I am wondering why I am getting NA for all in cors=sapply(pred,cor,y=resp). I suppose that each column in pred has NAs in them. Is there some way to fix this? Thanks > str(pred) 'data.frame':   200 obs. of  13 variables:  $ mnO2: num  9.8 8 11.4 4.8 9 13.1 10.3 10.6 3.4 9.9 ...  $ Cl  : num  60.8 57.8 40 77.4 55.4 ...  $ NO3 : num  6.24 1.29 5.33 2.3 10.42 ...  $ NH4 : num  578
2012 Dec 02
2
How to re-combine values based on an index?
I am able to split my df into two like so: dataset <- trainset index <- 1:nrow(dataset) testindex <- sample(index, trunc(length(index)*30/100)) trainset <- dataset[-testindex,] testset <- dataset[testindex,-1] So I have the index information, how could I re-combine the data using that back into a single df? I tried what I thought might work, but failed with:
2005 May 25
1
question: corCAR1 in lme
Hello all, I am trying to use lme to examine how a response variable (Chla) changes over time in different treatments (2 Temp & 2 Light levels). Within each treatment combination, there are two replicate tanks (each with unique TankID) with coral fragments in them. All tanks are subject to the same environment until Time=0, when treatments are imposed, and Chla is measured for each
2005 Jul 28
1
conversion from SAS
Hi, I wonder if anybody could help me in converting this easy SAS program into R. (I'm still trying to do that!) PROC IMPORT OUT= WORK.CHLA_italian DATAFILE= "C:\Documents and Settings\carleal\My Documents\REBECCA\stat\sas\All&nutrients.xls" DBMS=EXCEL2000 REPLACE; GETNAMES=YES; RUN; data chla_italian; set chla_italian;
2013 Jan 08
0
bagging SVM Ensemble
Dear Sir, I got a problem with my program. I would like to classify my data using bagging support vector machine ensemble. I split my data into training data and test data. For a given data sets TR(X), K replicated training data sets are first randomly generated by bootstrapping technique with replacement. Next, Support Vector Mechine (SVM) is applied for each bootstrap data sets. Finally, the
2004 Mar 29
0
Error term in aov
Hi, I'm trying to analyse a hierachical design and am running into some trouble. Clearly I don't fully understand "Error" and I was hoping someone could set me straight. We measure percentage algal cover in each of 5 quadrats from each of 16 patches where 4 treatments are randomly allocated to a patch. First suppose patches are coded 1 to 16. then the following gives the
2013 May 01
1
Multiple Paired T test from large Data Set with multiple pairs
Hi, Assuming that your dataset is similar to the one below: set.seed(25) dat1<- data.frame(Algae.Mass=sample(40:50,10,replace=TRUE),Seagrass.Mass=sample(30:70,10,replace=TRUE),Terrestrial.Mass=sample(80:100,10,replace=TRUE),Other.Mass=sample(40:60,10,replace=TRUE),Site.X.Treatment=rep(c("ALA1A","ALA1U"),each=5),stringsAsFactors=FALSE) library(reshape2)
2011 Sep 26
1
SVM accuracy question
Hi, I'm working with support vector machine for the classification purpose, and I have a problem about the accuracy of prediction. I divided my data set in train (1/3 of enteire data set) and test (2/3 of data set) using the "sample" function. Each time I perform the svm model I obtain different result, according with the result of the "sample" function. I would like
2012 Nov 29
1
Help with this error "kernlab class probability calculations failed; returning NAs"
I have never been able to get class probabilities to work and I am relatively new to using these tools, and I am looking for some insight as to what may be wrong. I am using caret with kernlab/ksvm. I will simplify my problem to a basic data set which produces the same problem. I have read the caret vignettes as well as documentation for ?train. I appreciate any direction you can give. I
2011 Feb 23
0
svm(e1071) and scaling of weights
I expected, that I will get the same prediction, if I multiply the weights for all classes with a constant factor, but I got different results. Please look for the following code. > library(e1071) > data(Glass, package = "mlbench") > index <- 1:nrow(Glass) > testindex <- sample(index, trunc(length(index)/5)) > testset <- Glass[testindex, ] > trainset <-
2011 Oct 19
0
R classification
hello, i am so glad to write you. i am dealing now with writing my M.Sc in Applied Statistics thesis, titled " Data Mining Classifiers and Predictive Models Validation and Evaluation". I am planning to compare several DM classifiers like "NN, kNN, SVM, Dtree, and Naïve Bayes" according to their Predicting accuracy, interpretability, scalability, and time consuming etc. I have
2007 Oct 23
1
Multivariate regression tree: problems with surrogate splits
R helpers, I am working with the R program performing multivariate regression trees (MRT). I have a matrix with species and environmental variables saved as a CSV file (sprot_matrix.csv), I have 42 species and 8 environmental variables (SECCHI+PH+TA+PTOT+NTOT+CHLA+AREA+ MEANDEP) for 104 samples Title SpA SpB SpC SpD Varible1 Variable2 Variable3 Sample1 Sample 2
2012 Nov 20
3
data after write() is off by 1 ?
I am new to R, so I am sure I am making a simple mistake. I am including complete information in hopes someone can help me. Basically my data in R looks good, I write it to a file, and every value is off by 1. Here is my flow: > str(prediction) Factor w/ 10 levels "0","1","2","3",..: 3 1 10 10 4 8 1 4 1 4 ... - attr(*, "names")= chr
2008 Mar 03
1
Stepwise Multiple Regression
Dear R, I am using stepwise multiple glm to select a subset of variables using the step command. My question is how do you calculate the percentage variance for each parameter in the reduced model. An example of my code thus far is, fit<-glm(Abalone~offset(Area)+Sessile invertebrates+Bare rock+ NCA + Encrusting red algae, family = poisson ()) step(fit) fit1<-glm(Abalone~offset(Area)+Bare
2007 Nov 02
1
lme model with replicates within a random factor
Dear all, I wonder if anyone can help me with specifying a right model for my analysis. I am a beginner to lme methods. I was unfortunately not able to find a solution to my problem on my own. Data structure: I have sampled monthly 6 basins during two hydrological cycles, and I have taken several (2 to 4) samples (“replicate”) for each basin and month. I’m trying to relate Shannon diversity
2008 Dec 08
1
partial correlation
Hej! I have the following problem: I would like to do partial correlations on non-parametric data. I checked "pcor" (Computes the partial correlation between two variables given a set of other variables) but I do not know how to change to a Spearman Rank Correlation method [pcor(c("BCDNA","ImProd","A365"),var(PCor))] Here''s a glimpse of
2013 Dec 16
1
log transforming predictor variables in a binomial GAM?
Hi all, I am applying a Presence/absence Generalized additive model to model the distribution of marine algae species in R. I have found that log transforming the environmental variables improves the explained deviance of the model considerably. While log transforming is common practice in GLM, I have been unable to find any papers where this is performed in a GAM. Im wondering whether this
2004 Jul 16
1
strucchange: breakpoints in inequally spaced data
Hello, we want to identify breakpoints (different phases) in environmental data, algae cell counts of three years with intervals between 7 and 30 days (N=40). We found that breakpoints(cells ~1) works great and identifies 5 very good breaks, however we are uncertain about these, because the data are unequally spaced. Is there a way to include the information about the measurement intervals,
2008 Feb 06
1
Mixed models quantile regression
Dear R, I have a question concerning quantile regression models. I am using the quantile regression model to test the relationship between abalone and the percentage cover of algae etc at different sites and depths. An example is fit<-rq(abalone~%coversessileinvertebrates+factor(Depth)+factor(Site),tau=0.7) In this model depth is fixed and site is random. My question is, is it possible
2012 Jul 24
1
Linear mixed-effect models and model selection
Hi, I am looking at the effect of allelochemicals produced by two freshwater macrophyte species on two different algal species at different days. I am comparing the effect of each macrophyte on each algae at each day. I received help from someone doing the LMEM (Linear mixed-effect models) and he told me to do ANOVA to analyse the LMEM. However, I received these feedback from my examinor; 1. An