similar to: Large data set in R

Displaying 20 results from an estimated 20000 matches similar to: "Large data set in R"

2009 Apr 28
1
Bounded memory ANOVA
Hi, I'm using aov() to analyze the data and get the rank of factors. However, this does not work for larger set of data due to memory limitation. Are there any similar function to use aov() on data sets larger than memory similar to biglm ? Thanks, ~ Hardi
2009 Aug 05
2
Durbin-Watson
Hi, I ran an experiment with 3 factors, 2 levels and 200 replications and as I want to test for residuals independence, I used Durbin-Watson in R. I found two functions (durbin.watson and dwtest) and while both are giving the same rho, the p-values are greatly differ: > durbin.watson(mod1) lag Autocorrelation D-W Statistic p-value 1 -0.04431012 2.088610 0.012 Alternative
2009 Apr 15
0
Rank of factors for experiment based on latin hypercube?
Hi, I am running a simulation and have to perform ANOVA to determine the rank of factors. Used the aov() function and it works great for full factorial design. 1. For a massive set of data, I tried using biglm, while it can create the linear model, all the residuals (for assumption validation) are not recorded and the sum of squares are not there, just the estimated regression coefficient, 95%
2004 Aug 02
4
Standard errors from glm
Kia ora list members: I'm having a little difficulty getting the correct standard errors from a glm.object (R 1.9.0 under Windows XP 5.1). predict() will gives standard errors of the predicted values, but I am wanting the standard errors of the mean. To clarify: Assume I have a 4x3x2 factorial with 2 complete replications (i.e. 48 observations, I've appended a dummy set of data at the
2003 Jun 26
1
Correct contrast for unreplicated 2K factorial design
Hi all, I have been trying to reproduce an analysis from Douglas Montgomery?s book on design and analysis of experiments. Table 6.10 of example 6.2 on page 246, gives a table as follows: > NPK <- expand.grid(A=mp,B=mp,C=mp,D=mp) > Rate <- c(45,71,48,65,68,60,80,65,43,100,45,104,75,86,70,96) > filtration <- cbind(NPK,Rate) > filtration A B C D Rate 1 - - - - 45 2
2003 Dec 13
2
half normal probability plot in R
I have generated the effects in a factorial design and now want to put them in a half normal probability plot. Is there an easy way to do this in R??? I can't find the command. Thanks much - Ali Jones _________________________________________________________________ Our best dial-up offer is back. Get MSN Dial-up Internet Service for 6 months @ $9.95/month now!
2011 Nov 14
1
2^k*r (with replications) experimental design question
Hello, I have one replication (r=1 of the 2^k*r) of a 2^k experimental design in the context of performance analysis i.e. my response variables are Throughput and Response Time. I use the "aov" function and the results look ok: > str(throughput) 'data.frame': 286 obs. of 7 variables: $ Time : int 6 7 8 9 10 11 12 13 14 15 ... $ Throughput : int 42 44 33 41
2001 Mar 10
3
Problem With Model.Tables Function
I am using R for the first time in one of my classes. My students have alerted me to a problem for which we have not found an answer. We find that some means returned by the model.tables function are not correct when missing data is present in analysis of variance problems. We have duplicated the problem using R 1.2.0, 1.2.1, and 1.2.2 under Windows 98 and several distributions of Linux (Redhat
2010 Oct 31
1
biglm: how it handles large data set?
I am trying to figure out why 'biglm' can handle large data set... According to the R document - "biglm creates a linear model object that uses only p^2 memory for p variables. It can be updated with more data using update. This allows linear regression on data sets larger than memory." After reading the source code below? I still could not figure out how 'update'
2002 Oct 31
1
Re: gregmisc version 0.7.3 now available
Dear Greg, Thanks for the new release. The decomposition of the SSQ is just what I need! Regards, Martin. Martin Hoyle, School of Life and Environmental Sciences, University of Nottingham, University Park, Nottingham, NG7 2RD, UK Webpage: http://myprofile.cos.com/martinhoyle >>> gregory_r_warnes at groton.pfizer.com 10/30/02 07:16PM >>> Version 0.7.3 of the gregmisc package
2003 Oct 15
2
aov and non-categorical variables
It is unclear to me how aov() handles non-categorical variables. I mean it works and produces results that I would expect, but I was under impression that ANOVA is only defined for categorical variables. In addition, help(aov) says that it "call to 'lm' for each stratum", which I presume means that it calls to lm() for every group of the categorical variable, however I
2005 Feb 21
2
power.anova.test for interaction effects
This question will probably get me in trouble on theoretical grounds, but I will pose it anyway. The situation: I recently ran a field study looking for differences in sugarbeet cultivar tolerance to a specific herbicide. The study was set up so that 37 cultivars were treated with 4 different applications of the herbicide (37*4 factorial). In doing so, we found that the interaction effect was
2012 Jul 04
2
Difference between two-way ANOVA and (two-way) ANCOVA
Hi! as my subject says I am struggling with the different of a two-way ANOVA and a (two-way) ANCOVA. I found the following examples from this webpage: http://www.statmethods.net/stats/anova.html # One Way Anova (Completely Randomized Design) fit <- aov(y ~ A, data=mydataframe) # Randomized Block Design (B is the blocking factor) fit <- aov(y ~ A + B, data=mydataframe) # Two Way
2011 Aug 06
1
multcomp::glht() doesn't work for an incomplete factorial using aov()?
Hi R users, I sent a message yesterday about NA in model estimates ( http://r.789695.n4.nabble.com/How-set-lm-to-don-t-return-NA-in-summary-td3722587.html). If I use aov() instead of lm() I get no NA in model estimates and I use gmodels::estimable() without problems. Ok! Now I'm performing a lot of contrasts and I need correcting for multiplicity. So, I can use multcomp::glht() for this.
2012 Jun 25
2
Fractional Factorial - Wrong values using lm-function
Hello. I'm a new user of R, and I have a question regarding the use of aov and lm-functions. I'm doing a fractional factorial experiment at our production site, and I need to familiarize myself with the analysis before I conduct the experiment. I've been working my way through the examples provided at http://www.itl.nist.gov/div898/handbook/pri/section4/pri472.htm
2009 Aug 07
5
Cantidad de datos
Buenas, tengo 30 000 000 de datos, y el R no me deja trabajar, como podria corregir eso problema para trabajar con los 30 000 000, mintras es estoy trabajando cada 1 000 000 pero no es igual. Espero puedan ayudarme saludos -- Manuel Bonilla
2009 Apr 22
5
large factorials
I am working on a project that requires me to do very large factorial evaluations. On R the built in factorial function and the one I created both are not able to do factorials over 170. The first gives an error and mine return Inf. Is there a way to have R do these larger calculations (the calculator in accessories can do 10000 factorial and Maple can do even larger) -- View this message in
2006 Aug 18
2
4^2 factorial help
To whom it may concern: I am trying a factorial design a system of mine that has two factors. Each factor was set at four different levels, with one replication for each of the combinations. My data is as follows: A B Response 1 600 2.5 0.0257 2 600 2.5 0.0254 3 600 5
2003 Sep 20
1
factorial design
Hello all, I´m trying to study a factorial design, but I can´t understand why did Df, Sum Sq and Mean Sq of residuals alter when I Split the interaction? I think that Split the interaction must not alter the residuals. Am I doing something wrong? Could anyone help me? My data and functions I tried are: Y<-c(196,213,183, 192,253,199, 251,331,276,
2005 Feb 18
1
Two-factorial Huynh-Feldt-Test
Hi, I'm currently working on porting some SAS scripts to R, and hence need to do the same calculation (and get the same results) as SAS in order to make the transition easier for users of the script. In the script, I'm dealing with a two-factorial repeated-measures anova. I'll try to give you a short overview of the setup: - two between-cell factors: facBetweenROI (numbering