For ordered factor the natural contrast coding would be to parametrize by the succsessive differences between levels, which does not assume equal spacing of factor levels as does the polynomial contrasts (implicitly at least). This requires the contr.cum, which could be: contr.cum <- function (n, contrasts = TRUE) { if (is.numeric(n) && length(n) == 1) levs <- 1:n else { levs <- n n <- length(n) } contr <- array(0, c(n, n), list(levs, levs)) contr[col(contr) <= row(contr)] <- 1 if (contrasts) { if (n < 2) stop(paste("Contrasts not defined for", n - 1, "degrees of freedom")) contr <- contr[, -1, drop = FALSE] } contr } Why is this not part of the standard R distribution ? Bendix Carstensen \\\|/// \\ - - / ( @ @ ) +------------------o00o-(_)-o00o------------------+ | Bendix Carstensen tel: 44 43 87 38 | | Statistiker SDC: 44 43 90 42 | | Steno Diabetes Center fax: 44 43 82 32 | | Niels Steensens Vej 2 e-mail: bxc at novo.dk | | 2820 Gentofte | | Ooooo | +------------------ooooO----( )-----------------+ ( ) ) / \ ( (_/ \_)> -----Original Message----- > From: Thomas Lumley [mailto:thomas at biostat.washington.edu] > Sent: 4. maj 1999 16:00 > To: Simon Bond > Cc: r-help at stat.math.ethz.ch > Subject: Re: [R] surrogate poisson models > > > On Tue, 4 May 1999, Simon Bond wrote: > > > > > Call: > > glm(formula = terms(Fr ~ treatment * age + (treatment + age) * > > repair, keep.order = T), family = poisson, data = overall) > > > > > > > Coefficients: > > Estimate Std. Error z value Pr(>|z|) > > (Intercept) 0.68441 0.28979 2.362 0.01819 * > > treatment -0.28096 0.43716 -0.643 0.52043 > > age.L 0.55595 0.42669 1.303 0.19259 > > age.Q 0.03404 0.42817 0.079 0.93664 > > treatment.age.L -1.31805 0.66813 -1.973 0.04853 * > > treatment.age.Q -0.37452 0.67332 -0.556 0.57805 > > repair.L 1.53962 0.54255 2.838 0.00454 ** > > repair.Q -0.49447 0.40128 -1.232 0.21787 > > treatment.repair.L -3.93138 0.96310 -4.082 4.46e-05 *** > > treatment.repair.Q -0.58937 0.62739 -0.939 0.34753 > > age.L.repair.L -2.08339 0.67093 -3.105 0.00190 ** > > age.Q.repair.L -0.47257 0.59116 -0.799 0.42406 > > age.L.repair.Q -0.04208 0.42881 -0.098 0.92183 > > age.Q.repair.Q -0.64314 0.42800 -1.503 0.13293 > > - --- > > > > > How do you interpret the suffixes .L and .Q in the > summary? I tried > > redefining overall$age_ordered(.... labels=c(...)), to no > avail. When the > > factors are unordered, the suffixes are the labels, but the > fitted model is > > different. > > For an ordered factor polynomial contrasts are used by > default (this can > be set with options(contrasts=)). Polynomial contrasts decompose the > effect of a variable into orthogonal linear, quadratic, > cubic,... terms. > Based on the analysis above you might argue that all the > quadratic terms > are relatively small and not statistically significant and > that the linear > terms capture most of the variation. > > You can see what the contrast matrix looks like by typing eg > R> contr.poly(3) > .L .Q > [1,] -7.071068e-01 0.4082483 > [2,] -7.850462e-17 -0.8164966 > [3,] 7.071068e-01 0.4082483 > > to get the polynomial contrasts for a three-level ordered factor. > > For an unordered factor we use treatment contrasts by default > (that is, > indicator variables for all but one level of the factor). > Any full rank > set of contrasts gives the same model, but the coefficients > are different, > and allow you to answer different questions. The choice of contrasts > depends on which questions you want to answer. > > Thomas Lumley > Assistant Professor, Biostatistics > University of Washington, Seattle-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Prof Brian D Ripley
1999-May-05 12:30 UTC
Ordered factors , was:[R] surrogate poisson models
On Wed, 5 May 1999, BXC wrote:> For ordered factor the natural contrast coding would be to parametrize by > the succsessive differences between levels, which does not assume equal > spacing > of factor levels as does the polynomial contrasts (implicitly at least).What is `natural' is a matter of opinion, of course, and does not facilitate reducing the number of dfs given to the factor. If the levels are known not to be equally spaced, and it is know how they are spaced then I suggest you have rather more than an ordinal varaible, which is what ordered factors are designed for. In that case, use poly on a set of scores for the levels.> This requires the contr.cum, which could be: > > contr.cum <- function (n, contrasts = TRUE) > { > if (is.numeric(n) && length(n) == 1) > levs <- 1:n > else { > levs <- n > n <- length(n) > } > contr <- array(0, c(n, n), list(levs, levs)) > contr[col(contr) <= row(contr)] <- 1 > if (contrasts) { > if (n < 2) > stop(paste("Contrasts not defined for", n - 1, "degrees of > freedom")) > contr <- contr[, -1, drop = FALSE] > } > contr > } > > Why is this not part of the standard R distribution ?Well, there are lots of things we could add, but this one is precisely (as far as I can see) contr.sdif in the MASS library. -- Brian D. Ripley, ripley at stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272860 (secr) Oxford OX1 3TG, UK Fax: +44 1865 272595 -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._