thr3ads.net - R packages - [R-pkgs] MANOVA for collinear responses with rotation testing (ffmanova) + Synthetic data (RegSDC) [Jan 2019]

If this information is useful, please help other people find it:
Share via:

Langsrud, Øyvind

2019-Jan-23 22:10 UTC

[R-pkgs] MANOVA for collinear responses with rotation testing (ffmanova) + Synthetic data (RegSDC)

Hi,

Package ffmanova originally released on CRAN in 2006 has now been updated to
Version 1.0. It has been a stable working horse and the changes are cosmetic.

Package RegSDC (Version: 0.2.0) is a new package on CRAN. The two packages are
theoretically related and both make use of conditioned multivariate normal
simulations - rotation testing in ffmanova and synthetic data generation in
RegSDC.

Package ffmanova is mainly meant as a package for multivariate responses, but it
also involves a general contribution to ANOVA testing in linear models. The
approach to sums of squares (Type II*) is invariant to scale changes of
continuous variables and pitfalls are avoided. Try to run code below.


ffmanova: Fifty-Fifty MANOVA

General linear modeling with multiple responses (MANCOVA). An overall p-value
for each model term is calculated by the 50-50 MANOVA method by Langsrud (2002)
<https://10.1111/1467-9884.00320>, which handles collinear responses.
Rotation testing, described by Langsrud (2005)
<https://10.1007/s11222-005-4789-5>, is used to compute adjusted single
response p-values according to familywise error rates and false discovery rates
(FDR). The approach to FDR is described in the appendix of Moen et al. (2005)
<https://10.1128/AEM.71.4.2086-2094.2005>. Unbalanced designs are handled
by Type II sums of squares as argued in Langsrud (2003)
<https://10.1023/A:1023260610025>. Furthermore, the Type II philosophy is
extended to continuous design variables as described in Langsrud et al. (2007)
<https://10.1080/02664760701594246>. This means that the method is
invariant to scale changes and that common pitfalls are avoided.


RegSDC: Information Preserving Regression-Based Tools for Statistical Disclosure
Control

Information Preserving Regression-Based Tools for Statistical Disclosure Control
Implementation of the methods described in the paper with the above title:
Langsrud, ?. (2019) <https://10.1007/s11222-018-9848-9>. Open view-only
version at <https://rdcu.be/bfeWQ>. The package can be used to generate
synthetic or hybrid continuous microdata, and the relationship to the original
data can be controlled in several ways.


Best,
?yvind Langsrud


set.seed(123)
z <- 1:9
x <- c(0, 0, 0, 10, 10, 10, 1, 1, 1)
y <- rnorm(9)/10 + x  # y depends strongly on x
z100 <- z + 100  # change of scale (origin)
x100 <- x + 100  # change of scale (origin)
library(car)  # Anova with type II as default
library(ffmanova)

# Type III depends on scale
Anova(lm(y ~ factor(x) * z), type = 3)
Anova(lm(y ~ factor(x) * z100), type = 3)

# Even Type II depends on scale
Anova(lm(y ~ x + I(x^2)), type = 2)
Anova(lm(y ~ x100 + I(x100^2)), type = 2)

# Type II* within ffmanova is invariant
ffmanova(y ~ x100 + I(x100^2))
ffmanova(y ~ z * (x100 + I(x100^2)))

# Type I for comparison
anova(lm(y ~ x100 + I(x100^2)))  # same as ffmanova
anova(lm(y ~ z * (x100 + I(x100^2))))  # but here z is significant

	[[alternative HTML version deleted]]

R packages - Jan 2019 - MANOVA for collinear responses with rotation testing (ffmanova) + Synthetic data (RegSDC)

[R-pkgs] MANOVA for collinear responses with rotation testing (ffmanova) + Synthetic data (RegSDC)