On 27-Feb-10 03:52:19, Cardinals_Fan wrote:>
> Hi, I am a stata user trying to transition to R. Typically I
> compute marginal effects plots for (example) probit models by
> drawing simulated betas by using the coefficient/standard error
> estimates after I run a probit model. I then use these simulated
> betas to compute first difference marginal effects. My question
> is, can I do this in R? Specifically, I was wondering if anyone
> knows how R stores the coefficient/standard error estimates after
> you estimate the model? I assume it's vector, but what is it
> called?
>
> Cheers
> --
Here is an example which sets up (X,Y) data using a probit mechaism,
then fits a probit model, and then extracts the information which
you seek.
set.seed(54321)
X <- 0.2*(-10:10)
U <- rnorm(21)
Y <- 1*(U <= X) ## binary outcome 0/1, = 1 if N(0,1) <= X
GLM <- glm(Y ~ X, family=binomial(link=probit)) ## fit a probit
Coef <- summary(GLM)$coef ## apply summary() to the fit
GLM is a list with a large number of components: enter the command
str(GLM)
and have a look at what you get! Only a few of these are displayed
when you apply print() to it:
print(GLM)
# Call: glm(formula = Y ~ X, family = binomial(link = probit))
# Coefficients:
# (Intercept) X
# 0.08237 0.56982
#
# Degrees of Freedom: 20 Total (i.e. Null); 19 Residual
# Null Deviance: 29.06
# Residual Deviance: 23.93 AIC: 27.93
Note that you do *not* get Standard Errors from this.
However, all the information in GLM is available for processing
by other functions. In particular, summary(GLM) produces another
list with several components -- have a look at the output from
str(summary(GLM))
One of these components (listed near the end of this output)
is "coef", and it can be accessed as summary(GLM)$coef as in the
above command
Coef <- summary(GLM)$coef
This is a matrix (in this case 2 named rows, 4 named columns):
Coef
# Estimate Std. Error z value Pr(>|z|)
# (Intercept) 0.0823684 0.2974595 0.2769063 0.78185207
# X 0.5698200 0.2638657 2.1595076 0.03081081
So there is one row for each coefficient in the model (here 2,
one for Intercept, one for variable X), and four columns
(for the Estimate itself of the coefficient, for its Standard
Error, for the z-value (Est/SE), and for the P-value).
Hence you can access the estimates as
Coef[,1] # (the first column of the matrix)
# (Intercept) X
# 0.0823684 0.5698200
and their respective SEs as
Coef[,2] # (the second column of the matrix)
# (Intercept) X
# 0.2974595 0.2638657
I have spelled this out in detail to demonstrate that the key
to accessing information in objects constructed by R lies in
its structures (especially lists, vectors and matrices). You
can find out what is involved for any function by looking for
the section "Value" in its help page. For instance, the function
summary() when applied to a GLM uses the method summary.glm(),
so you can enter the command
?summary.glm
and then read what is in the section "Value". This shows that
it is a list with components whose names are
call, family, deviance, ... , coefficients, ... , symbolic.cor
and a component with name "Name" can be accessed using
"$Name" as
in "GLM$coef" (you can use "coef" instead of
"coefficients" since
the first four letters are [more than] enough to identify the name
uniquely).
Once you get used to this, things become straightforward!
Ted.
--------------------------------------------------------------------
E-Mail: (Ted Harding) <Ted.Harding at manchester.ac.uk>
Fax-to-email: +44 (0)870 094 0861
Date: 27-Feb-10 Time: 08:38:41
------------------------------ XFMail ------------------------------