thr3ads.net - R help - [R] model selection, stepAIC(), and coxph() (fwd) [May 2006]

If this information is useful, please help other people find it:
Share via:

Chad Reyhan Bhatti

2006-May-07 00:01 UTC

[R] model selection, stepAIC(), and coxph() (fwd)

Hello,

My question concerns model selection, stepAIC(), add1(), and coxph().

In Venables and Ripley (3rd Ed) pp389-390 there is an example of using
stepAIC() for the automated selection of a coxph model for VA lung cancer
data.

A statistics question:  Can partial likelihoods be interpreted in the same
manner as likelihoods with respect to information based criterion and
likelihood ratio tests?  It seems that they should be treated as
quasilikelihoods which would make stepAIC() invalid and would require the
use of add1() with a F-test for the reduction in deviance.

An answer and a reference would be greatly appreciated.

Thanks,

Chad R. Bhatti


!DSPAM:445d377a41433079914684!

Thomas Lumley

2006-May-08 15:11 UTC

head link

[R] model selection, stepAIC(), and coxph() (fwd)

On Sat, 6 May 2006, Chad Reyhan Bhatti wrote:
> Hello,
>
> My question concerns model selection, stepAIC(), add1(), and coxph().
>
> In Venables and Ripley (3rd Ed) pp389-390 there is an example of using
> stepAIC() for the automated selection of a coxph model for VA lung cancer
> data.
>
> A statistics question:  Can partial likelihoods be interpreted in the same
> manner as likelihoods with respect to information based criterion and
> likelihood ratio tests?  It seems that they should be treated as
> quasilikelihoods which would make stepAIC() invalid and would require the
> use of add1() with a F-test for the reduction in deviance.
Since this is a question about the MASS book you would be better off 
contacting the authors.

They do (as usual) know what they are doing.  The Cox model is an 
unusually (perhaps uniquely) well-behaved semiparametric model, and the 
partial likelihood really does behave this way.

- For data without ties in the survival time the partial likelihood is 
(proportional to) the marginal likelihood of the ranks, so it is a 
perfectly good parametric likelihood. (Kalbfleisch & Prenctice, 
Biometrika, 1973)

- The chi^2 distribution (rather than F distribution) for the likelihood 
ratio test is justified by the marginal likelihood, or by martingale 
arguments (eg the book by Fleming and Harrington), or in more modern times 
by empirical process arguments or as a semiparametric profile likelihood. 
However, the only technically hard part is showing weak convergence -- the 
original paper by Cox showed that the variance of the partial score and 
the Hessian of the partial likelihood were the same, which is the key fact 
for the chi^2 rather than F test to be valid (if one of them is)

- The same arguments suggest AIC will be appropriate for comparing 
different subsets of variables in the same way that it is for generalized 
linear models. I don't have a reference here.


 	-thomas

Thomas Lumley			Assoc. Professor, Biostatistics
tlumley at u.washington.edu	University of Washington, Seattle

Seemingly Similar Threads

Search for more apparently analagous threads

R help - May 2006 - model selection, stepAIC(), and coxph() (fwd)

[R] model selection, stepAIC(), and coxph() (fwd)

[R] model selection, stepAIC(), and coxph() (fwd)

Seemingly Similar Threads