Hi all,
I have two time series that I would like to correlate but as they are
autocorrelated, I am "pre-whitening" them first by fitting ARIMA
models,
then correlating their residuals....as described in
https://onlinecourses.science.psu.edu/stat510/?q=node/75
However, http://www.stat.pitt.edu/stoffer/tsa2/Rissues.htm discusses some
issues with ARIMA in R. In particular, for issue 2, if there is any
differencing, a constant term is not included, but the solution is to 1)
input a series that has already been differenced and not include an order
for differencing in the function, or 2) include an xreg argument.
I have some sample code:
set.seed(66)
x = arima.sim(list(order=c(1,1,0), ar=.9), n=100) + 50
y=c(x[-1:-3],x[98:100])
# Method 1
m1a=arima(diff(x),order=c(1,0,0))
m1b=arima(diff(y),order=c(1,0,0))
a=ccf(resid(m1a),resid(m1b))
# Method 2
m2a=arima(x,order=c(1,1,0),xreg=1:length(x))
m2b=arima(y,order=c(1,1,0),xreg=1:length(y))
b=ccf(resid(m2a),resid(m2b))
My question is why do the two methods generate different results for the CCF
(one having a peak correlation at lag 4 and the other at lag 3, and the
correlations are also of slightly different values)? I am assuming Method 2
is the correct method since it gives the peak correlation at the correct lag
(lag 3), but I'm wondering why Method 1 does not work.
Could anyone explain this? Thanks in advance!
Emily
[[alternative HTML version deleted]]