QUESTION:

How can I extend ARIMA to include regressors ?

ANSWER:

The problem you refer to is handled by Transfer Functions. Box and Jenkins wrote a text, but most people

have only read the first half of it. The first half (roughly speaking) dealt with ARIMA or auto-projective

methods and the issues involved in identifying, estimating and forecasting. The second half dealt with

Transfer Functions and augmented the auto-projective model with eXgoneous or X input series. These

input series were modelled as right-hand side variables and their respective lag structures were to be

optimized.

 

This is then a super-set of multiple regression where Y is set upon by X and its lags and an error process

that may have some color ( i.e. an ARIMA process). Unfortunately, most readers and software developers

have ignored the opportunities in teaching and implementing these single equation, multivariable time

series methods. The input series may either be stochastic (probabalistic) or deterministic (fixed).

Identification strategies and model refinements differ for both stochastic and deterministic inputs.

 

Pre-whitened cross correlations are used to identify possible structures. Note that the differencing

involved in these identification steps are not necessarily part and parcel of the Transfer Model. Both of

these tools (ARIMA and TRANSFER FUNCTIONS) are NOT ROBUST to certain violations of the

Gaussian assumptions. Specifically, lack of normality doesn't seem to be a problem, although it could be.

Discrete data as opposed to continuous data can be a problem at low levels of response. Interventions, i.e. a

non-constant mean of the errors, has a significant effect on each step of the modeling process. The analog

to ROBUST REGRESSION when dealing with time series data is INTERVENTION DETECTION.

Interventions i.e. changes in the mean of the residuals can arise in at least four ways: 1. Pulse (one-time

unusual values) 2. Step or Level (consecutive unusual values .... thus locally usual) 3. Seasonal Pulses

(consecutive unusual values, , s periods apart. .. thus usual) 4. Time trends (trends do not necessarily

begin at time period 1 and end at time period t).

 

Changes in the second moment of the residuals (the variance) can have a significant effect. Work on

detecting relationships between central tendency (the mean) and variability (the variance) led to

developing optimum transformations (log, square root etc.). Recent work has identified regime changes

in the variance leading to Weighted Least Squares Estimators. The good news is that AUTOBOX

provides an integration of these opportunities even providing a "productivity aid" by automatically

identifying and building these models. AUTOBOX shows each and every step of the process and users

can set up their own procedures, models and simulations and fine tune the analysis.