Autoregressive integrated moving average

From Wikipedia, the free encyclopedia

In statistics, an autoregressive integrated moving average (ARIMA) model is a generalisation of an autoregressive moving average or (ARMA) model. These models are fitted to time series data either to better understand the data or to predict future points in the series. The model is generally referred to as an ARIMA(p,d,q) model where p, d, and q are integers greater than or equal to zero and refer to the order of the autoregressive, integrated, and moving average parts of the model respectively.

Given a time series of data Xt where t is an integer index and the Xt are real numbers, then an ARMA(p,q) model is given by

\left(1 - \sum_{i=1}^p \phi_i L^i\right) X_t = \left(1 + \sum_{i=1}^q \theta_i L^i\right) \varepsilon_t\,

where L is the lag operator, the φi are the parameters of the autoregressive part of the model, the θi are the parameters of the moving average part and the \varepsilon_t are error terms. The error terms \varepsilon_t are generally assumed to be independent, identically distributed variables sampled from a normal distribution with zero mean.

An ARIMA(p,d,q) process is obtained by integrating an ARMA(p,q) process. That is,

\left(1 - \sum_{i=1}^p \phi_i L^i\right) (1-L)^d X_t = \left(1 + \sum_{i=1}^q \theta_i L^i\right) \varepsilon_t\,

where d is a positive integer that controls the level of differencing (or, if d = 0, this model is equivalent to an ARMA model). Conversely, applying term-by-term differencing d times to an ARIMA(p,d,q) process gives an ARMA(p,q) process. Note that it is only necessary to difference the AR side of the ARMA representation, because the MA component is always I(0).

It should be noted that not all choices of parameters produce well-behaved models. In particular, if the model is required to be stationary then conditions on these parameters must be met.

Some well-known special cases arise naturally. For example, an ARIMA(0,1,0) model is given by:

X_t = X_{t-1} + \varepsilon

which is simply a random walk.

A number of variations on the ARIMA model are commonly used. For example, if multiple time series are used then the Xt can be thought of as vectors and a VARIMA model may be appropriate. Sometimes a seasonal effect is suspected in the model. For example, consider a model of daily road traffic volumes. Weekends clearly exhibit different behaviour from weekdays. In this case it is often considered better to use a SARIMA (seasonal ARIMA) model than to increase the order of the AR or MA parts of the model. If the time-series is suspected to exhibit long-range dependence then the d parameter may be replaced by certain non-integer values in a Fractional ARIMA (FARIMA also sometimes called ARFIMA) model.

[edit] See also

[edit] References

  • Mills, Terence C. Time Series Techniques for Economists. Cambridge University Press, 1990.
  • Percival, Donald B. and Andrew T. Walden. Spectral Analysis for Physical Applications. Cambridge University Press, 1993.

[edit] External links

In other languages