SARIMA Model

Describing a SARIMA Model

  • A SARIMA model is an ARIMA model that accounts for seasonality
  • Here, seasonality just refers to a repeating pattern within a year (i.e. can't be a two-year pattern)
  • A SARIMA model is defined as the following:

    ARIMA(p,d,q)(P,D,Q)mARIMA(p,d,q)(P,D,Q)m
    • Where pp refers to the order of the autoregressive portion of the ARIMA model
    • Where dd refers to the order of the integrated portion of the ARIMA model (i.e. how many times we take a difference to get our model to be stationary)
    • Where qq refers to the order of the moving average portion of the ARIMA model
    • Where mm refers to the seasonal factor (i.e. the number of distinct periods within a year that make up a single seasonal pattern that is repeated over time)
    • Where PP refers to the order of the autoregressive portion of the ARIMA model, with an additional seasonal component
    • Where DD refers to the order of the integrated porition of the ARIMA model, with an additional seasonal component
    • Where QQ refers to the order of the order of the moving average porition of the ARIMA model, with an additional seasonal component
  • A SARIMA model can also be written as an ARIMA model:
ARIMA(p,d,q)1(P,D,Q)mARIMA(p,d,q)_{1}(P,D,Q)m
  • This is because we're technically backshifting parameters p,d,p, d, and qq by 11 (i.e. st+1sts_{t+1} - s_{t})

Example of SARIMA Model

  • Let's say we define the following SARIMA model:
ARIMA(1,0,0)(0,1,1)4ARIMA(1,0,0)(0,1,1)_{4}
  • The pp parameter with a value equal to 11 can be translated to the following:

    (1ϕ1B1)yt(1-\phi_{1}B^{1})y_{t}
    • Where B1ytB^{1}y_{t} is our lag operator representing yt1y_{t-1}
    • Where ϕ1\phi_{1} is the slope of our variable yt1y_{t-1}
    • Therefore, (1ϕ1B1)yt(1-\phi_{1}B^{1})y_{t} is just shorthand for ytϕ1yt1y_{t}-\phi_{1}y_{t-1}
  • The DD parameter with a value equal to 11 can be translated to the following:

    (1B4)yt(1-B^{4})y_{t}
    • Where B4ytB^{4}y_{t} is our lag operator representing yt4y_{t-4}
    • Therefore, (1B4)yt(1-B^{4})y_{t} is just shorthand for z=ytyt4z = y_{t}-y_{t-4}
  • The QQ parameter with a value equal to 11 can be translated to the following:

    (1+θ1B4)ϵt(1+\theta_{1}B^{4})\epsilon_{t}
    • Where B4ϵtB^{4}\epsilon_{t} is our lag operator representing ϵt4\epsilon_{t-4}

      • Where ϵt4\epsilon_{t-4} is the error of the fourth previous predicted value and what we observed
      • Where θ1\theta_{1} represents the percentage of the error ϵt4\epsilon_{t-4} we should include in our model
      • Therefore, (1θ1B4)ϵt(1-\theta_{1}B^{4})\epsilon_{t} is just shorthand for ϵt+θ1ϵt4\epsilon_{t}+\theta_{1}\epsilon_{t-4}
  • We can continue to simplify our SARIMA model to the following formulas:
(1ϕ1B1)(1B4)yt=(1+θ1B4)ϵt(1-\phi_{1}B^{1})(1-B^{4})y_{t} = (1+\theta_{1}B^{4})\epsilon_{t} (1ϕ1B1B4+ϕ1B5)yt=ϵt+θ1ϵt4(1-\phi_{1}B^{1}-B^{4}+\phi_{1}B^{5})y_{t} = \epsilon_{t}+\theta_{1}\epsilon_{t-4} ytϕ1yt1yt4+ϕ1yt5=ϵt+θ1ϵt4y_{t} - \phi_{1}y_{t-1} - y_{t-4} + \phi_{1}y_{t-5} = \epsilon_{t} + \theta_{1}\epsilon_{t-4} ytyt4=ϕ1yt1ϕ1yt5+θ1ϵt4+ϵty_{t} - y_{t-4} = \phi_{1}y_{t-1} - \phi_{1}y_{t-5} + \theta_{1}\epsilon_{t-4} + \epsilon_{t} zt=ϕ1zt1+θ1ϵt4+ϵtz_{t} = \phi_{1}z_{t-1} + \theta_{1}\epsilon_{t-4} + \epsilon_{t}
  • Since z=ytyt4z = y_{t}-y_{t-4} is based on parameter DD
  • Now, we've transformed our ARIMA model to account for seasonality by taking differences ytyt4y_{t} - y_{t-4}, which will lead to a more stationary model

References

Previous

Seasonality