Autoregression

Describing Autoregression

  • An autoregressive model is a linear regression model, where the response variable is some variable indexed by the current time period and the predictor variables are the same variable indexed by previous time periods
  • For example, we could create an autoregressive model to predict the average price of salmon in March depending on the average price of salmon in February and the average price of salmon in January
  • In this case, we could define the autoregressive model as the following:

    st=β0+β1st1+β2st2+ϵts_{t} = \beta_{0} + \beta_{1}s_{t-1} + \beta_{2}s_{t-2} + \epsilon_{t}
    • Where sts_{t} represents the average price of salmon in the current time period (i.e. March)
    • Where st1s_{t-1} represents the average price of salmon in the previous time period (i.e. February)
    • Where st2s_{t-2} represents the average price of salmon in the time period before the previous time period (i.e. March)

Including Lag Variables

  • A lag variable refers to a predictor from a previous time period in an autoregressive model
  • For example, the average price of salmon in February (i.e. st1s_{t-1}) would be considered a lag variable in our case
  • Including every single lag variable (or a very large amount of lag variables) is a typical naive approach to fitting an autoregressive model
  • However, this typically leads to overfitting
  • Therefore, we are interested in only including the most significant lag variables in our model to avoid overfitting, and thus building a model that will hold up better over time
  • We can determine which lags are most significant by observing the lags within a partial autocorrelated function (or pacfpacf) chart
  • Specifically, we want to know which months have prices that are directly correlated with the prices of our current month
  • Essentially, we only want to include the lags in our model whose direct effects (based on pacfpacf) are high in magnitude according to the pacfpacf chart

References

Previous
Next

Autocorrelation

Moving Average Model