Main Content

MMSE Forecasting Regression Models with ARIMA Errors

What Are MMSE Forecasts?

An objective of time series analysis is generating forecasts for responses over a future time horizon. That is, you can generate predictions foryT+ 1,yT+ 2、……yT+ hgiven the following:

  • An observed seriesy1,y2、……yT

  • A forecast horizonh

  • Nonstochastic predictorsx1,x2、……xT、……xT+h, wherexkis anr-vector containing the measurements ofrpredictors observed at timek

  • A regression model with ARIMA errors

    y t = c + X t β + u t Η ( L ) u t = Ν ( L ) ε t ,

    where H(L) and N(L) are compound autoregressive and moving average lag operator polynomials (possibly containing integration), respectively.

Let y ^ t + 1 denote a forecast for the process at timet+ 1, conditional on the history of the process up to timet(Ht), and assume that the predictors are fixed. The minimum mean square error (MMSE) forecast is the forecast y ^ t + 1 that minimizes expected square loss,

E ( y t + 1 y ^ t + 1 | H t ) 2 .

Minimizing this loss function yields the MMSE forecast,

y ^ t + 1 = E ( y t + 1 | H t ) .

How forecast Generates MMSE Forecasts

forecastgenerates MMSE forecasts recursively. When you callforecast, you must specify aregARIMAmodel (Mdl) and the forecast horizon. You can also specify presample observations (Y0), predictors (X0), innovations (E0), and conditional disturbances (U0) using name-value pair arguments.

To begin forecastingytstarting at timeT+ 1, use the last few observations ofytandXtas presample responses and predictors to initialize the forecast. Alternatively, you can specify presample unconditional disturbances or innovations.

However, when you specify presample data:

  • If you provide presample predictor data (X0), then you must also provide predictor forecasts (XF). It is best practice to setX0to the same predictor matrix that estimates the parameters. If you do not provide presample and future predictors, thenforecastignores the regression component in the model.

  • If the error process inMdlcontains a seasonal or nonseasonal autoregressive component, or seasonal or nonseasonal integration, thenforecastrequires a minimum ofPpresample unconditional disturbances to initialize the forecast. The propertyPofMdlstoresP.

  • If the error process inMdlcontains a seasonal or nonseasonal moving average component, thenforecastrequires a minimum ofQpresample innovations to initialize the forecast. The propertyQofMdlstoresQ.

  • If you provide a sufficient amount of presample unconditional disturbances, thenforecastignoresY0andX0. If you also do not provideE0, but provide enough presample unconditional disturbances, thenforecastinfers the required amount of presample innovations from the ARIMA error model andU0.

  • If you provide a sufficient amount of presample responses and predictors (and do not provideU0), thenforecastuses the regression model to infer the presample unconditional disturbances.

  • If you do not provide presample observations, thenforecastsets the required amount of presample unconditional disturbances and innovations to 0.

  • If you provide an insufficient amount of presample observations, thenforecastreturns an error.

Consider generating forecasts from a regression model with ARMA(3,2) errors:

y t = c + X t β + u t ( 1 a 1 L a 2 L 2 a 3 L 3 ) u t = ( 1 + b 1 L + b 2 L 2 ) ε t or a ( L ) u t = b ( L ) ε t ,

wherea(L) andB(L) are lag operator polynomials. The largest AR lag is 3, the largest MA lag is 2. This model does not contain any seasonal lags nor integration. Therefore,P= 3 andQ= 2. To forecast this model, you need three presample responses and predictors, or three presample unconditional disturbances, and two presample innovations.

Given presample unconditional disturbances ( u T 2 , u T 1 , u T ) , presample innovations ( ε T 1 , ε T ) , and future predictors ( X T + 1 , X T + 2 , ... ) , you can forecast the model as follows:

  • u ^ T + 1 = a 1 u T + a 2 u T 1 + a 3 u T 2 + b 1 ε T + b 2 ε T 1 y ^ T + 1 = c + X T + 1 β + u ^ T + 1 .

  • u ^ T + 2 = a 1 u ^ T + 1 + a 2 u T + a 3 u T 1 + b 2 ε T y ^ T + 2 = c + X T + 2 β + u ^ T + 2 .

  • u ^ T + 3 = a 1 u ^ T + 2 + a 2 u ^ T + 1 + a 3 u T y ^ T + 3 = c + X T + 3 β + u ^ T + 3 .

...

Note that:

  • Future innovations take on their unconditional mean, 0.

  • For stationary error processes, such as this one:

    • 的forecasted unconditional disturbances converge to their unconditional mean,

      E ( u t ) = b ( L ) a ( L ) E ( ε t ) = 0.

    • c+Xtβgoverns the long-term behavior of the forecasted responses.

Forecast Error

的forecast error for ans-step ahead forecast of a regression model with ARIMA errors is

MSE = E ( y T + s y ^ T + s | H T + s 1 ) 2 = E ( c + X T + s β + u T + s c X t + s β u ^ T + s | H T + s 1 ) 2 = E ( u T + s u ^ T + s | H T + s 1 ) 2 = Ν ( L ) Η ( L ) E ( ε t 2 | H T + s 1 ) = ψ ( L ) σ 2 ,

where the dividendψ(L)是一个无限延迟算子多项式σ2is the innovation variance.

If the error process is stationary, then the coefficients ofψ(L) are absolutely summable. Therefore, the MSE (mean square error) converges to the unconditional variance of the process[1].

If the error process is not stationary, then the MSE grows with increasings.

References

[1] Box, G. E. P., G. M. Jenkins, and G. C. Reinsel.Time Series Analysis: Forecasting and Control. 3rd ed. Englewood Cliffs, NJ: Prentice Hall, 1994.

See Also

|

Related Examples

More About