Interval Forecasting in Time Series Analysis: Application to COVID-19 and Beyond
Pub. online: 9 May 2025
Type: Data Science In Action
Open Access
Received
1 October 2024
1 October 2024
Accepted
15 April 2025
15 April 2025
Published
9 May 2025
9 May 2025
Abstract
Forecasting is essential for optimizing resource allocation, particularly during crises such as the unprecedented COVID-19 pandemic. This paper focuses on developing an algorithm for generating k-step-ahead interval forecasts for autoregressive time series. Unlike conventional methods that assume a fixed distribution, our approach utilizes kernel distribution estimation to accommodate the unknown distribution of prediction errors. This flexibility is crucial in real-world data, where deviations from normality are common, and neglecting these deviations can result in inaccurate predictions and unreliable confidence intervals. We evaluate the performance of our method through simulation studies on various autoregressive time series models. The results show that the proposed approach performs robustly, even with small sample sizes, as low as 50 observations. Moreover, our method outperforms traditional linear model-based prediction intervals and those derived from the empirical distribution function, particularly when the underlying data distribution is non-normal. This highlights the algorithm’s flexibility and accuracy for interval forecasting in non-Gaussian contexts. We also apply the method to log-transformed weekly COVID-19 case counts from lower-middle-income countries, covering the period from June 1, 2020, to March 13, 2022.
References
Abbasimehr H, Paki R, Bahrini A (2022). A novel approach based on combining deep learning models with statistical methods for COVID-19 time series forecasting. Neural Computing & Applications, 34: 3135–3149. https://doi.org/10.1007/s00521-021-06548-9
Chatfield C (1993). Calculating interval forecasts. Journal of Business & Economic Statistics, 11(2): 121–135. https://doi.org/10.1080/07350015.1993.10509938
Cerqueira V, Torgo L, Mozetič I (2020). Evaluating time series forecasting models: An empirical study on performance estimation methods. Machine Learning, 109: 1997–2028. https://doi.org/10.1007/s10994-020-05910-7
Gooijer J, Hyndman R (2006). 25 years of time series forecasting. International Journal of Forecasting, 22: 443–473. https://doi.org/10.1016/j.ijforecast.2006.01.001
Inoue A, Jin L, Rossi B (2017). Rolling window selection for out-of-sample forecasting with time-varying parameters. Journal of Econometrics, 196: 55–67. https://doi.org/10.1016/j.jeconom.2016.03.006
Kong J, Gu L, Yang L (2018). Prediction interval for autoregressive time series via oracally efficient estimation of multi-step-ahead innovation distribution function. Journal of Time Series Analysis, 395: 690–708. https://doi.org/10.1111/jtsa.12293
Pan L, Politis DN (2016). Bootstrap prediction intervals for linear, nonlinear and nonparametric autoregressions. Journal of Statistical Planning and Inference, 177: 1–27. https://doi.org/10.1016/j.jspi.2014.10.003
Petropoulos F, Makridakis S, Stylianou N (2022). COVID-19: Forecasting confirmed cases and deaths with a simple time series model. International Journal of Forecasting, 38: 439–452. https://doi.org/10.1016/j.ijforecast.2020.11.010
Pierce DA (1971). Least squares estimation in the regression model with autoregressive-moving average errors. Biometrika, 58: 299–312. https://doi.org/10.1093/biomet/58.2.299
R Core Team (2024). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/.
Shao Q, Polavarapu M, Small L, Singh S, Nguyen Q, Shao K (2024). A longitudinal mixed effects model for assessing mortality trends during vaccine rollout. Healthcare Analytics, 6: 100347. https://doi.org/10.1016/j.health.2024.100347
Shao Q, Yang L (2017). Oracally efficient estimation and consistent model selection for auto-regressive moving average time series with trend. Journal of the Royal Statistical Society, Series B, Statistical Methodology, 79: 507–524. https://doi.org/10.1111/rssb.12170
Wang J, Liu R, Cheng F, Yang L (2014). Oracally efficient estimation of autoregressive error distribution with simultaneous confidence band. The Annals of Statistics, 42: 654–668. https://doi.org/10.1214/14-AOS1238
Zhong C (2024). Statistical inference for innovation distribution in ARMA and multi-step-ahead prediction via empirical process. Journal of Nonparametric Statistics, 1–27. https://doi.org/10.1080/10485252.2024.2384608