Dynamic Network Poisson Autoregression with Application to COVID-19 Count Data✩
Pub. online: 5 July 2024
Type: Statistical Data Science
Open Access
✩
The authors are most grateful to Yoshihisa Baba and two anonymous reviewers for their very helpful comments and suggestions. The first author acknowledges the financial support of the Japan Society for the Promotion of Science (grant number 22KK0022). This work was partially supported by The Hong Kong University of Science and Technology research grant “Risk Analytics and Applications” (grant number SBMDF21BM07). The funding recipient was MKPS.
Received
24 June 2023
24 June 2023
Accepted
12 March 2024
12 March 2024
Published
5 July 2024
5 July 2024
Abstract
There is growing interest in accommodating network structure in panel data models. We consider dynamic network Poisson autoregressive (DN-PAR) models for panel count data, enabling their use in regard to a time-varying network structure. We develop a Bayesian Markov chain Monte Carlo technique for estimating the DN-PAR model, and conduct Monte Carlo experiments to examine the properties of the posterior quantities and compare dynamic and constant network models. The Monte Carlo results indicate that the bias in the DN-PAR models is negligible, while the constant network model suffers from bias when the true network is dynamic. We also suggest an approach for extracting the time-varying network from the data. The empirical results for the count data for confirmed cases of COVID-19 in the United States indicate that the extracted dynamic network models outperform the constant network models in regard to the deviance information criterion and out-of-sample forecasting.
Supplementary material
Supplementary MaterialProgramming code can be found at https://github.com/ManabuAsai/Dynamic_network_poisson.
References
Amillotta M, Fokianos K, Krikidis I (2022). Generalized linear models network autoregression. In: Network Science: Proceedings of the 7th International Winter Conference, NetSci-X 2022 (P Ribeiro, F Silva, JF Mendes, R Laureano, eds.), In: Lecture Notes in Computer Science, 112–125. Network Science Society, Springer, Porto, Portugal.
Armillotta M, Fokianos K (2021). Poisson network autoregression. arXiv preprint: https://arxiv.org/abs/2104.06296.
Armillotta M, Fokianos K (2022). Testing linearity for network autoregressive models. arXiv preprint: https://arxiv.org/abs/2202.03852.
Bartolucci F, Pennoni F, Mira A (2021). A multivariate statistical approach to predict Covid-19 count data with epidemiological interpretation and uncertainty quantification. Statistics in Medicine, 40(24): 5351–5372. https://doi.org/10.1002/sim.9129
Berry LR, West M (2020). Bayesian forecasting of many count-valued time series. Journal of Business & Economic Statistics, 38(4): 872–887. https://doi.org/10.1080/07350015.2019.1604372
Chan S, Chu J, Zhang Y, Nadarajah S (2021). Count regression models for COVID-19. Physica A: Statistical Mechanics and its Applications, 563: 125460. https://doi.org/10.1016/j.physa.2020.125460
Chen CWS, Khamthong K, Lee S (2019). Markov switching integer-valued generalized auto-regressive conditional heteroscedastic models for dengue counts. Journal of the Royal Statistical Society. Series C. Applied Statistics, 68(4): 963–983. https://doi.org/10.1111/rssc.12344
Chen CWS, Lee S (2016). Generalized Poisson autoregressive models for time series of counts. Computational Statistics & Data Analysis, 99: 51–67. https://doi.org/10.1016/j.csda.2016.01.009
Chen CWS, Lee S, Khamthong K (2021). Bayesian inference of nonlinear hysteretic integer-valued garch models for disease counts. Computational Statistics, 36(1): 261–281. https://doi.org/10.1007/s00180-020-01018-7
Chen CWS, So MKP, Li JC, Sriboonchitta S (2016). Autoregressive conditional negative binomial model applied to over-dispersed time series of counts. Statistical Methodology, 31: 73–90. https://doi.org/10.1016/j.stamet.2016.02.001
Chu AMY, Chan TWC, So MKP, Wong WK (2021). Dynamic network analysis of COVID-19 with a latent pandemic space model. International Journal of Environmental Research and Public Health, 18(6): 3195. https://doi.org/10.3390/ijerph18063195
Chu AMY, Chong ACY, Lai NHT, Tiwari A, So MKP (2023). Enhancing the Predictive Power of Google Trends Data Through Network Analysis: Infodemiology Study of COVID-19. JMIR Public Health and Surveillance, 9(1): e42446. https://doi.org/10.2196/42446
Debaly ZM, Truquet L (2019). Stationarity and moment properties of some multivariate count autoregressions. arXiv preprint: https://arxiv.org/abs/1909.11392.
Debaly ZM, Truquet L (2023). Multivariate time series models for mixed data. Bernoulli, 29(1): 669–695. https://doi.org/10.3150/22-BEJ1474
El-Morshedy M, Altun E, Eliwa M (2022). A new statistical approach to model the counts of novel coronavirus cases. Mathematical Sciences, 16: 37–50. https://doi.org/10.1007/s40096-021-00390-9
Fokianos K (2021). Multivariate count time series modelling. Econometrics and Statistics. https://doi.org/10.1016/j.ecosta.2021.11.006
Fokianos K, Fried R, Kharin Y, Voloshko V (2022). Statistical analysis of multivariate discrete-valued time series. Journal of Multivariate Analysis, 188: 104805. https://doi.org/10.1016/j.jmva.2021.104805
Fokianos K, Rahbek A, Tjøstheim D A Tjøstheim D R (2009). Poisson autoregression. Journal of the American Statistical Association, 104(488): 1430–1439. https://doi.org/10.1198/jasa.2009.tm08270
Fokianos K, Støve B, Tjøstheim D, Doukhan P (2020). Multivariate count autoregression. Bernoulli, 26(1): 471–499. https://doi.org/10.3150/19-BEJ1132
Fokianos K, Tjøstheim D (2011). Log-linear Poisson autoregression. Journal of Multivariate Analysis, 102(3): 563–578. https://doi.org/10.1016/j.jmva.2010.11.002
Frühwirth-Schnatter S, Wagner H (2006). Auxiliary mixture sampling for parameter-driven models of time series of counts with applications to state space modelling. Biometrika, 93(4): 827–841. https://doi.org/10.1093/biomet/93.4.827
Green PJ, Mira A (2001). Delayed rejection in reversible jump Metropolis-Hastings. Biometrika, 88: 1035–1053. https://doi.org/10.1093/biomet/88.4.1035
Haario H, Laine M, Mira A, Saksman E (2006). Dram: Efficient adaptive MCMC. Statistics and Computing, 16: 339–354. https://doi.org/10.1007/s11222-006-9438-0
Haario H, Saksman E, Tamminen J (1999). Adaptive proposal distribution for random walk Metropolis algorithm. Computational Statistics, 14: 375–395. https://doi.org/10.1007/s001800050022
Haario H, Saksman E, Tamminen J (2001). An adaptive Metropolis algorithm. Bernoulli, 7: 223–242. https://doi.org/10.2307/3318737
Held L, Höhle M, Hofmann M (2005). A statistical framework for the analysis of multivariate infectious disease surveillance counts. Statistical Modelling, 5(3): 187–199. https://doi.org/10.1191/1471082X05st098oa
Livsey J, Lund R, Kechagias S, Pipiras V (2018). Multivariate integer-valued time series with flexible autocovariances and their application to major hurricane counts. Annals of Applied Statistics, 12(1): 408–431. https://doi.org/10.1214/17-AOAS1098
Martínez-Bello DA, López-Quílez A, Torres-Prieto A (2017). Bayesian dynamic modeling of time series of dengue disease case counts. PLoS Neglected Tropical Diseases, 11(7): e0005696. https://doi.org/10.1371/journal.pntd.0005696
Nikoloulopoulos AK, Karlis D (2009). Modeling multivariate count data using copulas. Communications in Statistics. Simulation and Computation, 39(1): 172–187. https://doi.org/10.1080/03610910903391262
Polwiang S (2020). The time series seasonal patterns of dengue fever and associated weather variables in Bangkok (2003-2017). BMC Infectious Diseases, 20: 208. https://doi.org/10.1186/s12879-020-4902-6
Ravishanker N, Serhiyenko V, Willig MR (2014). Hierarchical dynamic models for multivariate times series of counts. Statistics and its Interface, 7(4): 559–570. https://doi.org/10.4310/SII.2014.v7.n4.a11
Reasenberg P, Jones L (1994). Earthquake aftershocks: Update. Science, 265(5176): 1251–1252. https://doi.org/10.1126/science.265.5176.1251
Shinde GR, Kalamkar AB, Mahalle PN, Dey N, Chaki J, Hassanien AE (2020). Forecasting models for coronavirus disease (COVID-19): A survey of the state-of-the-art. SN Computer Science, 1: 197. https://doi.org/10.1007/s42979-020-00209-9
So MKP, Chan LSH, Chu AMY (2021a). Financial network connectedness and systemic risk during the COVID-19 pandemic. Asia-Pacific Financial Markets, 28(4): 649–665. https://doi.org/10.1007/s10690-021-09340-w
So MKP, Chu AMY, Tiwari A, Chan JNL (2021b). On topological properties of COVID-19: Predicting and assessing pandemic risk with network statistics. Scientific Reports, 11(1): 1–14. https://doi.org/10.1038/s41598-020-79139-8
So MKP, Tiwari A, Chu AMY, Tsang JTY, Chan JNL (2020). Visualizing COVID-19 pandemic risk through network connectedness. International Journal of Infectious Diseases, 96: 558–561. https://doi.org/10.1016/j.ijid.2020.05.011
Soyer R, Zhang D (2022). Bayesian modeling of multivariate time series of counts. Wiley Interdisciplinary Reviews: Computational Statistics, 14(6): e1559. https://doi.org/10.1002/wics.1559
Tierney L, Mira A (1999). Some adaptive Monte Carlo methods for Bayesian inference. Statistics in Medicine, 18: 2507–2515. https://doi.org/10.1002/(SICI)1097-0258(19990915/30)18:17/18<2507::AID-SIM272>3.0.CO;2-J
Wang Z, Chakraborty P, Mekaru SR, Brownstein JS, Ye J, Ramakrishnan N (2015). Dynamic Poisson autoregression for influenza-like-illness case count prediction. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’15 (Cao L, Zhang C, eds.), 1285–1294. Association for Computing Machinery, Association for Computing Machinery, Sydney, NSW, Australia.
West M (2020). Bayesian forecasting of multivariate time series: Scalability, structure uncertainty and decisions. Annals of the Institute of Statistical Mathematics, 72: 1–31. https://doi.org/10.1007/s10463-019-00741-3
West M, Harrison PJ, Migon HS (1985). Dynamic generalized linear models and Bayesian forecasting. Journal of the American Statistical Association, 80(389): 73–83. https://doi.org/10.1080/01621459.1985.10477131
Wu WB, Shao X (2004). Limit theorems for iterated random functions. Journal of Applied Probability, 41(2): 425–436. https://doi.org/10.1239/jap/1082999076
Zhang MY, Russell JR, Tsay RS (2001). A nonlinear autoregressive conditional duration model with applications to financial transaction data. Journal of Econometrics, 104(1): 179–207. https://doi.org/10.1016/S0304-4076(01)00063-X
Zhang Y, Zhou H, Zhou J, Sun W (2017). Regression models for multivariate count data. Journal of Computational and Graphical Statistics, 26(1): 1–13. https://doi.org/10.1080/10618600.2016.1154063
Zhu X, Pan R, Li G, Liu Y, Wang H (2017). Network vector autoregression. The Annals of Statistics, 45(3): 1096–1123. https://doi.org/10.1214/16-AOS1476