Journal of Data Science logo


Login Register

  1. Home
  2. Issues
  3. Volume 19, Issue 2 (2021): Special issue: Continued Data Science Contributions to COVID-19 Pandemic
  4. Estimating the Number of Infected Cases ...

Journal of Data Science

Submit your article Information
  • Article info
  • More
    Article info

Estimating the Number of Infected Cases in COVID-19 Pandemic
Volume 19, Issue 2 (2021): Special issue: Continued Data Science Contributions to COVID-19 Pandemic, pp. 348–364
Donghui Yan   Ying Xu   Pei Wang  

Authors

 
Placeholder
https://doi.org/10.6339/21-JDS1002
Pub. online: 23 February 2021      Type: Statistical Data Science     

Received
7 December 2020
Accepted
10 January 2021
Published
23 February 2021

Abstract

It is widely acknowledged that the reported numbers of infected cases with COVID-19 were not complete. A structured approach is proposed where we distinguish cases reflected later in the numbers of confirmed cases and those with mild or no symptoms thus not captured by any systems at all. The number of infected cases in the US is estimated to be 220.54% of that reported as of Apr 20, 2020. This implies an overall infection ratio of 0.53%, and a case mortality rate at 2.85% which is close to the 3.4% suggested by WHO in March 2020.

Supplementary material

 Supplementary Material
In the online supplementary, we provide all R scripts and datasets used to produce the figures and results reported in the paper. All the R scripts are placed in the main directory of a .zip archive, along with a dataset for 2020 US population and a README document that briefly describes the R scripts and the datasets. Data collected as of Apr 20, 2020 and Aug 31, 2020 are placed in respective subdirectories; there is a separate time series dataset for each US state with such information as the report case numbers, death tolls etc.

References

 
Backer JA, Klinkenberg D, Wallinga J (2020). Incubation period of 2019 novel coronavirus (2019-nCoV) infections among travellers from Wuhan, China, 20–28 January 2020. Eurosurveillance, 25(5): 2000062.
 
Baquero C, Casari P, Anta AF, Frey D, Garcia-Agundez A, Georgiou C, et al. (2020). Measuring icebergs: Using different methods to estimate the number of COVID-19 cases in Portugal and Spain. medRxiv preprint: https://doi.org/10.1101/2020.04.20.20073056.
 
Bi Q, Wu Y, Mei S, Ye C, Zou X, Zhang Z, et al. (2020). Epidemiology and transmission of COVID-19 in Shenzhen China: Analysis of 391 cases and 1286 of their close contacts. medRxiv preprint: https://doi.org/10.1101/2020.03.03.20028423.
 
Bohk-Ewald C, Dudel C, Myrskyla M (2020). A demographic scaling model for estimating the total number of COVID-19 infections. medRxiv preprint: https://doi.org/10.1101/2020.04.23.20077719.
 
Bottcher L, Xia M, Chou T (2020). Why estimating population-based case fatality rates during epidemics may be misleading. medRxiv preprint: https://doi.org/10.1101/2020.03.26.20044693.
 
Byambasuren O, Bell MCK, Clark J, McLaws M, Glasziou P (2020). Estimating the extent of asymptomatic COVID-19 and its potential for community transmission: Systematic review and meta-analysis. Journal of the Association of Medical Microbiology and Infectious Disease, 5(4): 223–234. doi: https://doi.org/10.3138/jammi-2020-0030.
 
Easley D, Kleinberg J (2010). Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge University Press.
 
Gupta S, Shankar R (2020). Estimating the number of COVID-19 infections in Indian hot-spots using fatality data. arXiv preprint: https://arxiv.org/abs/arXiv:2004.04025.
 
Jagodnik KM, Ray F, Giorgi FM, Lachmann A (2020). Correcting under-reported COVID-19 case numbers: estimating the true scale of the pandemic. medRxiv preprint: https://doi.org/10.1101/2020.03.14.20036178.
 
Kaplan EL, Meier P (1958). Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 53(282): 457–481.
 
Lauer SA, Grantz K, Bi Q, Jones FK, Zheng Q, Meredith H, et al. (2020). The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: Estimation and application. Annals of Internal Medicine, 172(9): 577–582. doi: https://doi.org/10.7326/M20-0504.
 
Linton NM, Kobayashi T, Yang Y, Hayashi K, Akhmetzhanov AR, Jung S, et al. (2020). Incubation period and other epidemiological characteristics of 2019 novel coronavirus infections with right truncation: A statistical analysis of publicly available case data. Journal of Clinical Medicine, 9(2): 538.
 
Little RJ, Rubin D (2002). Statistical Analysis with Missing Data. Wiley.
 
McAloon C, Collins A, Hunt K, Barber A, Byrne A, Butler F, et al. (2020). Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research. British Medical Journal Open, 10(8): 1–9.
 
Prem K, Liu Y, Russell T, Kucharski A, Eggo R, Davies N, et al. (2020). The effect of control strategies to reduce social mixing on outcomes of the COVID-19 epidemic in Wuhan, China: a modelling study. The Lancet Public Health, 5(5): 261–270.
 
Richterich P (2020). Severe underestimation of COVID-19 case numbers: effect of epidemic growth rate and test restrictions. medRxiv preprint: https://doi.org/10.1101/2020.04.13.20064220.
 
Tian T, Tan J, Jiang Y, Wang X, Zhang H (2020). Evaluate the timing of resumption of business for the states of New York, New Jersey, and California via a pre-symptomatic and asymptomatic transmission model of COVID-19. medRxiv preprint: https://doi.org/10.1101/2020.05.16.20103747.
 
WHO (2020). Coronavirus (COVID-19) Mortality Rate. https://www.worldometers.info/coronavirus/coronavirus-death-rate/.
 
Wikipedia (2020). COVID-19 pandemic in the United States. https://en.wikipedia.org/wiki/COVID-19_pandemic_in_the_United_States.
 
Yan D, Li C, Cong N, Yu L, Gong P (2019). A structured approach to the analysis of remote sensing images. International Journal of Remote Sensing, 40(20): 7874–7897.
 
Zhang J, Litvinova M, Liang Y, Wang Y, Wang W, Zhao S, et al. (2020). Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China. Science, 368: 1481–1486.
 
Zhou T, Ji Y (2020). Semiparametric Bayesian inference for the transmission dynamics of COVID-19 with a state-space model. arXiv preprint: https://arxiv.org/abs/arXiv:2006.05581.

PDF XML
PDF XML

Copyright
© 2021 The Author(s)
This is a free to read article.

Keywords
number of unreported cases population match

Metrics
since February 2021
1242

Article info
views

396

PDF
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy