This supplementary consists of R code and datasets to reproduce the figures and 
results reported in the manuscript. These are listed in the following.

o R scripts
	1) plotCT-MAT6T7.R
	   R script to generate and plot the estimated unreported ratio of type 
	   I cases for two New England states, CT and MA, under incubation 
           periods T=6 or T=7. The script generates the plots for 04/22/2020
           or 08/31/2020, depending on which date is turned on. 
	2) plotUST6T7.R
	   Similar as plotCT-MAT6T7.R, this R script produces and plots the 
	   estimated unreported ratio of type I cases for the US under incubation 
           periods T=6 or T=7. The script generates the plots for 04/22/2020
           Or 08/31/2020, depending on which date is turned on. 
	3) plotInfRatio50.R
	   This script plots the estimated infection ratio for all 50 US states 
	   as heatmap on the US map. The script generates the plots for 04/22/2020
           or 08/31/2020, depending on which date is turned on.
	4) plotUST1Ratio.R
	   This script generates the estimated missing ratio of Type I for all 
	   50 US states with the incubation period T=7. The script generates the 
	   plots for 04/22/2020 or 08/31.2020, depending on which date is turned
	   on. 
	5) plotInfRatioCasesStates6_trend.R
	   This script generates the trend of estimated infection ratio and 
	   estimated number of infected cases through time for 6 heavily hit
	   states NY, NJ, CT, MA, RI and LA. The script generates the plots for 
	   04/22/2020 or 08/31/2020, depending on which date is turned on.
	6) sensitivityLognormal.R
	   This script generates newly infected cases with lognormal incubation
	   periods, and then compare the actual number of Type I unreported 
	   cases and those by our estimation under two different window sizes 
	   T=5 or 6. The window sizes are set to match those reported in the 
	   literature. The number of daily new cases are either set as a constant, 
	   1000, or generated uniformly from the integer interval [600,1200], 
	   [400,1600], or [200,2000]. 
	7) sensitivityWeibull.R
	   Similar as sensitivityLognormal.R, this script differs by generating
	   the newly infected cases with Weibull distribution incubation periods, 
 	   and window sizes set at T=6 or 7 to match those reported in literature. 

o Data
	1) US-POP.Data
	   Population of the US and each of 50 states in 2020.
	2) Reported cases for the US and each of the 50 states, each as a 
	   separate file placed in two different directories, Data_0422 and 
	   Data_0831, corresponding to data collected at 04/22/2020 and
	   08/31/2020, respectively. The files are named by the US or states 
	   abbreviations.
      3) Percentage of reported cases by age groups. These are put in the two 
 	   different directories, named as UScasePerc_AGE_20200420.Data and
	   UScasePerc_AGE_20200831.Data, respectively.

o	All questions or comments are to be directed to dyan@umassd.edu. Thanks 
	in advance for your feedback!

