<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1036</article-id>
<article-id pub-id-type="doi">10.6339/22-JDS1036</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Data Science in Action</subject></subj-group></article-categories>
<title-group>
<article-title>Clustering US States by Time Series of COVID-19 New Case Counts in the Early Months with Non-Negative Matrix Factorization</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Chen</surname><given-names>Jianmin</given-names></name><xref ref-type="aff" rid="j_jds1036_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-8211-5930</contrib-id>
<name><surname>Zhang</surname><given-names>Panpan</given-names></name><email xlink:href="mailto:panpan.zhang@pennmedicine.upenn.edu">panpan.zhang@pennmedicine.upenn.edu</email><xref ref-type="aff" rid="j_jds1036_aff_002">2</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds1036_aff_001"><label>1</label>Department of Statistics, <institution>University of Connecticut</institution>, Storrs, CT 06269, <country>U.S.A.</country></aff>
<aff id="j_jds1036_aff_002"><label>2</label>Department of Biostatistics, Epidemiology and Informatics, <institution>University of Pennsylvania</institution>, Philadelphia, PA 19104, <country>U.S.A.</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:panpan.zhang@pennmedicine.upenn.edu">panpan.zhang@pennmedicine.upenn.edu</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2022</year></pub-date><pub-date pub-type="epub"><day>4</day><month>2</month><year>2022</year></pub-date><volume>20</volume><issue>1</issue><fpage>79</fpage><lpage>94</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1036_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>
<list>
<list-item id="j_jds1036_li_001">
<label>1.</label>
<p>data_10_05.csv: This file contains the data from a public repository maintained by the Center for Systems Science and Engineering at the Johns Hopkins University (<xref ref-type="bibr" rid="j_jds1036_ref_008">Dong et al.</xref>, <xref ref-type="bibr" rid="j_jds1036_ref_008">2020</xref>). The data was retrieved on October 5, 2020. The case numbers may differ from those in the current version owing to possible modifications made after October 5, 2020.</p>
</list-item>
<list-item id="j_jds1036_li_002">
<label>2.</label>
<p>nst-est2019-01.csv: This file contains the state-level population data, maintained by the US Census Bureau (<uri>https://www.census.gov</uri>). The data was released at the end of 2019.</p>
</list-item>
<list-item id="j_jds1036_li_003">
<label>3.</label>
<p>pretreat.R: Codes for pre-processing the data (e.g., smoothing and scaling).</p>
</list-item>
<list-item id="j_jds1036_li_004">
<label>4.</label>
<p>getnmfparameter.R: Codes for obtaining NMF ranks via the cross-validation method proposed in the paper.</p>
</list-item>
<list-item id="j_jds1036_li_005">
<label>5.</label>
<p>model_fit.R: Codes for implementing the NMF method. The results of <italic>k</italic>-means clustering (including the selection of <italic>k</italic>) are given by running this file as well.</p>
</list-item>
<list-item id="j_jds1036_li_006">
<label>6.</label>
<p>plotmaking.R: Codes for generating the figures in the paper.</p>
</list-item>
</list> 
</p>
</caption>
</supplementary-material><history><date date-type="received"><day>23</day><month>1</month><year>2022</year></date><date date-type="accepted"><day>25</day><month>1</month><year>2022</year></date></history>
<permissions><copyright-statement>2022 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2022</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>The spreading pattern of COVID-19 in the early months of the pandemic differs a lot across the states in the US under different quarantine measures and reopening policies. We proposed to cluster the US states into distinct communities based on the daily new confirmed case counts from March 22 to July 25 via a nonnegative matrix factorization (NMF) followed by a <italic>k</italic>-means clustering procedure on the coefficients of the NMF basis. A cross-validation method was employed to select the rank of the NMF. The method clustered the 49 continental states (including the District of Columbia) into 7 groups, two of which contained a single state. To investigate the dynamics of the clustering results over time, the same method was successively applied to the time periods with an increment of one week, starting from the period of March 22 to March 28. The results suggested a change point in the clustering in the week starting on May 30, caused by a combined impact of both quarantine measures and reopening policies.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>change point</kwd>
<kwd>COVID-19</kwd>
<kwd><italic>k</italic>-means clustering</kwd>
<kwd>non-negative matrix factorization</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1036_reflist_001">
<title>References</title>
<ref id="j_jds1036_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Arumugadevi</surname> <given-names>S</given-names></string-name>, <string-name><surname>Seenivasagam</surname> <given-names>V</given-names></string-name> (<year>2015</year>). <article-title>Comparison of clustering methods for segmenting color images</article-title>. <source>Indian Journal of Science and Technology</source>, <volume>8</volume>(<issue>7</issue>): <fpage>670</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Boutsidis</surname> <given-names>C</given-names></string-name>, <string-name><surname>Gallopoulos</surname> <given-names>E</given-names></string-name> (<year>2008</year>). <article-title>SVD based initialization: a head start for nonnegative matrix factorization</article-title>. <source>Pattern Recognition</source>, <volume>41</volume>(<issue>4</issue>): <fpage>1350</fpage>–<lpage>1362</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Brunet</surname> <given-names>JP</given-names></string-name>, <string-name><surname>Tamayo</surname> <given-names>P</given-names></string-name>, <string-name><surname>Golub</surname> <given-names>TR</given-names></string-name>, <string-name><surname>Mesirov</surname> <given-names>JP</given-names></string-name> (<year>2004</year>). <article-title>Metagenes and molecular pattern discovery using matrix factorization</article-title>. <source>Proceedings of the National Academy of Sciences of the United States of America</source>, <volume>101</volume>(<issue>12</issue>): <fpage>4164</fpage>–<lpage>4169</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_004">
<mixed-citation publication-type="other"> <string-name><surname>Chen</surname> <given-names>WC</given-names></string-name>, <string-name><surname>Maitra</surname> <given-names>R</given-names></string-name> (2015). EMCluster: EM Algorithm for Model-Based Clustering of Finite Mixture Gaussian Distribution. R Package. URL <uri>http://cran.r-project.org/package=EMCluster</uri>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Chiou</surname> <given-names>JM</given-names></string-name>, <string-name><surname>Li</surname> <given-names>PL</given-names></string-name> (<year>2007</year>). <article-title>Functional clustering and identifying substructures of longitudinal data</article-title>. <source>Journal of the Royal Statistical Society. Series B (Statistical Methodology)</source>, <volume>69</volume>(<issue>4</issue>): <fpage>679</fpage>–<lpage>699</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Devarajan</surname> <given-names>K</given-names></string-name> (<year>2008</year>). <article-title>Nonnegative matrix factorization: an analytical and interpretive tool in computational biology</article-title>. <source>PLOS Computational Biology</source>, <volume>4</volume>(<issue>7</issue>): <elocation-id>e1000029</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_007">
<mixed-citation publication-type="chapter"> <string-name><surname>Ding</surname> <given-names>C</given-names></string-name>, <string-name><surname>He</surname> <given-names>X</given-names></string-name>, <string-name><surname>Simon</surname> <given-names>HD</given-names></string-name> (<year>2005</year>). <chapter-title>On the equivalence of nonnegative matrix factorization and spectral clustering</chapter-title>. In: <source>Proceedings of the 2005 SIAM International Conference on Data Mining</source> (<string-name><given-names>H</given-names> <surname>Kargupta</surname></string-name>, <string-name><given-names>J</given-names> <surname>Srivastava</surname></string-name>, <string-name><given-names>C</given-names> <surname>Kamath</surname></string-name>, <string-name><given-names>A</given-names> <surname>Goodman</surname></string-name>, eds.), <fpage>606</fpage>–<lpage>610</lpage>. <publisher-name>SIAM</publisher-name>, <publisher-loc>Philadelphia, PA, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Dong</surname> <given-names>E</given-names></string-name>, <string-name><surname>Du</surname> <given-names>H</given-names></string-name>, <string-name><surname>Gardner</surname> <given-names>L</given-names></string-name> (<year>2020</year>). <article-title>An interactive web-based dashboard to track COVID-19 in real time</article-title>. <source>The Lancet Infectious Diseases</source>, <volume>20</volume>(<issue>5</issue>): <fpage>533</fpage>–<lpage>534</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>D’Urso</surname> <given-names>P</given-names></string-name>, <string-name><surname>De Giovanni</surname> <given-names>L</given-names></string-name>, <string-name><surname>Vitale</surname> <given-names>V</given-names></string-name> (<year>2021</year>a). <article-title>Spatial robust fuzzy clustering of COVID-19 time series based on B-splines</article-title>. <source>Spatial Statistics</source>, <elocation-id>100518</elocation-id>. <uri>https://doi.org/10.1016/j.spasta.2021.100518</uri>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>D’Urso</surname> <given-names>P</given-names></string-name>, <string-name><surname>Mucciardi</surname> <given-names>M</given-names></string-name>, <string-name><surname>Otranto</surname> <given-names>E</given-names></string-name>, <string-name><surname>Vitale</surname> <given-names>V</given-names></string-name> (<year>2021</year>b). <article-title>Community mobility in the European regions during COVID-19 pandemic: a partitioning around medoids with noise cluster based on space-time autoregressive models</article-title>. <source>Spatial Statistics</source>, <elocation-id>100531</elocation-id>. <uri>https://doi.org/10.1016/j.spasta.2021.100531</uri>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Fauci</surname> <given-names>AS</given-names></string-name>, <string-name><surname>Lane</surname> <given-names>HC</given-names></string-name>, <string-name><surname>Redfield</surname> <given-names>RR</given-names></string-name> (<year>2020</year>). <article-title>COVID-19—navigating the uncharted</article-title>. <source>The New England Journal of Medicine</source>, <volume>382</volume>(<issue>13</issue>): <fpage>1268</fpage>–<lpage>1269</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Gaujoux</surname> <given-names>R</given-names></string-name>, <string-name><surname>Seoighe</surname> <given-names>C</given-names></string-name> (<year>2010</year>). <article-title>A flexible R package for nonnegative matrix factorization</article-title>. <source>BMC Bioinformatics</source>, <volume>11</volume>: <fpage>367</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Gelbard</surname> <given-names>R</given-names></string-name>, <string-name><surname>Goldman</surname> <given-names>O</given-names></string-name>, <string-name><surname>Spiegler</surname> <given-names>I</given-names></string-name> (<year>2007</year>). <article-title>Investigating diversity of clustering methods: an empirical comparison</article-title>. <source>Data &amp; Knowledge Engineering</source>, <volume>63</volume>(<issue>1</issue>): <fpage>155</fpage>–<lpage>166</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_014">
<mixed-citation publication-type="journal"> <string-name><surname>Goyal</surname> <given-names>P</given-names></string-name>, <string-name><surname>Choi</surname> <given-names>JJ</given-names></string-name>, <string-name><surname>Pinheiro</surname> <given-names>LC</given-names></string-name>, <string-name><surname>Schenck</surname> <given-names>EJ</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>R</given-names></string-name>, <string-name><surname>Jabri</surname> <given-names>A</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Clinical characteristics of COVID-19 in New York City</article-title>. <source>The New England Journal of Medicine</source>, <volume>382</volume>(<issue>24</issue>): <fpage>2372</fpage>–<lpage>2374</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_015">
<mixed-citation publication-type="chapter"> <string-name><surname>Guillamet</surname> <given-names>D</given-names></string-name>, <string-name><surname>Bressan</surname> <given-names>M</given-names></string-name>, <string-name><surname>Vitria</surname> <given-names>J</given-names></string-name> (<year>2001</year>). <chapter-title>A weighted non-negative matrix factorization for local representations</chapter-title>. In: <source>Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001</source>, volume <volume>I</volume>, <fpage>942</fpage>–<lpage>947</lpage>. <publisher-name>IEEE</publisher-name>, <publisher-loc>Piscataway, NJ, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Hartigan</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Wong</surname> <given-names>MA</given-names></string-name> (<year>1979</year>). <article-title>Algorithm as 136: a <italic>k</italic>-means clustering algorithm</article-title>. <source>Journal of the Royal Statistical Society. Series C (Applied Statistics)</source>, <volume>28</volume>(<issue>1</issue>): <fpage>100</fpage>–<lpage>108</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Hirano</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>X</given-names></string-name>, <string-name><surname>Tsumoto</surname> <given-names>S</given-names></string-name> (<year>2004</year>). <article-title>Comparison of clustering methods for clinical databases</article-title>. <source>Information Sciences</source>, <volume>159</volume>(<issue>3–4</issue>): <fpage>155</fpage>–<lpage>165</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Hubert</surname> <given-names>L</given-names></string-name>, <string-name><surname>Arabie</surname> <given-names>P</given-names></string-name> (<year>1985</year>). <article-title>Comparing partitions</article-title>. <source>Journal of Classification</source>, <volume>2</volume>: <fpage>193</fpage>–<lpage>218</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Jacques</surname> <given-names>J</given-names></string-name>, <string-name><surname>Preda</surname> <given-names>C</given-names></string-name> (<year>2014</year>). <article-title>Functional data clustering: a survey</article-title>. <source>Advances in Data Analysis and Classification</source>, <volume>8</volume>: <fpage>231</fpage>–<lpage>255</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_020">
<mixed-citation publication-type="chapter"> <string-name><surname>Kanagal</surname> <given-names>B</given-names></string-name>, <string-name><surname>Sindhwani</surname> <given-names>V</given-names></string-name> (<year>2010</year>). <chapter-title>Rank selection in low-rank matrix approximations: a study of cross-validation for NMFs</chapter-title>. In: <source>Low-Rank Methods for Large-scale Machine Learning (Workshop in NIPS’10)</source> (<string-name><given-names>A</given-names> <surname>Gretton</surname></string-name>, <string-name><given-names>M</given-names> <surname>Mahoney</surname></string-name>, <string-name><given-names>M</given-names> <surname>Mohri</surname></string-name>, <string-name><given-names>A</given-names> <surname>Talwalkar</surname></string-name>, eds.). <uri>https://www.cs.umd.edu/~bhargav/nips2010.pdf</uri>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_021">
<mixed-citation publication-type="chapter"> <string-name><surname>Kim</surname> <given-names>YD</given-names></string-name>, <string-name><surname>Choi</surname> <given-names>S</given-names></string-name> (<year>2009</year>). <chapter-title>Weighted nonnegative matrix factorization</chapter-title>. In: <source>2009 IEEE International Conference on Acoustics, Speech and Signal Processing</source>, <fpage>1541</fpage>–<lpage>1544</lpage>. <publisher-name>IEEE</publisher-name>, <publisher-loc>Piscataway, NJ, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Lauer</surname> <given-names>SA</given-names></string-name>, <string-name><surname>Grantz</surname> <given-names>KH</given-names></string-name>, <string-name><surname>Bi</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>FK</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Meredith</surname> <given-names>HR</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application</article-title>. <source>Annals of Internal Medicine</source>, <volume>172</volume>(<issue>9</issue>): <fpage>577</fpage>–<lpage>582</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_023">
<mixed-citation publication-type="chapter"> <string-name><surname>Lee</surname> <given-names>DD</given-names></string-name>, <string-name><surname>Seung</surname> <given-names>HS</given-names></string-name> (<year>2000</year>). <chapter-title>Algorithms for non-negative matrix factorization</chapter-title>. In: <source>Proceedings of the 13th International Conference on Neural Information Processing Systems (NIPS’00)</source> (<string-name><given-names>TK</given-names> <surname>Leen</surname></string-name>, <string-name><given-names>TG</given-names> <surname>Dietterich</surname></string-name>, <string-name><given-names>V</given-names> <surname>Tresp</surname></string-name>, eds.), <fpage>535</fpage>–<lpage>541</lpage>. <publisher-name>MIT Press</publisher-name>, <publisher-loc>Cambridge, MA, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_024">
<mixed-citation publication-type="other"> <string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>P</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>Q</given-names></string-name> (2022). Exploring COVID-19 in mainland China during the lockdown of Wuhan via functional data analysis. <italic>Communications for Statistical Applications and Methods</italic>. In press.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>Liao</surname> <given-names>TW</given-names></string-name> (<year>2005</year>). <article-title>Clustering of time series data—a survey</article-title>. <source>Pattern Recognition</source>, <volume>38</volume>(<issue>11</issue>): <fpage>1857</fpage>–<lpage>1874</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_026">
<mixed-citation publication-type="chapter"> <string-name><surname>Lin</surname> <given-names>J</given-names></string-name>, <string-name><surname>Vlachos</surname> <given-names>M</given-names></string-name>, <string-name><surname>Keogh</surname> <given-names>E</given-names></string-name>, <string-name><surname>Gunopulos</surname> <given-names>D</given-names></string-name> (<year>2004</year>). <chapter-title>Iterative incremental clustering of time series</chapter-title>. In: <source>Proceedings of the 9th International Conference on Extending Database Technology</source> (<string-name><given-names>E</given-names> <surname>Bertino</surname></string-name>, <string-name><given-names>S</given-names> <surname>Christodoulakis</surname></string-name>, <string-name><given-names>D</given-names> <surname>Plexousakis</surname></string-name>, <string-name><given-names>V</given-names> <surname>Christophides</surname></string-name>, <string-name><given-names>M</given-names> <surname>Koubarakis</surname></string-name>, <string-name><given-names>K</given-names> <surname>Böhm</surname></string-name>, <string-name><given-names>E</given-names> <surname>Ferrari</surname></string-name>, eds.), <fpage>106</fpage>–<lpage>122</lpage>. <publisher-name>Springer-Verlag</publisher-name>, <publisher-loc>Berlin, Heidelberg, Germany</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_027">
<mixed-citation publication-type="chapter"> <string-name><surname>Madhulatha</surname> <given-names>TS</given-names></string-name> (<year>2011</year>). <chapter-title>Comparison between <italic>k</italic>-means and <italic>k</italic>-medoids clustering algorithms</chapter-title>. In: <source>Advances in Computing and Information Technology</source> (<string-name><given-names>DC</given-names> <surname>Wyld</surname></string-name>, <string-name><given-names>M</given-names> <surname>Wozniak</surname></string-name>, <string-name><given-names>N</given-names> <surname>Chaki</surname></string-name>, <string-name><given-names>N</given-names> <surname>Meghanathan</surname></string-name>, <string-name><given-names>D</given-names> <surname>Nagamalai</surname></string-name>, eds.), <fpage>472</fpage>–<lpage>481</lpage>. <publisher-name>Springer</publisher-name>, <publisher-loc>Berlin, Heidelberg, Germany</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_028">
<mixed-citation publication-type="journal"> <string-name><surname>Moghadas</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Shoukat</surname> <given-names>A</given-names></string-name>, <string-name><surname>Fitzpatrick</surname> <given-names>MC</given-names></string-name>, <string-name><surname>Wells</surname> <given-names>CR</given-names></string-name>, <string-name><surname>Sah</surname> <given-names>P</given-names></string-name>, <string-name><surname>Pandey</surname> <given-names>A</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Projecting hospital utilization during the COVID-19 outbreaks in the United States</article-title>. <source>Proceedings of the National Academy of Sciences of the United States of America</source>, <volume>117</volume>(<issue>16</issue>): <fpage>9122</fpage>–<lpage>9126</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Shaw</surname> <given-names>C</given-names></string-name>, <string-name><surname>King</surname> <given-names>G</given-names></string-name> (<year>1992</year>). <article-title>Using cluster analysis to classify time series</article-title>. <source>Physica D: Nonlinear Phenomena</source>, <volume>58</volume>(<issue>1–4</issue>): <fpage>288</fpage>–<lpage>298</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_030">
<mixed-citation publication-type="other"> <string-name><surname>Tang</surname> <given-names>C</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>T</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>P</given-names></string-name> (2022). Functional data analysis: an application to COVID-19 data in the United States. <italic>Quantitative Biology</italic>. In press. arXiv preprint: <uri>https://arxiv.org/abs/2009.08363</uri>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_031">
<mixed-citation publication-type="other"> <string-name><surname>Tian</surname> <given-names>T</given-names></string-name>, <string-name><surname>Tan</surname> <given-names>J</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>H</given-names></string-name> (2020). Evaluate the risk of resumption of business for the states of New York, New Jersey and Connecticut via a pre-symptomatic and asymptomatic transmission model of COVID-19. medRxiv preprint: <uri>https://doi.org/10.1101/2020.05.16.20103747</uri>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_032">
<mixed-citation publication-type="journal"> <string-name><surname>Vitale</surname> <given-names>V</given-names></string-name>, <string-name><surname>D’Urso</surname> <given-names>P</given-names></string-name>, <string-name><surname>De Giovanni</surname> <given-names>L</given-names></string-name> (<year>2021</year>). <article-title>Spatio-temporal object-oriented Bayesian network modelling of the COVID-19 Italian outbreak data</article-title>. <source>Spatial Statistics</source>, <elocation-id>100529</elocation-id>. <uri>https://doi.org/10.1016/j.spasta.2021.100529</uri>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_033">
<mixed-citation publication-type="journal"> <string-name><surname>Wang</surname> <given-names>G</given-names></string-name>, <string-name><surname>Kossenkov</surname> <given-names>AV</given-names></string-name>, <string-name><surname>Ochs</surname> <given-names>MF</given-names></string-name> (<year>2006</year>). <article-title>LS-NMF: a modified non-negative matrix factorization algorithm utilizing uncertainty estimates</article-title>. <source>BMC bioinformatics</source>, <volume>7</volume>: <fpage>175</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1036_ref_034">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>P</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>T</given-names></string-name>, <string-name><surname>Xie</surname> <given-names>SX</given-names></string-name> (<year>2020</year>). <article-title>Meta-analysis of several epidemic characteristics of COVID-19</article-title>. <source>Journal of Data Science</source>, <volume>18</volume>(<issue>3</issue>): <fpage>536</fpage>–<lpage>549</lpage>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
