<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn>
<issn pub-type="ppub">1680-743X</issn>
<issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS992</article-id>
<article-id pub-id-type="doi">10.6339/21-JDS992</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Assessment of Effects of Age and Gender on the Incubation Period of COVID-19 with a Mixture Regression Model</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Zheng</surname><given-names>Siming</given-names></name><xref ref-type="aff" rid="j_jds992_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Qin</surname><given-names>Jing</given-names></name><xref ref-type="aff" rid="j_jds992_aff_002">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhou</surname><given-names>Yong</given-names></name><email xlink:href="mailto:yzhou@amss.ac.cn.">yzhou@amss.ac.cn.</email><xref ref-type="aff" rid="j_jds992_aff_003">3</xref><xref ref-type="aff" rid="j_jds992_aff_004">4</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds992_aff_001"><label>1</label>Academy of Mathematics and Systems Science, <institution>University of Chinese Academy of Sciences</institution>, Beijing, <country>China</country></aff>
<aff id="j_jds992_aff_002"><label>2</label>National Institute of Allergy and Infectious Diseases, <institution>National Institutes of Health</institution>, Bethesda, Maryland, <country>U.S.A.</country></aff>
<aff id="j_jds992_aff_003"><label>3</label>Key Laboratory of Advanced Theory and Application in Statistics and Data Science, <institution>MOE</institution></aff>
<aff id="j_jds992_aff_004"><label>4</label>Academy of Statistics and Interdisciplinary Sciences, Faculty of Economics and Management, <institution>East China Normal University</institution>, Shanghai, <country>China</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:yzhou@amss.ac.cn.">yzhou@amss.ac.cn.</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2021</year></pub-date><pub-date pub-type="epub"><day>7</day><month>5</month><year>2021</year></pub-date><volume>19</volume><issue>2</issue><fpage>253</fpage><lpage>268</lpage><supplementary-material id="S1" content-type="document" xlink:href="jds992_s001.pdf" mimetype="application" mime-subtype="pdf">
<caption>
<title>Supplementary Material</title>
<p>The Supplementary Material including the detailed proofs of Theorems 1 and 2, can be found on the <italic>Journal of Data Science</italic> website. The data/code used in the analyses can be found at <uri>https://github.com/SimonsZheng/Assessment-of-Effects-of-Age-and-Gender-on-COVID-19</uri>.</p>
</caption>
</supplementary-material>
<history>
<date date-type="received"><day>1</day><month>4</month><year>2020</year></date>
<date date-type="accepted"><day>1</day><month>5</month><year>2020</year></date>
</history>
<permissions><copyright-statement>2021 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2021</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Following the outbreak of COVID-19, various containment measures have been taken, including the use of quarantine. At present, the quarantine period is the same for everyone, since it is implicitly assumed that the incubation period distribution of COVID-19 is the same regardless of age or gender. For testing the effects of age and gender on the incubation period of COVID-19, a novel two-component mixture regression model is proposed. An expectation-maximization (EM) algorithm is adopted to obtain estimates of the parameters of interest, and the simulation results show that the proposed method outperforms the simple regression method and has robustness. The proposed method is applied to a Zhejiang COVID-19 dataset, and it is found that age and gender statistically have no effect on the incubation period of COVID-19, which indicates that the quarantine measure currently in operation is reasonable.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>EM algorithm</kwd>
<kwd>incubation period</kwd>
<kwd>length-biased data</kwd>
<kwd>mixture model</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds992_reflist_001">
<title>References</title>
<ref id="j_jds992_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Aitkin</surname> <given-names>M</given-names></string-name>, <string-name><surname>Rubin</surname> <given-names>D</given-names></string-name> (<year>1985</year>). <article-title>Estimation and hypothesis testing in finite mixture models</article-title>. <source>Journal of the Royal Statistical Society, Series B</source>, <volume>47</volume>(<issue>1</issue>): <fpage>67</fpage>–<lpage>75</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Backer</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Klinkenberg</surname> <given-names>D</given-names></string-name>, <string-name><surname>Wallinga</surname> <given-names>J</given-names></string-name> (<year>2020</year>). <article-title>Incubation period of 2019 Novel Coronavirus (2019-nCoV) infections among travellers from Wuhan, China, 20–28 January 2020</article-title>. <source>Euro Surveill</source>, <volume>25</volume>(<issue>5</issue>): <elocation-id>2000062</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Boldea</surname> <given-names>O</given-names></string-name>, <string-name><surname>Magnus</surname> <given-names>J</given-names></string-name> (<year>2009</year>). <article-title>Maximum likelihood estimation of the multivariate normal mixture model</article-title>. <source>Journal of the American Statistical Association</source>, <volume>104</volume>(<issue>488</issue>): <fpage>1539</fpage>–<lpage>1549</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Chen</surname> <given-names>JH</given-names></string-name> (<year>2017</year>). <article-title>Consistency of the MLE under mixture models</article-title>. <source>Statistical Science</source>, <volume>32</volume>(<issue>1</issue>): <fpage>47</fpage>–<lpage>63</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Dimitris</surname> <given-names>K</given-names></string-name>, <string-name><surname>Evdokia</surname> <given-names>X</given-names></string-name> (<year>2003</year>). <article-title>Choosing initial values for the EM algorithm for finite mixtures</article-title>. <source><italic>Computational Statistics</italic> &amp; <italic>Data Analysis</italic></source>, <volume>41</volume>(<issue>3–4</issue>): <fpage>577</fpage>–<lpage>590</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_006">
<mixed-citation publication-type="book"> <string-name><surname>Everitt</surname> <given-names>BS</given-names></string-name>, <string-name><surname>Hand</surname> <given-names>DJ</given-names></string-name> (<year>1981</year>). <source>Finite Mixture Distributions</source>. <publisher-name>Chapman and Hall</publisher-name>, <publisher-loc>London</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Guan</surname> <given-names>WJ</given-names></string-name>, <string-name><surname>Ni</surname> <given-names>ZY</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>WH</given-names></string-name>, <string-name><surname>Ou</surname> <given-names>CQ</given-names></string-name>, <string-name><surname>He</surname> <given-names>JX</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Clinical characteristics of Coronavirus Disease 2019 in China</article-title>. <source>The New England Journal of Medicine</source>, <volume>382</volume>: <fpage>1708</fpage>–<lpage>1720</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Jiang</surname> <given-names>WX</given-names></string-name>, <string-name><surname>Tanner</surname> <given-names>AM</given-names></string-name> (<year>1999</year>). <article-title>Hierarchical mixtures-of-experts for exponential family regression models: Approximation and maximum likelihood estimation</article-title>. <source>Annals of Statistics</source>, <volume>27</volume>(<issue>3</issue>): <fpage>987</fpage>–<lpage>1011</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Khalili</surname> <given-names>A</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>JH</given-names></string-name> (<year>2007</year>). <article-title>Variable selection in finite mixture of regression models</article-title>. <source>Journal of the American Statistical Association</source>, <volume>102</volume>(<issue>479</issue>): <fpage>1025</fpage>–<lpage>1038</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Lauer</surname> <given-names>S</given-names></string-name>, <string-name><surname>Grantz</surname> <given-names>K</given-names></string-name>, <string-name><surname>Bi</surname> <given-names>QF</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>F</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>QL</given-names></string-name>, <string-name><surname>Meredith</surname> <given-names>H</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>The incubation period of Coronavirus Disease 2019 (COVID-19) from publicly reported confirmed cases: Estimation and application</article-title>. <source>Annals of Internal Medicine</source>, <volume>172</volume>(<issue>9</issue>): <fpage>577</fpage>–<lpage>582</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Li</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Guan</surname> <given-names>XH</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>P</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>XY</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>L</given-names></string-name>, <string-name><surname>Tong</surname> <given-names>YQ</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Early transmission dynamics in Wuhan, China, of Novel Coronavirus infected pneumonia</article-title>. <source>The New England Journal of Medicine</source>, <volume>382</volume>(<issue>13</issue>): <fpage>1199</fpage>–<lpage>1207</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_012">
<mixed-citation publication-type="book"> <string-name><surname>Liang</surname> <given-names>FM</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>CH</given-names></string-name>, <string-name><surname>Carroll</surname> <given-names>RJ</given-names></string-name> (<year>2010</year>). <source>Advanced Markov Chain Monte Carlo Methods: Learning from Past Samples</source>. <publisher-name>Wiley</publisher-name>, <publisher-loc>New York</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Linton</surname> <given-names>NM</given-names></string-name>, <string-name><surname>Kobayashi</surname> <given-names>T</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>YC</given-names></string-name>, <string-name><surname>Hayashi</surname> <given-names>K</given-names></string-name>, <string-name><surname>Akhmetzhanov</surname> <given-names>A</given-names></string-name>, <string-name><surname>Jung</surname> <given-names>SM</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Incubation period and other epidemiological characteristics of 2019 Novel Coronavirus infections with right truncation: A statistical analysis of publicly available case data</article-title>. <source>Journal of Clinical Medicine</source>, <volume>9</volume>(<issue>2</issue>): <fpage>538</fpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_014">
<mixed-citation publication-type="book"> <string-name><surname>McLachlan</surname> <given-names>GJ</given-names></string-name>, <string-name><surname>Peel</surname> <given-names>D</given-names></string-name> (<year>2000</year>). <source>Finite Mixture Model</source>. <publisher-name>Wiley</publisher-name>, <publisher-loc>New York</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Qin</surname> <given-names>J</given-names></string-name>, <string-name><surname>You</surname> <given-names>C</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>QS</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>TJ</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>SC</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>XH</given-names></string-name> (<year>2020</year>). <article-title>Estimation of incubation period distribution of COVID-19 using disease onset forward time: A novel cross-sectional and forward follow-up study</article-title>. <source>Science Advances</source>, <volume>6</volume>(<issue>33</issue>): <fpage>eabc1202</fpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Qin</surname> <given-names>YC</given-names></string-name>, <string-name><surname>Priebe</surname> <given-names>CE</given-names></string-name> (<year>2013</year>). <article-title>Maximum <inline-formula id="j_jds992_ineq_001"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">q</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${L_{q}}$]]></tex-math></alternatives></inline-formula>-likelihood estimation via the expectation-maximization algorithm: A robust estimation of mixture models</article-title>. <source>Journal of the American Statistical Association</source>, <volume>108</volume>(<issue>503</issue>): <fpage>914</fpage>–<lpage>928</lpage>.</mixed-citation>
</ref>
<ref id="j_jds992_ref_017">
<mixed-citation publication-type="other"> <string-name><surname>Wolfe</surname> <given-names>J</given-names></string-name> (1971). A Monte Carlo study of the sampling distribution of the likelihood ratio for mixtures of normal distributions. Naval Personnel and Training Research Laboratory San Diego, Technical Bulletin STB 72-2.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
