<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn>
<issn pub-type="ppub">1680-743X</issn>
<issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1006</article-id>
<article-id pub-id-type="doi">10.6339/21-JDS1006</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>COVID-19 Special Issue</subject></subj-group></article-categories>
<title-group>
<article-title>Statistical Challenges in the Analysis of Sequence and Structure Data for the COVID-19 Spike Protein</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>He</surname><given-names>Shiyu</given-names></name><xref ref-type="aff" rid="j_jds1006_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Wong</surname><given-names>Samuel W.K.</given-names></name><email xlink:href="mailto:samuel.wong@uwaterloo.ca">samuel.wong@uwaterloo.ca</email><xref ref-type="aff" rid="j_jds1006_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds1006_aff_001"><label>1</label>Department of Statistics and Actuarial Science, <institution>University of Waterloo</institution>, Waterloo, ON, <country>Canada</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:samuel.wong@uwaterloo.ca">samuel.wong@uwaterloo.ca</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2021</year></pub-date><pub-date pub-type="epub"><day>22</day><month>2</month><year>2021</year></pub-date>
<volume>19</volume><issue>2</issue><fpage>314</fpage><lpage>333</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1006_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>The processed data, R code, and instructions for reproducing the results in this paper are provided in a supplementary .zip file.</p>
</caption>
</supplementary-material>
<history>
<date date-type="received"><day>31</day><month>12</month><year>2020</year></date>
<date date-type="accepted"><day>18</day><month>1</month><year>2021</year></date>
</history>
<permissions><copyright-statement>2021 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2021</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>As the major target of many vaccines and neutralizing antibodies against SARS-CoV-2, the spike (S) protein is observed to mutate over time. In this paper, we present statistical approaches to tackle some challenges associated with the analysis of S-protein data. We build a Bayesian hierarchical model to study the temporal and spatial evolution of S-protein sequences, after grouping the sequences into representative clusters. We then apply sampling methods to investigate possible changes to the S-protein’s 3-D structure as a result of commonly observed mutations. While the increasing spread of D614G variants has been noted in other research, our results also show that the co-occurring mutations of D614G together with S477N or A222V may spread even more rapidly, as quantified by our model estimates.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>Bayesian hierarchical models</kwd>
<kwd>compositional data analysis</kwd>
<kwd>conformational sampling</kwd>
<kwd>mutant clusters</kwd>
<kwd>SARS-CoV-2</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1006_reflist_001">
<title>References</title>
<ref id="j_jds1006_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Aitchison</surname> <given-names>J</given-names></string-name> (<year>1982</year>). <article-title>The statistical analysis of compositional data</article-title>. <source>Journal of the Royal Statistical Society, Series B, Methodological</source>, <volume>44</volume>(<issue>2</issue>): <fpage>139</fpage>–<lpage>160</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Aitchison</surname> <given-names>J</given-names></string-name> (<year>1999</year>). <article-title>Logratios and natural laws in compositional data analysis</article-title>. <source>Mathematical Geology</source>, <volume>31</volume>(<issue>5</issue>): <fpage>563</fpage>–<lpage>580</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Amanat</surname> <given-names>F</given-names></string-name>, <string-name><surname>Krammer</surname> <given-names>F</given-names></string-name> (<year>2020</year>). <article-title>SARS-CoV-2 vaccines: Status report</article-title>. <source>Immunity</source>, <volume>52</volume>(<issue>4</issue>): <fpage>583</fpage>–<lpage>589</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Bernstein</surname> <given-names>FC</given-names></string-name>, <string-name><surname>Koetzle</surname> <given-names>TF</given-names></string-name>, <string-name><surname>Williams</surname> <given-names>GJ</given-names></string-name>, <string-name><surname>Meyer</surname> <given-names>EF</given-names></string-name>, <string-name><surname>Brice</surname> <given-names>MD</given-names></string-name>, <string-name><surname>Rodgers</surname> <given-names>JR</given-names></string-name>, <etal>et al.</etal> (<year>1977</year>). <article-title>The protein data bank</article-title>. <source>European Journal of Biochemistry</source>, <volume>80</volume>(<issue>2</issue>): <fpage>319</fpage>–<lpage>324</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Botev</surname> <given-names>ZI</given-names></string-name>, <string-name><surname>Grotowski</surname> <given-names>JF</given-names></string-name>, <string-name><surname>Kroese</surname> <given-names>DP</given-names></string-name>, <etal>et al.</etal> (<year>2010</year>). <article-title>Kernel density estimation via diffusion</article-title>. <source>The Annals of Statistics</source>, <volume>38</volume>(<issue>5</issue>): <fpage>2916</fpage>–<lpage>2957</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Cargnoni</surname> <given-names>C</given-names></string-name>, <string-name><surname>Müller</surname> <given-names>P</given-names></string-name>, <string-name><surname>West</surname> <given-names>M</given-names></string-name> (<year>1997</year>). <article-title>Bayesian forecasting of multinomial time series through conditionally Gaussian dynamic models</article-title>. <source>Journal of the American Statistical Association</source>, <volume>92</volume>(<issue>438</issue>): <fpage>640</fpage>–<lpage>647</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Carpenter</surname> <given-names>B</given-names></string-name>, <string-name><surname>Gelman</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hoffman</surname> <given-names>MD</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>D</given-names></string-name>, <string-name><surname>Goodrich</surname> <given-names>B</given-names></string-name>, <string-name><surname>Betancourt</surname> <given-names>M</given-names></string-name>, <etal>et al.</etal> (<year>2017</year>). <article-title>STAN: A probabilistic programming language</article-title>. <source>Journal of Statistical Software</source>, <volume>76</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>32</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_008">
<mixed-citation publication-type="other"> <string-name><surname>Chen</surname> <given-names>AT</given-names></string-name>, <string-name><surname>Altschuler</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zhan</surname> <given-names>SH</given-names></string-name>, <string-name><surname>Chan</surname> <given-names>YA</given-names></string-name>, <string-name><surname>Deverman</surname> <given-names>BE</given-names></string-name> (2020a). COVID-19 CG: Tracking SARS-CoV-2 mutations by locations and dates of interest. bioRxiv preprint: <uri>https://doi.org/10.1101/2020.09.23.310565</uri>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Chen</surname> <given-names>J</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>R</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wei</surname> <given-names>G-W</given-names></string-name> (<year>2020</year>b). <article-title>Mutations strengthened SARS-CoV-2 infectivity</article-title>. <source>Journal of Molecular Biology</source>, <volume>432</volume>(<issue>19</issue>): <fpage>5212</fpage>–<lpage>5226</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Diehl</surname> <given-names>WE</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>AE</given-names></string-name>, <string-name><surname>Grubaugh</surname> <given-names>ND</given-names></string-name>, <string-name><surname>Carvalho</surname> <given-names>LM</given-names></string-name>, <string-name><surname>Kim</surname> <given-names>K</given-names></string-name>, <string-name><surname>Kyawe</surname> <given-names>PP</given-names></string-name>, <etal>et al.</etal> (<year>2016</year>). <article-title>Ebola virus glycoprotein with increased infectivity dominated the 2013–2016 epidemic</article-title>. <source>Cell</source>, <volume>167</volume>(<issue>4</issue>): <fpage>1088</fpage>–<lpage>1098</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Dong</surname> <given-names>E</given-names></string-name>, <string-name><surname>Du</surname> <given-names>H</given-names></string-name>, <string-name><surname>Gardner</surname> <given-names>L</given-names></string-name> (<year>2020</year>). <article-title>An interactive web-based dashboard to track COVID-19 in real time</article-title>. <source>Lancet. Infectious Diseases</source>, <volume>20</volume>(<issue>5</issue>): <fpage>533</fpage>–<lpage>534</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Duffy</surname> <given-names>S</given-names></string-name> (<year>2018</year>). <article-title>Why are RNA virus mutation rates so damn high?</article-title> <source>PLoS Biology</source>, <volume>16</volume>(<issue>8</issue>): <elocation-id>e3000003</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_013">
<mixed-citation publication-type="other"> European Commission (2020). Coronavirus: Commission proposes more clarity and predictability of any measures restricting free movement in the European Union. <uri>https://ec.europa.eu/commission/presscorner/detail/en/ip_20_1555</uri>. Last checked on Dec 20, 2020.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_014">
<mixed-citation publication-type="other"> <string-name><surname>Hodcroft</surname> <given-names>EB</given-names></string-name>, <string-name><surname>Zuber</surname> <given-names>M</given-names></string-name>, <string-name><surname>Nadeau</surname> <given-names>S</given-names></string-name>, <string-name><surname>Comas</surname> <given-names>I</given-names></string-name>, <string-name><surname>Candelas</surname> <given-names>FG</given-names></string-name>, <string-name><surname>Stadler</surname> <given-names>T</given-names></string-name>, et al. (2020). Emergence and spread of a SARS-CoV-2 variant through Europe in the summer of 2020. medRxiv preprint: <uri>https://doi.org/10.1101/2020.10.25.20219063</uri>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Huddleston</surname> <given-names>J</given-names></string-name>, <string-name><surname>Barnes</surname> <given-names>JR</given-names></string-name>, <string-name><surname>Rowe</surname> <given-names>T</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Kondor</surname> <given-names>R</given-names></string-name>, <string-name><surname>Wentworth</surname> <given-names>DE</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Integrating genotypes and phenotypes improves long-term forecasts of seasonal influenza A/H3N2 evolution</article-title>. <source>eLife</source>, <volume>9</volume>:<elocation-id>e60067</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Korber</surname> <given-names>B</given-names></string-name>, <string-name><surname>Fischer</surname> <given-names>W</given-names></string-name>, <string-name><surname>Gnanakaran</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yoon</surname> <given-names>H</given-names></string-name>, <string-name><surname>Theiler</surname> <given-names>J</given-names></string-name>, <string-name><surname>Abfalterer</surname> <given-names>W</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Tracking changes in SARS-CoV-2 Spike: Evidence that D614G increases infectivity of the COVID-19 virus</article-title>. <source>Cell</source>, <volume>182</volume>(<issue>4</issue>): <fpage>812</fpage>–<lpage>827</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Krammer</surname> <given-names>F</given-names></string-name> (<year>2020</year>). <article-title>SARS-CoV-2 vaccines in development</article-title>. <source>Nature</source>, <volume>586</volume>(<issue>7830</issue>): <fpage>516</fpage>–<lpage>527</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Kryshtafovych</surname> <given-names>A</given-names></string-name>, <string-name><surname>Schwede</surname> <given-names>T</given-names></string-name>, <string-name><surname>Topf</surname> <given-names>M</given-names></string-name>, <string-name><surname>Fidelis</surname> <given-names>K</given-names></string-name>, <string-name><surname>Moult</surname> <given-names>J</given-names></string-name> (<year>2019</year>). <article-title>Critical assessment of methods of protein structure prediction (CASP) — Round XIII</article-title>. <source>Proteins: Structure, Function, and Bioinformatics</source>, <volume>87</volume>(<issue>12</issue>): <fpage>1011</fpage>–<lpage>1020</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Lauer</surname> <given-names>SA</given-names></string-name>, <string-name><surname>Grantz</surname> <given-names>KH</given-names></string-name>, <string-name><surname>Bi</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>FK</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Meredith</surname> <given-names>HR</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: Estimation and application</article-title>. <source>Annals of Internal Medicine</source>, <volume>172</volume>(<issue>9</issue>): <fpage>577</fpage>–<lpage>582</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_020">
<mixed-citation publication-type="journal"> <string-name><surname>Lauring</surname> <given-names>AS</given-names></string-name>, <string-name><surname>Andino</surname> <given-names>R</given-names></string-name> (<year>2010</year>). <article-title>Quasispecies theory and the behavior of RNA viruses</article-title>. <source>PLoS Pathogens</source>, <volume>6</volume>(<issue>7</issue>): <elocation-id>e1001005</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Li</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Nie</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Hao</surname> <given-names>H</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>S</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>The impact of mutations in SARS-CoV-2 spike on viral infectivity and antigenicity</article-title>. <source>Cell</source>, <volume>182</volume>(<issue>5</issue>): <fpage>1284</fpage>–<lpage>1294</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Ning</surname> <given-names>T</given-names></string-name>, <string-name><surname>Nie</surname> <given-names>J</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Li</surname> <given-names>C</given-names></string-name>, <string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Q</given-names></string-name>, <etal>et al.</etal> (<year>2019</year>). <article-title>Antigenic drift of influenza a (H7N9) virus hemagglutinin</article-title>. <source>The Journal of Infectious Diseases</source>, <volume>219</volume>(<issue>1</issue>): <fpage>19</fpage>–<lpage>25</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Onuchic</surname> <given-names>JN</given-names></string-name>, <string-name><surname>Luthey-Schulten</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wolynes</surname> <given-names>PG</given-names></string-name> (<year>1997</year>). <article-title>Theory of protein folding: The energy landscape perspective</article-title>. <source>Annual Review of Physical Chemistry</source>, <volume>48</volume>(<issue>1</issue>): <fpage>545</fpage>–<lpage>600</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Phan</surname> <given-names>T</given-names></string-name> (<year>2020</year>). <article-title>Novel coronavirus: From discovery to clinical diagnostics</article-title>. <source>Infection, Genetics and Evolution</source>, <volume>79</volume>: <elocation-id>104211</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>Schaefer</surname> <given-names>C</given-names></string-name>, <string-name><surname>Rost</surname> <given-names>B</given-names></string-name> (<year>2012</year>). <article-title>Predict impact of single amino acid change upon protein structure</article-title>. <source>BMC Genomics</source>, <volume>13</volume>(<issue>S4</issue>): <fpage>1</fpage>–<lpage>10</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_026">
<mixed-citation publication-type="journal"> <string-name><surname>Sedova</surname> <given-names>M</given-names></string-name>, <string-name><surname>Jaroszewski</surname> <given-names>L</given-names></string-name>, <string-name><surname>Alisoltani</surname> <given-names>A</given-names></string-name>, <string-name><surname>Godzik</surname> <given-names>A</given-names></string-name> (<year>2020</year>). <article-title>Coronavirus3d: 3d structural visualization of COVID-19 genomic divergence</article-title>. <source>Bioinformatics</source>, <volume>36</volume>(<issue>15</issue>): <fpage>4360</fpage>–<lpage>4362</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Tang</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>J</given-names></string-name> (<year>2014</year>). <article-title>Fast protein loop sampling and structure prediction using distance-guided sequential chain-growth Monte Carlo method</article-title>. <source>PLoS Computational Biology</source>, <volume>10</volume>: <fpage>e1003539</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_028">
<mixed-citation publication-type="journal"> <string-name><surname>Toyoshima</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Nemoto</surname> <given-names>K</given-names></string-name>, <string-name><surname>Matsumoto</surname> <given-names>S</given-names></string-name>, <string-name><surname>Nakamura</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Kiyotani</surname> <given-names>K</given-names></string-name> (<year>2020</year>). <article-title>SARS-CoV-2 genomic variations associated with mortality rate of COVID-19</article-title>. <source>Journal of Human Genetics</source>, <volume>65</volume>: <fpage>1075</fpage>–<lpage>1082</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Wan</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Shang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Graham</surname> <given-names>R</given-names></string-name>, <string-name><surname>Baric</surname> <given-names>RS</given-names></string-name>, <string-name><surname>Li</surname> <given-names>F</given-names></string-name> (<year>2020</year>). <article-title>Receptor recognition by the novel coronavirus from Wuhan: An analysis based on decade-long structural studies of SARS coronavirus</article-title>. <source>Journal of Virology</source>, <volume>94</volume>(<issue>7</issue>): <elocation-id>e00127-20</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_030">
<mixed-citation publication-type="journal"> <string-name><surname>Ward</surname> <given-names>JH</given-names></string-name> (<year>1963</year>). <article-title>Hierarchical grouping to optimize an objective function</article-title>. <source>Journal of the American Statistical Association</source>, <volume>58</volume>(<issue>301</issue>): <fpage>236</fpage>–<lpage>244</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_031">
<mixed-citation publication-type="chapter"> <string-name><surname>Webb</surname> <given-names>B</given-names></string-name>, <string-name><surname>Sali</surname> <given-names>A</given-names></string-name> (<year>2017</year>). <chapter-title>Protein structure modeling with MODELLER</chapter-title>. In: <source>Functional Genomics</source> (<string-name><given-names>M</given-names> <surname>Kaufmann</surname></string-name>, <string-name><given-names>C</given-names> <surname>Klinger</surname></string-name>, <string-name><given-names>A</given-names> <surname>Savelsbergh</surname></string-name>, eds.), <fpage>39</fpage>–<lpage>54</lpage>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_032">
<mixed-citation publication-type="other"> WHO (2020a). Coronavirus disease (COVID-19) situation dashboard. <uri>https://who.sprinklr.com/</uri>. Last checked on Dec 19, 2020.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_033">
<mixed-citation publication-type="other"> WHO (2020b). Draft landscape of COVID-19 candidate vaccines. <uri>https://www.who.int/publications/m/item/draft-landscape-of-covid-19-candidate-vaccines</uri>. Last checked on Dec 20, 2020.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_034">
<mixed-citation publication-type="journal"> <string-name><surname>Wong</surname> <given-names>SW</given-names></string-name> (<year>2020</year>). <article-title>Assessing the impacts of mutations to the structure of COVID-19 spike protein via sequential Monte Carlo</article-title>. <source>Journal of Data Science</source>, <volume>18</volume>(<issue>3</issue>): <fpage>511</fpage>–<lpage>525</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_035">
<mixed-citation publication-type="journal"> <string-name><surname>Wong</surname> <given-names>SW</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>JS</given-names></string-name>, <string-name><surname>Kou</surname> <given-names>S</given-names></string-name> (<year>2018</year>). <article-title>Exploring the conformational space for protein folding with sequential Monte Carlo</article-title>. <source>Annals of Applied Statistics</source>, <volume>12</volume>(<issue>3</issue>): <fpage>1628</fpage>–<lpage>1654</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_036">
<mixed-citation publication-type="journal"> <string-name><surname>Wrapp</surname> <given-names>D</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>N</given-names></string-name>, <string-name><surname>Corbett</surname> <given-names>KS</given-names></string-name>, <string-name><surname>Goldsmith</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Hsieh</surname> <given-names>C-L</given-names></string-name>, <string-name><surname>Abiona</surname> <given-names>O</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation</article-title>. <source>Science</source>, <volume>367</volume>(<issue>6483</issue>): <fpage>1260</fpage>–<lpage>1263</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_037">
<mixed-citation publication-type="journal"> <string-name><surname>Wu</surname> <given-names>F</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>B</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>YM</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Song</surname> <given-names>ZG</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>A new coronavirus associated with human respiratory disease in China</article-title>. <source>Nature</source>, <volume>579</volume>(<issue>7798</issue>): <fpage>265</fpage>–<lpage>269</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_038">
<mixed-citation publication-type="journal"> <string-name><surname>Yurkovetskiy</surname> <given-names>L</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Pascal</surname> <given-names>KE</given-names></string-name>, <string-name><surname>Tomkins-Tinch</surname> <given-names>C</given-names></string-name>, <string-name><surname>Nyalile</surname> <given-names>TP</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Y</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Structural and functional analysis of the D614G SARS-CoV-2 spike protein variant</article-title>. <source>Cell</source>, <volume>183</volume>(<issue>3</issue>): <fpage>739</fpage>–<lpage>751</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_039">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Kou</surname> <given-names>SC</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>JS</given-names></string-name> (<year>2007</year>a). <article-title>Biopolymer structure simulation and optimization via fragment regrowth Monte Carlo</article-title>. <source>Journal of Chemical Physics</source>, <volume>126</volume>(<issue>22</issue>): <elocation-id>06B605</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_040">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>M</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>R</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>JS</given-names></string-name> (<year>2007</year>b). <article-title>Monte Carlo sampling of near-native structures of proteins with applications</article-title>. <source>Proteins: Structure, Function, and Bioinformatics</source>, <volume>66</volume>(<issue>1</issue>): <fpage>61</fpage>–<lpage>68</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_041">
<mixed-citation publication-type="journal"> <string-name><surname>Zhou</surname> <given-names>P</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>XL</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>XG</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>B</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>W</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>a). <article-title>A pneumonia outbreak associated with a new coronavirus of probable bat origin</article-title>. <source>Nature</source>, <volume>579</volume>(<issue>7798</issue>): <fpage>270</fpage>–<lpage>273</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1006_ref_042">
<mixed-citation publication-type="other"> <string-name><surname>Zhou</surname> <given-names>T</given-names></string-name>, <string-name><surname>Tsybovsky</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Olia</surname> <given-names>AS</given-names></string-name>, <string-name><surname>Gorman</surname> <given-names>J</given-names></string-name>, <string-name><surname>Rapp</surname> <given-names>M</given-names></string-name>, <string-name><surname>Cerutti</surname> <given-names>G</given-names></string-name>, et al. (2020b). Cryo-EM structures delineate a ph-dependent switch that mediates endosomal positioning of SARS-CoV-2 spike receptor-binding domains. bioRxiv preprint: <uri>https://doi.org/10.1101/2020.07.04.187989</uri>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
