<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1016</article-id>
<article-id pub-id-type="doi">10.6339/21-JDS1016</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Data Science in Action</subject></subj-group></article-categories>
<title-group>
<article-title>Mutstats: An Ultra-fast Computational Method to Determine Clonal Status of Somatic Mutations</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Bi</surname><given-names>Dehua</given-names></name><xref ref-type="aff" rid="j_jds1016_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Sengupta</surname><given-names>Subhajit</given-names></name><xref ref-type="aff" rid="j_jds1016_aff_002">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhou</surname><given-names>Tianjian</given-names></name><xref ref-type="aff" rid="j_jds1016_aff_003">3</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Ji</surname><given-names>Yuan</given-names></name><email xlink:href="mailto:yji@health.bsd.uchicago.edu">yji@health.bsd.uchicago.edu</email><xref ref-type="aff" rid="j_jds1016_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds1016_aff_001"><label>1</label>Public Health Sciences, <institution>University of Chicago</institution>, Chicago, IL, <country>USA</country></aff>
<aff id="j_jds1016_aff_002"><label>2</label><institution>Cytel Inc</institution>, Cambridge, MA, <country>USA</country></aff>
<aff id="j_jds1016_aff_003"><label>3</label>Department of Statistics, <institution>Colorado State University</institution>, Fort Collins, CO, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:yji@health.bsd.uchicago.edu">yji@health.bsd.uchicago.edu</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2021</year></pub-date><pub-date pub-type="epub"><day>1</day><month>6</month><year>2021</year></pub-date><volume>19</volume><issue>3</issue><fpage>465</fpage><lpage>484</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1016_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>We include an Appendix on the Bayes model used by the PyClone method. In addition, the simulation data can be obtained from the website <ext-link ext-link-type="uri" xlink:href="https://compgenome.shinyapps.io/app_5">https://compgenome.shinyapps.io/tumorsim</ext-link>. Finally, the code of the Mutstats method and the real data used in this analysis can be found in the author’s Github page <ext-link ext-link-type="uri" xlink:href="https://github.com/edwardbi/Mutstats">https://github.com/edwardbi/Mutstats</ext-link>.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>23</day><month>1</month><year>2021</year></date><date date-type="accepted"><day>16</day><month>5</month><year>2021</year></date></history>
<permissions><copyright-statement>2021 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2021</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Tumor cell population is a mixture of heterogeneous cell subpopulations, known as subclones. Identification of clonal status of mutations, i.e., whether a mutation occurs in all tumor cells or in a subset of tumor cells, is crucial for understanding tumor progression and developing personalized treatment strategies. We make three major contributions in this paper: (1) we summarize terminologies in the literature based on a unified mathematical representation of subclones; (2) we develop a simulation algorithm to generate hypothetical sequencing data that are akin to real data; and (3) we present an ultra-fast computational method, Mutstats, to infer clonal status of somatic mutations from sequencing data of tumors. The inference is based on a Gaussian mixture model for mutation multiplicities. To validate Mutstats, we evaluate its performance on simulated datasets as well as two breast carcinoma samples from The Cancer Genome Atlas project.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>cancer genomics</kwd>
<kwd>next-generation sequencing</kwd>
<kwd>subclone</kwd>
<kwd>tumor heterogeneity</kwd>
</kwd-group>
<funding-group><award-group><funding-source xlink:href="https://doi.org/10.13039/100000002">NIH</funding-source><award-id>R01 CA132897</award-id></award-group><funding-statement>Yuan Ji’s research is partly supported by NIH R01 CA132897. </funding-statement></funding-group>
</article-meta>
</front>
<body/>
<back>
<ref-list id="j_jds1016_reflist_001">
<title>References</title>
<ref id="j_jds1016_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Barnett</surname> <given-names>DW</given-names></string-name>, <string-name><surname>Garrison</surname> <given-names>EK</given-names></string-name>, <string-name><surname>Quinlan</surname> <given-names>AR</given-names></string-name>, <string-name><surname>Strömberg</surname> <given-names>MP</given-names></string-name>, <string-name><surname>Marth</surname> <given-names>GT</given-names></string-name> (<year>2011</year>). <article-title>Bamtools: a C++ api and toolkit for analyzing and managing bam files</article-title>. <source>Bioinformatics</source>, <volume>27</volume>(<issue>12</issue>): <fpage>1691</fpage>–<lpage>1692</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Beerenwinkel</surname> <given-names>N</given-names></string-name>, <string-name><surname>Schwarz</surname> <given-names>RF</given-names></string-name>, <string-name><surname>Gerstung</surname> <given-names>M</given-names></string-name>, <string-name><surname>Markowetz</surname> <given-names>F</given-names></string-name> (<year>2014</year>). <article-title>Cancer evolution: Mathematical models and computational inference</article-title>. <source>Systematic Biology</source>, <volume>64</volume>(<issue>1</issue>): <fpage>e1</fpage>–<lpage>e25</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Carter</surname> <given-names>SL</given-names></string-name>, <string-name><surname>Cibulskis</surname> <given-names>K</given-names></string-name>, <string-name><surname>Helman</surname> <given-names>E</given-names></string-name>, <string-name><surname>McKenna</surname> <given-names>A</given-names></string-name>, <string-name><surname>Shen</surname> <given-names>H</given-names></string-name>, <string-name><surname>Zack</surname> <given-names>T</given-names></string-name>, <etal>et al.</etal> (<year>2012</year>). <article-title>Absolute quantification of somatic DNA alterations in human cancer</article-title>. <source>Nature Biotechnology</source>, <volume>30</volume>(<issue>5</issue>): <fpage>413</fpage>–<lpage>421</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Cibulskis</surname> <given-names>K</given-names></string-name>, <string-name><surname>Lawrence</surname> <given-names>MS</given-names></string-name>, <string-name><surname>Carter</surname> <given-names>SL</given-names></string-name>, <string-name><surname>Sivachenko</surname> <given-names>A</given-names></string-name>, <string-name><surname>Jaffe</surname> <given-names>D</given-names></string-name>, <string-name><surname>Sougnez</surname> <given-names>C</given-names></string-name>, <etal>et al.</etal> (<year>2013</year>). <article-title>Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples</article-title>. <source>Nature Biotechnology</source>, <volume>31</volume>(<issue>3</issue>): <fpage>213</fpage>–<lpage>219</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Cock</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Fields</surname> <given-names>CJ</given-names></string-name>, <string-name><surname>Goto</surname> <given-names>N</given-names></string-name>, <string-name><surname>Heuer</surname> <given-names>ML</given-names></string-name>, <string-name><surname>Rice</surname> <given-names>PM</given-names></string-name> (<year>2010</year>). <article-title>The sanger fastq file format for sequences with quality scores, and the solexa/illumina fastq variants</article-title>. <source>Nucleic Acids Research</source>, <volume>38</volume>(<issue>6</issue>): <fpage>1767</fpage>–<lpage>1771</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Dempster</surname> <given-names>AP</given-names></string-name>, <string-name><surname>Laird</surname> <given-names>NM</given-names></string-name>, <string-name><surname>Rubin</surname> <given-names>DB</given-names></string-name> (<year>1977</year>). <article-title>Maximum likelihood from incomplete data via the EM algorithm</article-title>. <source>Journal of the Royal Statistical Society, Series B, Methodological</source>, <volume>39</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>38</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Deshwar</surname> <given-names>AG</given-names></string-name>, <string-name><surname>Vembu</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yung</surname> <given-names>CK</given-names></string-name>, <string-name><surname>Jang</surname> <given-names>GH</given-names></string-name>, <string-name><surname>Stein</surname> <given-names>L</given-names></string-name>, <string-name><surname>Morris</surname> <given-names>Q</given-names></string-name> (<year>2015</year>). <article-title>PhyloWGS: reconstructing subclonal composition and evolution from whole-genome sequencing of tumors</article-title>. <source>Genome Biology</source>, <volume>16</volume>(<issue>1</issue>): <fpage>35</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Fan</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Xi</surname> <given-names>L</given-names></string-name>, <string-name><surname>Hughes</surname> <given-names>DS</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Futreal</surname> <given-names>PA</given-names></string-name>, <etal>et al.</etal> (<year>2016</year>). <article-title>MuSE: accounting for tumor heterogeneity using a sample-specific error model improves sensitivity and specificity in mutation calling from sequencing data</article-title>. <source>Genome Biology</source>, <volume>17</volume>(<issue>1</issue>): <fpage>178</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_009">
<mixed-citation publication-type="other"> <string-name><surname>Fraley</surname> <given-names>C</given-names></string-name>, <string-name><surname>Raftery</surname> <given-names>A</given-names></string-name>, <string-name><surname>Scrucca</surname> <given-names>L</given-names></string-name> (2016). mclust: Gaussian mixture modelling for model-based clustering, classification, and density estimation. URL <uri>https://CRAN.R-project.org/package=mclust</uri>. <italic>R package version</italic>, 5: 1.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Huang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Li</surname> <given-names>L</given-names></string-name>, <string-name><surname>Myers</surname> <given-names>JR</given-names></string-name>, <string-name><surname>Marth</surname> <given-names>GT</given-names></string-name> (<year>2012</year>). <article-title>Art: a next-generation sequencing read simulator</article-title>. <source>Bioinformatics</source>, <volume>28</volume>(<issue>4</issue>): <fpage>593</fpage>–<lpage>594</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Koboldt</surname> <given-names>DC</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Larson</surname> <given-names>DE</given-names></string-name>, <string-name><surname>Shen</surname> <given-names>D</given-names></string-name>, <string-name><surname>McLellan</surname> <given-names>MD</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>L</given-names></string-name>, <etal>et al.</etal> (<year>2012</year>). <article-title>VarScan 2: Somatic mutation and copy number alteration discovery in cancer by exome sequencing</article-title>. <source>Genome Research</source>, <volume>22</volume>(<issue>3</issue>): <fpage>568</fpage>–<lpage>576</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Landau</surname> <given-names>DA</given-names></string-name>, <string-name><surname>Carter</surname> <given-names>SL</given-names></string-name>, <string-name><surname>Stojanov</surname> <given-names>P</given-names></string-name>, <string-name><surname>McKenna</surname> <given-names>A</given-names></string-name>, <string-name><surname>Stevenson</surname> <given-names>K</given-names></string-name>, <string-name><surname>Lawrence</surname> <given-names>MS</given-names></string-name>, <etal>et al.</etal> (<year>2013</year>). <article-title>Evolution and impact of subclonal mutations in chronic lymphocytic leukemia</article-title>. <source>Cell</source>, <volume>152</volume>(<issue>4</issue>): <fpage>714</fpage>–<lpage>726</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Lee</surname> <given-names>J</given-names></string-name>, <string-name><surname>Müller</surname> <given-names>P</given-names></string-name>, <string-name><surname>Sengupta</surname> <given-names>S</given-names></string-name>, <string-name><surname>Gulukota</surname> <given-names>K</given-names></string-name>, <string-name><surname>Ji</surname> <given-names>Y</given-names></string-name> (<year>2016</year>). <article-title>Bayesian inference for intratumour heterogeneity in mutations and copy number variation</article-title>. <source>Journal of the Royal Statistical Society: Series C (Applied Statistics)</source>, <volume>65</volume>(<issue>4</issue>): <fpage>547</fpage>–<lpage>563</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_014">
<mixed-citation publication-type="journal"> <string-name><surname>Marusyk</surname> <given-names>A</given-names></string-name>, <string-name><surname>Polyak</surname> <given-names>K</given-names></string-name> (<year>2010</year>). <article-title>Tumor heterogeneity: causes and consequences</article-title>. <source>Biochimica et Biophysica Acta (BBA) – Reviews on Cancer</source>, <volume>1805</volume>(<issue>1</issue>): <fpage>105</fpage>–<lpage>117</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>McGranahan</surname> <given-names>N</given-names></string-name>, <string-name><surname>Favero</surname> <given-names>F</given-names></string-name>, <string-name><surname>de Bruin EC</surname></string-name>, <string-name><surname>Birkbak</surname> <given-names>NJ</given-names></string-name>, <string-name><surname>Szallasi</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Swanton</surname> <given-names>C</given-names></string-name> (<year>2015</year>). <article-title>Clonal status of actionable driver events and the timing of mutational processes in cancer evolution</article-title>. <source>Science Translational Medicine</source>, <volume>7</volume>(<issue>283</issue>): <fpage>283ra54</fpage>–<lpage>283ra54</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>McKenna</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hanna</surname> <given-names>M</given-names></string-name>, <string-name><surname>Banks</surname> <given-names>E</given-names></string-name>, <string-name><surname>Sivachenko</surname> <given-names>A</given-names></string-name>, <string-name><surname>Cibulskis</surname> <given-names>K</given-names></string-name>, <string-name><surname>Kernytsky</surname> <given-names>A</given-names></string-name>, <etal>et al.</etal> (<year>2010</year>). <article-title>The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data</article-title>. <source>Genome Research</source>, <volume>20</volume>(<issue>9</issue>): <fpage>1297</fpage>–<lpage>1303</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Melnykov</surname> <given-names>V</given-names></string-name>, <string-name><surname>Maitra</surname> <given-names>R</given-names></string-name>, <etal>et al.</etal> (<year>2010</year>). <article-title>Finite mixture models and model-based clustering</article-title>. <source>Statistics Surveys</source>, <volume>4</volume>: <fpage>80</fpage>–<lpage>116</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Misale</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yaeger</surname> <given-names>R</given-names></string-name>, <string-name><surname>Hobor</surname> <given-names>S</given-names></string-name>, <string-name><surname>Scala</surname> <given-names>E</given-names></string-name>, <string-name><surname>Janakiraman</surname> <given-names>M</given-names></string-name>, <string-name><surname>Liska</surname> <given-names>D</given-names></string-name>, <etal>et al.</etal> (<year>2012</year>). <article-title>Emergence of KRAS mutations and acquired resistance to anti-EGFR therapy in colorectal cancer</article-title>. <source>Nature</source>, <volume>486</volume>(<issue>7404</issue>): <fpage>532</fpage>–<lpage>536</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Nik-Zainal</surname> <given-names>S</given-names></string-name>, <string-name><surname>Van Loo</surname> <given-names>P</given-names></string-name>, <string-name><surname>Wedge</surname> <given-names>DC</given-names></string-name>, <string-name><surname>Alexandrov</surname> <given-names>LB</given-names></string-name>, <string-name><surname>Greenman</surname> <given-names>CD</given-names></string-name>, <string-name><surname>Lau</surname> <given-names>KW</given-names></string-name>, <etal>et al.</etal> (<year>2012</year>). <article-title>The life history of 21 breast cancers</article-title>. <source>Cell</source>, <volume>149</volume>(<issue>5</issue>): <fpage>994</fpage>–<lpage>1007</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_020">
<mixed-citation publication-type="journal"> <string-name><surname>Nowell</surname> <given-names>PC</given-names></string-name> (<year>1976</year>). <article-title>The clonal evolution of tumor cell populations</article-title>. <source>Science</source>, <volume>194</volume>(<issue>4260</issue>): <fpage>23</fpage>–<lpage>28</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Qin</surname> <given-names>M</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>B</given-names></string-name>, <string-name><surname>Conroy</surname> <given-names>JM</given-names></string-name>, <string-name><surname>Morrison</surname> <given-names>CD</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Cheng</surname> <given-names>Y</given-names></string-name>, <etal>et al.</etal> (<year>2015</year>). <article-title>Scnvsim: somatic copy number variation and structure variation simulator</article-title>. <source>BMC Bioinformatics</source>, <volume>16</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>6</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Roth</surname> <given-names>A</given-names></string-name>, <string-name><surname>Khattra</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yap</surname> <given-names>D</given-names></string-name>, <string-name><surname>Wan</surname> <given-names>A</given-names></string-name>, <string-name><surname>Laks</surname> <given-names>E</given-names></string-name>, <string-name><surname>Biele</surname> <given-names>J</given-names></string-name>, <etal>et al.</etal> (<year>2014</year>). <article-title>PyClone: statistical inference of clonal population structure in cancer</article-title>. <source>Nature Methods</source>, <volume>11</volume>(<issue>4</issue>): <fpage>396</fpage>–<lpage>398</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Schmitt</surname> <given-names>MW</given-names></string-name>, <string-name><surname>Loeb</surname> <given-names>LA</given-names></string-name>, <string-name><surname>Salk</surname> <given-names>JJ</given-names></string-name> (<year>2016</year>). <article-title>The influence of subclonal resistance mutations on targeted cancer therapy</article-title>. <source>Nature Reviews. Clinical Oncology</source>, <volume>13</volume>(<issue>6</issue>): <fpage>335</fpage>–<lpage>347</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Schwarz</surname> <given-names>G</given-names></string-name>, <etal>et al.</etal> (<year>1978</year>). <article-title>Estimating the dimension of a model</article-title>. <source>The Annals of Statistics</source>, <volume>6</volume>(<issue>2</issue>): <fpage>461</fpage>–<lpage>464</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>Scrucca</surname> <given-names>L</given-names></string-name>, <string-name><surname>Fop</surname> <given-names>M</given-names></string-name>, <string-name><surname>Murphy</surname> <given-names>TB</given-names></string-name>, <string-name><surname>Raftery</surname> <given-names>AE</given-names></string-name> (<year>2016</year>). <article-title>mclust 5: Clustering, classification and density estimation using gaussian finite mixture models</article-title>. <source>The R Journal</source>, <volume>8</volume>(<issue>1</issue>): <fpage>289</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_026">
<mixed-citation publication-type="chapter"> <string-name><surname>Sengupta</surname> <given-names>S</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>J</given-names></string-name>, <string-name><surname>Müller</surname> <given-names>P</given-names></string-name>, <string-name><surname>Gulukota</surname> <given-names>K</given-names></string-name>, <string-name><surname>Banerjee</surname> <given-names>A</given-names></string-name>, <etal>et al.</etal> (<year>2015</year>). <chapter-title>Bayclone: Bayesian nonparametric inference of tumor subclones using NGS data</chapter-title>. In: <source>Pacific Symposium on Biocomputing</source>, volume <volume>20</volume>, <fpage>467</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Shcherbina</surname> <given-names>A</given-names></string-name> (<year>2014</year>). <article-title>Fastqsim: platform-independent data characterization and in silico read generation for ngs datasets</article-title>. <source>BMC Research Notes</source>, <volume>7</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>12</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_028">
<mixed-citation publication-type="journal"> <string-name><surname>Shen</surname> <given-names>R</given-names></string-name>, <string-name><surname>Seshan</surname> <given-names>VE</given-names></string-name> (<year>2016</year>). <article-title>FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing</article-title>. <source>Nucleic Acids Research</source>, <volume>44</volume>(<issue>16</issue>): <fpage>e131</fpage>–<lpage>e131</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Swanton</surname> <given-names>C</given-names></string-name> (<year>2012</year>). <article-title>Intratumor heterogeneity: evolution through space and time</article-title>. <source>Cancer Research</source>, <volume>72</volume>(<issue>19</issue>): <fpage>4875</fpage>–<lpage>4882</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_030">
<mixed-citation publication-type="journal"> <string-name><surname>The</surname> <given-names>I</given-names></string-name>, <etal>et al.</etal> (<collab>G of Whole TPCA, Consortium</collab>) (<year>2020</year>). <article-title>Pan-cancer analysis of whole genomes</article-title>. <source>Nature</source>, <volume>578</volume>(<issue>7793</issue>): <fpage>82</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_031">
<mixed-citation publication-type="journal"> <string-name><surname>Xia</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>M</given-names></string-name>, <string-name><surname>Xi</surname> <given-names>R</given-names></string-name> (<year>2017</year>). <article-title>Pysim-sv: a package for simulating structural variation data with gc-biases</article-title>. <source>BMC Bioinformatics</source>, <volume>18</volume>(<issue>3</issue>): <fpage>23</fpage>–<lpage>30</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_032">
<mixed-citation publication-type="journal"> <string-name><surname>Yates</surname> <given-names>LR</given-names></string-name>, <string-name><surname>Campbell</surname> <given-names>PJ</given-names></string-name> (<year>2012</year>). <article-title>Evolution of the cancer genome</article-title>. <source>Nature Reviews. Genetics</source>, <volume>13</volume>(<issue>11</issue>): <fpage>795</fpage>–<lpage>806</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_033">
<mixed-citation publication-type="journal"> <string-name><surname>Yu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Du</surname> <given-names>F</given-names></string-name>, <string-name><surname>Ban</surname> <given-names>R</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name> (<year>2020</year>). <article-title>Simuscop: reliably simulate illumina sequencing data based on position and context dependent profiles</article-title>. <source>BMC Bioinformatics</source>, <volume>21</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>18</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_034">
<mixed-citation publication-type="journal"> <string-name><surname>Zhou</surname> <given-names>T</given-names></string-name>, <string-name><surname>Müller</surname> <given-names>P</given-names></string-name>, <string-name><surname>Sengupta</surname> <given-names>S</given-names></string-name>, <string-name><surname>Ji</surname> <given-names>Y</given-names></string-name> (<year>2019</year>). <article-title>PairClone: a Bayesian subclone caller based on mutation pairs</article-title>. <source>Journal of the Royal Statistical Society. Series C. Applied Statistics</source>, <volume>68</volume>(<issue>3</issue>): <fpage>705</fpage>–<lpage>725</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1016_ref_035">
<mixed-citation publication-type="journal"> <string-name><surname>Zhou</surname> <given-names>T</given-names></string-name>, <string-name><surname>Sengupta</surname> <given-names>S</given-names></string-name>, <string-name><surname>Müller</surname> <given-names>P</given-names></string-name>, <string-name><surname>Ji</surname> <given-names>Y</given-names></string-name> (<year>2020</year>). <article-title>RNDClone: Tumor subclone reconstruction based on integrating DNA and RNA sequence data</article-title>. <source>Annals of Applied Statistics</source>, <volume>14</volume>(<issue>4</issue>): <fpage>1856</fpage>–<lpage>1877</lpage>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
