<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1092</article-id>
<article-id pub-id-type="doi">10.6339/23-JDS1092</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Impact of Bias Correction of the Least Squares Estimation on Bootstrap Confidence Intervals for Bifurcating Autoregressive Models</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Elbayoumi</surname><given-names>Tamer</given-names></name><xref ref-type="aff" rid="j_jds1092_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-8113-702X</contrib-id>
<name><surname>Mostafa</surname><given-names>Sayed</given-names></name><email xlink:href="mailto:sabdelmegeed@ncat.edu">sabdelmegeed@ncat.edu</email><xref ref-type="aff" rid="j_jds1092_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds1092_aff_001"><label>1</label>Department of Mathematics &amp; Statistics, <institution>North Carolina A&amp;T State University</institution>, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:sabdelmegeed@ncat.edu">sabdelmegeed@ncat.edu</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2024</year></pub-date><pub-date pub-type="epub"><day>24</day><month>2</month><year>2023</year></pub-date><volume>22</volume><issue>1</issue><fpage>25</fpage><lpage>44</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1092_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>The supplementary material includes the following files and folders: (1) README: a brief explanation of all the files and folders in the supplementary material; (2) The application datasets; (3) Code files; and (4) Additional simulation results.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>7</day><month>10</month><year>2022</year></date><date date-type="accepted"><day>19</day><month>2</month><year>2023</year></date></history>
<permissions><copyright-statement>2024 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2024</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>The least squares (LS) estimator of the autoregressive coefficient in the bifurcating autoregressive (BAR) model was recently shown to suffer from substantial bias, especially for small to moderate samples. This study investigates the impact of the bias in the LS estimator on the behavior of various types of bootstrap confidence intervals for the autoregressive coefficient and introduces methods for constructing bias-corrected bootstrap confidence intervals. We first describe several bootstrap confidence interval procedures for the autoregressive coefficient of the BAR model and present their bias-corrected versions. The behavior of uncorrected and corrected confidence interval procedures is studied empirically through extensive Monte Carlo simulations and two real cell lineage data applications. The empirical results show that the bias in the LS estimator can have a significant negative impact on the behavior of bootstrap confidence intervals and that bias correction can significantly improve the performance of bootstrap confidence intervals in terms of coverage, width, and symmetry.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>BAR model</kwd>
<kwd>binary trees</kwd>
<kwd>cell-lineage</kwd>
<kwd>maternal correlation</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1092_reflist_001">
<title>References</title>
<ref id="j_jds1092_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Berkowitz</surname> <given-names>J</given-names></string-name>, <string-name><surname>Kilian</surname> <given-names>L</given-names></string-name> (<year>2000</year>). <article-title>Recent developments in bootstrapping time series</article-title>. <source><italic>Econometric Reviews</italic></source>, <volume>19</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>48</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/07474930008800457" xlink:type="simple">https://doi.org/10.1080/07474930008800457</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Bui</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Huggins</surname> <given-names>R</given-names></string-name> (<year>1999</year>). <article-title>Inference for the random coefficients bifurcating autoregressive model for cell lineage studies</article-title>. <source><italic>Journal of Statistical Planning and Inference</italic></source>, <volume>81</volume>(<issue>2</issue>): <fpage>253</fpage>–<lpage>262</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/S0378-3758(99)00049-X" xlink:type="simple">https://doi.org/10.1016/S0378-3758(99)00049-X</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Chang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Hall</surname> <given-names>P</given-names></string-name> (<year>2015</year>). <article-title>Double-bootstrap methods that use a single double-bootstrap simulation</article-title>. <source><italic>Biometrika</italic></source>, <volume>102</volume>: <fpage>203</fpage>–<lpage>214</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/asu060" xlink:type="simple">https://doi.org/10.1093/biomet/asu060</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_004">
<mixed-citation publication-type="chapter"> <string-name><surname>Cowan</surname> <given-names>R</given-names></string-name> (<year>1984</year>). <chapter-title>Statistical concepts in the analysis of cell lineage data</chapter-title>. In: <source><italic>1983 Workshop Cell Growth Division</italic></source>, <fpage>18</fpage>–<lpage>22</lpage>. <publisher-name>Latrobe University</publisher-name>, <publisher-loc>Melbourne</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1092_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Cowan</surname> <given-names>R</given-names></string-name>, <string-name><surname>Staudte</surname> <given-names>RG</given-names></string-name> (<year>1986</year>). <article-title>The bifurcating autoregression model in cell lineage studies</article-title>. <source><italic>Biometrics</italic></source>, <volume>42</volume>: <fpage>769</fpage>–<lpage>783</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/2530692" xlink:type="simple">https://doi.org/10.2307/2530692</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Efron</surname> <given-names>B</given-names></string-name> (<year>1987</year>). <article-title>Better bootstrap confidence intervals</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>82</volume>: <fpage>171</fpage>–<lpage>200</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.1987.10478410" xlink:type="simple">https://doi.org/10.1080/01621459.1987.10478410</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_007">
<mixed-citation publication-type="other"> <string-name><surname>Elbayoumi</surname> <given-names>T</given-names></string-name>, <string-name><surname>Mostafa</surname> <given-names>S</given-names></string-name> (2021a). <italic>bifurcatingr: Bifurcating Autoregressive Models</italic>. R package version 1.0.0.</mixed-citation>
</ref>
<ref id="j_jds1092_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Elbayoumi</surname> <given-names>T</given-names></string-name>, <string-name><surname>Terpstra</surname> <given-names>J</given-names></string-name> (<year>2016</year>). <article-title>Weighted L1-estimates for the first-order bifurcating autoregressive model</article-title>. <source><italic>Communications in Statistics. Simulation and Computation</italic></source>, <volume>45</volume>: <fpage>2991</fpage>–<lpage>3013</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/03610918.2014.938826" xlink:type="simple">https://doi.org/10.1080/03610918.2014.938826</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Elbayoumi</surname> <given-names>TM</given-names></string-name>, <string-name><surname>Mostafa</surname> <given-names>SA</given-names></string-name> (<year>2021</year>b). <article-title>On the estimation bias in first-order bifurcating autoregressive models</article-title>. <source><italic>Stat</italic></source>, <volume>10</volume>(<issue>1</issue>): <fpage>e342</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1092_ref_010">
<mixed-citation publication-type="book"> <string-name><surname>Hall</surname> <given-names>P</given-names></string-name> (<year>1992</year>). <source><italic>The Bootstrap and Edgeworth Expansion</italic></source>. <publisher-name>Springer-Verlag</publisher-name>, <publisher-loc>New York</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1092_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Hawkins</surname> <given-names>ED</given-names></string-name>, <string-name><surname>Markham</surname> <given-names>JF</given-names></string-name>, <string-name><surname>McGuinness</surname> <given-names>LP</given-names></string-name>, <string-name><surname>Hodgkin</surname> <given-names>PD</given-names></string-name> (<year>2009</year>). <article-title>A single-cell pedigree analysis of alternative stochastic lymphocyte fates</article-title>. <source><italic>Proceedings of the National Academy of Sciences</italic></source>, <volume>106</volume>(<issue>32</issue>): <fpage>13457</fpage>–<lpage>13462</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1073/pnas.0905629106" xlink:type="simple">https://doi.org/10.1073/pnas.0905629106</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Huggins</surname> <given-names>R</given-names></string-name>, <string-name><surname>Basawa</surname> <given-names>I</given-names></string-name> (<year>2000</year>). <article-title>Inference for the extended bifurcating autoregressive model for cell lineage studies</article-title>. <source><italic>Australian &amp; New Zealand Journal of Statistics</italic></source>, <volume>42</volume>: <fpage>423</fpage>–<lpage>432</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/1467-842X.00139" xlink:type="simple">https://doi.org/10.1111/1467-842X.00139</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Huggins</surname> <given-names>RM</given-names></string-name> (<year>1995</year>). <article-title>A law of large numbers for the bifurcating autoregressive process</article-title>. <source><italic>Communications in Statistics–Stochastic Models</italic></source>, <volume>11</volume>(<issue>2</issue>): <fpage>273</fpage>–<lpage>278</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/15326349508807345" xlink:type="simple">https://doi.org/10.1080/15326349508807345</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_014">
<mixed-citation publication-type="journal"> <string-name><surname>Huggins</surname> <given-names>RM</given-names></string-name>, <string-name><surname>Basawa</surname> <given-names>IV</given-names></string-name> (<year>1999</year>). <article-title>Extensions of the bifurcating autoregressive model for cell lineage studies</article-title>. <source><italic>Journal of Applied Probability</italic></source>, <volume>36</volume>: <fpage>1225</fpage>–<lpage>1233</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1239/jap/1032374768" xlink:type="simple">https://doi.org/10.1239/jap/1032374768</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_015">
<mixed-citation publication-type="book"> <string-name><surname>Kimmel</surname> <given-names>M</given-names></string-name>, <string-name><surname>Axelrod</surname> <given-names>D</given-names></string-name> (<year>2005</year>). <source><italic>Branching Processes in Biology</italic></source>. <publisher-name>Springer-Verlag</publisher-name>, <publisher-loc>New York</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1092_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Lee</surname> <given-names>SMS</given-names></string-name>, <string-name><surname>Young</surname> <given-names>GA</given-names></string-name> (<year>1999</year>). <article-title>The effect of Monte Carlo approximation on coverage error of double-bootstrap confidence intervals</article-title>. <source><italic>Journal of the Royal Statistical Society: Series B</italic></source>, <volume>61</volume>(<issue>2</issue>): <fpage>353</fpage>–<lpage>366</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/1467-9868.00181" xlink:type="simple">https://doi.org/10.1111/1467-9868.00181</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Liu-Evans</surname> <given-names>GD</given-names></string-name>, <string-name><surname>Phillips</surname> <given-names>GD</given-names></string-name> (<year>2012</year>). <article-title>Bootstrap, Jackknife and COLS: Bias and mean squared error in estimation of autoregressive models</article-title>. <source><italic>Journal of Time Series Econometrics</italic></source>, <volume>4</volume>(<issue>2</issue>): <fpage>1</fpage>–<lpage>33</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1515/1941-1928.1122" xlink:type="simple">https://doi.org/10.1515/1941-1928.1122</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Ouysse</surname> <given-names>R</given-names></string-name> (<year>2013</year>). <article-title>A fast iterated bootstrap procedure for approximating the small-sample bias</article-title>. <source><italic>Communications in Statistics – Simulation and Computation</italic></source>, <volume>42</volume>(<issue>7</issue>): <fpage>1472</fpage>–<lpage>1494</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/03610918.2012.667473" xlink:type="simple">https://doi.org/10.1080/03610918.2012.667473</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_019">
<mixed-citation publication-type="book"> <collab>R Core Team</collab> (<year>2022</year>). <source><italic>R: A Language and Environment for Statistical Computing</italic></source>. <publisher-name>R Foundation for Statistical Computing</publisher-name>, <publisher-loc>Vienna, Austria</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1092_ref_020">
<mixed-citation publication-type="journal"> <string-name><surname>Sandler</surname> <given-names>O</given-names></string-name>, <string-name><surname>Mizrahi</surname> <given-names>SP</given-names></string-name>, <string-name><surname>Weiss</surname> <given-names>N</given-names></string-name>, <string-name><surname>Agam</surname> <given-names>O</given-names></string-name>, <string-name><surname>Simon</surname> <given-names>I</given-names></string-name>, <string-name><surname>Balaban</surname> <given-names>NQ</given-names></string-name> (<year>2015</year>). <article-title>Lineage correlations of single cell division time as a probe of cell-cycle dynamics</article-title>. <source><italic>Nature</italic></source>, <volume>519</volume>: <fpage>468</fpage>–<lpage>471</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1038/nature14318" xlink:type="simple">https://doi.org/10.1038/nature14318</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Shi</surname> <given-names>SG</given-names></string-name> (<year>1992</year>). <article-title>Accurate and efficient double-bootstrap confidence limit method</article-title>. <source><italic>Computational Statistics and Data Analysis</italic></source>, <volume>13</volume>: <fpage>21</fpage>–<lpage>32</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/0167-9473(92)90151-5" xlink:type="simple">https://doi.org/10.1016/0167-9473(92)90151-5</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Staudte</surname> <given-names>R</given-names></string-name>, <string-name><surname>Guiguet</surname> <given-names>M</given-names></string-name>, <string-name><surname>d’Hooghe</surname> <given-names>MC</given-names></string-name> (<year>1984</year>). <article-title>Additive models for dependent cell populations</article-title>. <source><italic>Journal of Theoretical Biology</italic></source>, <volume>109</volume>(<issue>1</issue>): <fpage>127</fpage>–<lpage>146</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/S0022-5193(84)80115-0" xlink:type="simple">https://doi.org/10.1016/S0022-5193(84)80115-0</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Terpstra</surname> <given-names>JT</given-names></string-name>, <string-name><surname>Elbayoumi</surname> <given-names>T</given-names></string-name> (<year>2012</year>). <article-title>A law of large numbers result for a bifurcating process with an infinite moving average representation</article-title>. <source><italic>Statistics &amp; Probability Letters</italic></source>, <volume>82</volume>(<issue>1</issue>): <fpage>123</fpage>–<lpage>129</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.spl.2011.09.012" xlink:type="simple">https://doi.org/10.1016/j.spl.2011.09.012</ext-link></mixed-citation>
</ref>
<ref id="j_jds1092_ref_024">
<mixed-citation publication-type="other"> <string-name><surname>Wuertz</surname> <given-names>D</given-names></string-name>, <string-name><surname>Setz</surname> <given-names>T</given-names></string-name>, <string-name><surname>Chalabi</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Smith</surname> <given-names>P</given-names></string-name> (2022). <italic>fCopulae: Rmetrics – Bivariate Dependence Structures with Copulae</italic>. R package version 4021.84.</mixed-citation>
</ref>
<ref id="j_jds1092_ref_025">
<mixed-citation publication-type="other"><italic>Statistics &amp; Probability Letters</italic></mixed-citation>
</ref>
</ref-list>
</back>
</article>
