<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1097</article-id>
<article-id pub-id-type="doi">10.6339/23-JDS1097</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>FROSTY: A High-Dimensional Scale-Free Bayesian Network Learning Method</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Bang</surname><given-names>Joshua</given-names></name><email xlink:href="mailto:joshuaybang@gmail.com">joshuaybang@gmail.com</email><xref ref-type="aff" rid="j_jds1097_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Oh</surname><given-names>Sang-Yun</given-names></name><xref ref-type="aff" rid="j_jds1097_aff_002">2</xref>
</contrib>
<aff id="j_jds1097_aff_001"><label>1</label>Department of Statistics and Applied Probability, <institution>University of California</institution>, Santa Barbara, Santa Barbara, CA, <country>USA</country></aff>
<aff id="j_jds1097_aff_002"><label>2</label><institution>Lawrence Berkeley National Lab</institution>, Berkeley, CA, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:joshuaybang@gmail.com">joshuaybang@gmail.com</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2023</year></pub-date><pub-date pub-type="epub"><day>14</day><month>3</month><year>2023</year></pub-date><volume>21</volume><issue>2</issue><fpage>354</fpage><lpage>367</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1097_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>Supplementary material includes proofs of the theorems and the Python code for simulation.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>31</day><month>7</month><year>2022</year></date><date date-type="accepted"><day>7</day><month>3</month><year>2023</year></date></history>
<permissions><copyright-statement>2023 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2023</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>We propose a scalable Bayesian network learning algorithm based on sparse Cholesky decomposition. Our approach only requires observational data and user-specified confidence level as inputs and can estimate networks with thousands of variables. The computational complexity of the proposed method is <inline-formula id="j_jds1097_ineq_001"><alternatives><mml:math>
<mml:mi mathvariant="italic">O</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$O({p^{3}})$]]></tex-math></alternatives></inline-formula> for a graph with <italic>p</italic> vertices. Extensive numerical experiments illustrate the usefulness of our method with promising results. In simulation, the initial step in our approach also improves an alternative Bayesian network structure estimation method that uses an undirected graph as an input.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>Bayesian networks</kwd>
<kwd>robust selection</kwd>
<kwd>sparse Cholesky decomposition</kwd>
<kwd>structure learning</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1097_reflist_001">
<title>References</title>
<ref id="j_jds1097_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Abramson</surname> <given-names>B</given-names></string-name>, <string-name><surname>Brown</surname> <given-names>J</given-names></string-name>, <string-name><surname>Edwards</surname> <given-names>W</given-names></string-name>, <string-name><surname>Murphy</surname> <given-names>A</given-names></string-name>, <string-name><surname>Winkler</surname> <given-names>RL</given-names></string-name> (<year>1996</year>). <article-title>Hailfinder: A Bayesian system for forecasting severe weather</article-title>. <source><italic>International Journal of Forecasting</italic></source>, <volume>12</volume>(<issue>1</issue>): <fpage>57</fpage>–<lpage>71</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/0169-2070(95)00664-8" xlink:type="simple">https://doi.org/10.1016/0169-2070(95)00664-8</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Amestoy</surname> <given-names>PR</given-names></string-name>, <string-name><surname>Davis</surname> <given-names>TA</given-names></string-name>, <string-name><surname>Duff</surname> <given-names>IS</given-names></string-name> (<year>1996</year>). <article-title>An approximate minimum degree ordering algorithm</article-title>. <source><italic>SIAM Journal on Matrix Analysis and Applications</italic></source>, <volume>17</volume>(<issue>4</issue>): <fpage>886</fpage>–<lpage>905</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1137/S0895479894278952" xlink:type="simple">https://doi.org/10.1137/S0895479894278952</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Barabási</surname> <given-names>AL</given-names></string-name>, <string-name><surname>Albert</surname> <given-names>R</given-names></string-name> (<year>1999</year>). <article-title>Emergence of scaling in random networks</article-title>. <source><italic>Science</italic></source>, <volume>286</volume>(<issue>5439</issue>): <fpage>509</fpage>–<lpage>512</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1126/science.286.5439.509" xlink:type="simple">https://doi.org/10.1126/science.286.5439.509</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_004">
<mixed-citation publication-type="chapter"> <string-name><surname>Cisneros-Velarde</surname> <given-names>P</given-names></string-name>, <string-name><surname>Petersen</surname> <given-names>A</given-names></string-name>, <string-name><surname>Oh</surname> <given-names>SY</given-names></string-name> (<year>2020</year>). <chapter-title>Distributionally robust formulation and model selection for the graphical lasso</chapter-title>. In: <source><italic>International Conference on Artificial Intelligence and Statistics</italic></source>, <fpage>756</fpage>–<lpage>765</lpage>, <publisher-name>PMLR</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Friedman</surname> <given-names>J</given-names></string-name>, <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name>, <string-name><surname>Tibshirani</surname> <given-names>R</given-names></string-name> (<year>2008</year>). <article-title>Sparse inverse covariance estimation with the graphical lasso</article-title>. <source><italic>Biostatistics</italic></source>, <volume>9</volume>(<issue>3</issue>): <fpage>432</fpage>–<lpage>441</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biostatistics/kxm045" xlink:type="simple">https://doi.org/10.1093/biostatistics/kxm045</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Friedman</surname> <given-names>N</given-names></string-name> (<year>2004</year>). <article-title>Inferring cellular networks using probabilistic graphical models</article-title>. <source><italic>Science</italic></source>, <volume>303</volume>(<issue>5659</issue>): <fpage>799</fpage>–<lpage>805</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1126/science.1094068" xlink:type="simple">https://doi.org/10.1126/science.1094068</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_007">
<mixed-citation publication-type="book"> <string-name><surname>Goodfellow</surname> <given-names>I</given-names></string-name>, <string-name><surname>Bengio</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Courville</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bengio</surname> <given-names>Y</given-names></string-name> (<year>2016</year>). <source><italic>Deep learning</italic></source>, volume <volume>1</volume>. <publisher-name>MIT press</publisher-name>, <publisher-loc>Cambridge</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_008">
<mixed-citation publication-type="book"> <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name>, <string-name><surname>Tibshirani</surname> <given-names>R</given-names></string-name>, <string-name><surname>Friedman</surname> <given-names>JH</given-names></string-name>, <string-name><surname>Friedman</surname> <given-names>JH</given-names></string-name> (<year>2009</year>). <source><italic>The elements of statistical learning: data mining, inference, and prediction</italic></source>, volume <volume>2</volume>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Heckerman</surname> <given-names>DE</given-names></string-name>, <string-name><surname>Nathwani</surname> <given-names>BN</given-names></string-name> (<year>1992</year>). <article-title>An evaluation of the diagnostic accuracy of pathfinder</article-title>. <source><italic>Computers and Biomedical Research</italic></source>, <volume>25</volume>(<issue>1</issue>): <fpage>56</fpage>–<lpage>74</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/0010-4809(92)90035-9" xlink:type="simple">https://doi.org/10.1016/0010-4809(92)90035-9</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_010">
<mixed-citation publication-type="book"> <string-name><surname>Huang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Acero</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hon</surname> <given-names>HW</given-names></string-name>, <string-name><surname>Reddy</surname> <given-names>R</given-names></string-name> (<year>2001</year>). <source><italic>Spoken language processing: A guide to theory, algorithm, and system development</italic></source>. <publisher-name>Prentice hall PTR</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Luo</surname> <given-names>J</given-names></string-name>, <string-name><surname>Savakis</surname> <given-names>AE</given-names></string-name>, <string-name><surname>Singhal</surname> <given-names>A</given-names></string-name> (<year>2005</year>). <article-title>A Bayesian network-based framework for semantic image understanding</article-title>. <source><italic>Pattern recognition</italic></source>, <volume>38</volume>(<issue>6</issue>): <fpage>919</fpage>–<lpage>934</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.patcog.2004.11.001" xlink:type="simple">https://doi.org/10.1016/j.patcog.2004.11.001</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_012">
<mixed-citation publication-type="book"> <string-name><surname>Pearl</surname> <given-names>J</given-names></string-name> (<year>2014</year>). <source><italic>Probabilistic reasoning in intelligent systems: networks of plausible inference</italic></source>. <publisher-name>Elsevier</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Pourahmadi</surname> <given-names>M</given-names></string-name> (<year>1999</year>). <article-title>Joint mean-covariance models with applications to longitudinal data: Unconstrained parameterisation</article-title>. <source><italic>Biometrika</italic></source>, <volume>86</volume>(<issue>3</issue>): <fpage>677</fpage>–<lpage>690</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/86.3.677" xlink:type="simple">https://doi.org/10.1093/biomet/86.3.677</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_014">
<mixed-citation publication-type="journal"> <string-name><surname>Raskutti</surname> <given-names>G</given-names></string-name>, <string-name><surname>Uhler</surname> <given-names>C</given-names></string-name> (<year>2018</year>). <article-title>Learning directed acyclic graph models based on sparsest permutations</article-title>. <source><italic>Stat</italic></source>, <volume>7</volume>(<issue>1</issue>): <elocation-id>e183</elocation-id>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1002/sta4.183" xlink:type="simple">https://doi.org/10.1002/sta4.183</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_015">
<mixed-citation publication-type="chapter"> <string-name><surname>Robinson</surname> <given-names>RW</given-names></string-name> (<year>1977</year>). <chapter-title>Counting unlabeled acyclic digraphs</chapter-title>. In: <source><italic>Combinatorial Mathematics V: Proceedings of the Fifth Australian Conference, Held at the Royal Melbourne Institute of Technology, August 24–26, 1976</italic></source>, <fpage>28</fpage>–<lpage>43</lpage>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_016">
<mixed-citation publication-type="other"> <string-name><surname>Rose</surname> <given-names>DJ</given-names></string-name> (1970). Symmetric elimination on sparse positive definite systems and the potential flow network problem, Ph.D. thesis, Harvard University.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_017">
<mixed-citation publication-type="chapter"> <string-name><surname>Rose</surname> <given-names>DJ</given-names></string-name> (<year>1972</year>). <chapter-title>A graph-theoretic study of the numerical solution of sparse positive definite systems of linear equations</chapter-title>. In: <source><italic>Graph theory and computing</italic></source>, <fpage>183</fpage>–<lpage>217</lpage>. <publisher-name>Elsevier</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_018">
<mixed-citation publication-type="chapter"> <string-name><surname>Rue</surname> <given-names>H</given-names></string-name>, <string-name><surname>Held</surname> <given-names>L</given-names></string-name> (<year>2010</year>). <chapter-title>Discrete spatial variation</chapter-title>. In: <source><italic>Handbook of spatial statistics</italic></source>, <fpage>171</fpage>–<lpage>200</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Sachs</surname> <given-names>K</given-names></string-name>, <string-name><surname>Perez</surname> <given-names>O</given-names></string-name>, <string-name><surname>Pe’er</surname> <given-names>D</given-names></string-name>, <string-name><surname>Lauffenburger</surname> <given-names>DA</given-names></string-name>, <string-name><surname>Nolan</surname> <given-names>GP</given-names></string-name> (<year>2005</year>). <article-title>Causal protein-signaling networks derived from multiparameter single-cell data</article-title>. <source><italic>Science</italic></source>, <volume>308</volume>(<issue>5721</issue>): <fpage>523</fpage>–<lpage>529</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1126/science.1105809" xlink:type="simple">https://doi.org/10.1126/science.1105809</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_020">
<mixed-citation publication-type="chapter"> <string-name><surname>Sioutis</surname> <given-names>M</given-names></string-name>, <string-name><surname>Condotta</surname> <given-names>JF</given-names></string-name> (<year>2014</year>). <chapter-title>Tackling large qualitative spatial networks of scale-free-like structure</chapter-title>. In: <source><italic>Artificial Intelligence: Methods and Applications: 8th Hellenic Conference on AI, SETN 2014, Ioannina, Greece, May 15–17, 2014. Proceedings 8</italic></source>, <fpage>178</fpage>–<lpage>191</lpage>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_021">
<mixed-citation publication-type="book"> <string-name><surname>Spirtes</surname> <given-names>P</given-names></string-name>, <string-name><surname>Glymour</surname> <given-names>CN</given-names></string-name>, <string-name><surname>Scheines</surname> <given-names>R</given-names></string-name>, <string-name><surname>Heckerman</surname> <given-names>D</given-names></string-name> (<year>2000</year>). <source><italic>Causation, prediction, and search</italic></source>. <publisher-name>MIT press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_022">
<mixed-citation publication-type="other"> <string-name><surname>Squires</surname> <given-names>C</given-names></string-name>, <string-name><surname>Amaniampong</surname> <given-names>J</given-names></string-name>, <string-name><surname>Uhler</surname> <given-names>C</given-names></string-name> (2020). Efficient permutation discovery in causal dags. arXiv preprint: <uri>https://arxiv.org/abs/2011.03610</uri>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Tran</surname> <given-names>C</given-names></string-name>, <string-name><surname>Cisneros-Velarde</surname> <given-names>P</given-names></string-name>, <string-name><surname>Oh</surname> <given-names>SY</given-names></string-name>, <string-name><surname>Petersen</surname> <given-names>A</given-names></string-name> (<year>2022</year>). <article-title>Family-wise error rate control in Gaussian graphical model selection via distributionally robust optimization</article-title>. <source><italic>Stat</italic></source>, <volume>11</volume>(<issue>1</issue>): <elocation-id>e477</elocation-id>, <publisher-name>Wiley Online Library</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Van de Geer</surname> <given-names>S</given-names></string-name>, <string-name><surname>Bühlmann</surname> <given-names>P</given-names></string-name>, <etal>et al.</etal> (<year>2013</year>). <article-title><inline-formula id="j_jds1097_ineq_002"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi>ℓ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\ell _{0}}$]]></tex-math></alternatives></inline-formula>-penalized maximum likelihood for sparse directed acyclic graphs</article-title>. <source><italic>Annals of Statistics</italic></source>, <volume>41</volume>(<issue>2</issue>): <fpage>536</fpage>–<lpage>567</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_025">
<mixed-citation publication-type="chapter"> <string-name><surname>Verma</surname> <given-names>TS</given-names></string-name>, <string-name><surname>Pearl</surname> <given-names>J</given-names></string-name> (<year>2022</year>). <chapter-title>Equivalence and synthesis of causal models</chapter-title>. In: <source><italic>Probabilistic and Causal Inference: The Works of Judea Pearl</italic></source>. <fpage>221</fpage>–<lpage>236</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1097_ref_026">
<mixed-citation publication-type="book"> <string-name><surname>Wainwright</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Jordan</surname> <given-names>MI</given-names></string-name> (<year>2008</year>). <source><italic>Graphical models, exponential families, and variational inference</italic></source>. <publisher-name>Now Publishers Inc.</publisher-name></mixed-citation>
</ref>
<ref id="j_jds1097_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Yannakakis</surname> <given-names>M</given-names></string-name> (<year>1981</year>). <article-title>Computing the minimum fill-in is np-complete</article-title>. <source><italic>SIAM Journal on Algebraic Discrete Methods</italic></source>, <volume>2</volume>(<issue>1</issue>): <fpage>77</fpage>–<lpage>79</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1137/0602010" xlink:type="simple">https://doi.org/10.1137/0602010</ext-link></mixed-citation>
</ref>
<ref id="j_jds1097_ref_028">
<mixed-citation publication-type="other"><italic>IEEE Transactions on Pattern Analysis and Machine Intelligence</italic></mixed-citation>
</ref>
</ref-list>
</back>
</article>
