<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1088</article-id>
<article-id pub-id-type="doi">10.6339/23-JDS1088</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Computing in Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Causal Discovery for Observational Sciences Using Supervised Machine Learning</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Petersen</surname><given-names>Anne Helby</given-names></name><email xlink:href="mailto:ahpe@sund.ku.dk">ahpe@sund.ku.dk</email><xref ref-type="aff" rid="j_jds1088_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Ramsey</surname><given-names>Joseph</given-names></name><xref ref-type="aff" rid="j_jds1088_aff_002">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Ekstrøm</surname><given-names>Claus Thorn</given-names></name><xref ref-type="aff" rid="j_jds1088_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Spirtes</surname><given-names>Peter</given-names></name><xref ref-type="aff" rid="j_jds1088_aff_002">2</xref>
</contrib>
<aff id="j_jds1088_aff_001"><label>1</label>Section of Biostatistics, Department of Public Health, <institution>University of Copenhagen</institution>, <country>Denmark</country></aff>
<aff id="j_jds1088_aff_002"><label>2</label>Department of Philosophy, <institution>Carnegie Mellon University</institution>, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:ahpe@sund.ku.dk">ahpe@sund.ku.dk</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2023</year></pub-date><pub-date pub-type="epub"><day>13</day><month>3</month><year>2023</year></pub-date><volume>21</volume><issue>2</issue><fpage>255</fpage><lpage>280</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1088_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>The following supplementary materials are available:
<def-list><def-item><term><bold>Appendices:</bold></term><def>
<p><def-list><def-item><term><bold>A:</bold></term><def>
<p>Terminology and notation</p></def></def-item><def-item><term><bold>B:</bold></term><def>
<p>Details about data simulation</p></def></def-item><def-item><term><bold>C:</bold></term><def>
<p>Details about the neural network</p></def></def-item><def-item><term><bold>D:</bold></term><def>
<p>Results: Extra figures</p></def></def-item><def-item><term><bold>E:</bold></term><def>
<p>Application: Extra figures</p></def></def-item></def-list></p></def></def-item><def-item><term><bold>Application data:</bold></term><def>
<p>Correlation matrices from the Metropolit data application and estimated adjacency matrices.</p></def></def-item><def-item><term><bold>GES simulation study results:</bold></term><def>
<p>Estimated adjacency matrices from the GES applications for the simulation study (estimated using TETRAD).</p></def></def-item><def-item><term><bold>Neural network models:</bold></term><def>
<p>Trained neural network models from the simulation study (.h5 files).</p></def></def-item><def-item><term><bold>Replication code:</bold></term><def>
<p>R code for replicating the simulation study and the application.</p></def></def-item></def-list></p>
</caption>
</supplementary-material><history><date date-type="received"><day>16</day><month>7</month><year>2022</year></date><date date-type="accepted"><day>1</day><month>2</month><year>2023</year></date></history>
<permissions><copyright-statement>2023 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2023</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Causal inference can estimate causal effects, but unless data are collected experimentally, statistical analyses must rely on pre-specified causal models. Causal discovery algorithms are empirical methods for constructing such causal models from data. Several asymptotically correct discovery methods already exist, but they generally struggle on smaller samples. Moreover, most methods focus on very sparse causal models, which may not always be a realistic representation of real-life data generating mechanisms. Finally, while causal relationships suggested by the methods often hold true, their claims about causal non-relatedness have high error rates. This non-conservative error trade off is not ideal for observational sciences, where the resulting model is directly used to inform causal inference: A causal model with many missing causal relations entails too strong assumptions and may lead to biased effect estimates. We propose a new causal discovery method that addresses these three shortcomings: Supervised learning discovery (SLdisco). SLdisco uses supervised machine learning to obtain a mapping from observational data to equivalence classes of causal models. We evaluate SLdisco in a large simulation study based on Gaussian data and we consider several choices of model size and sample size. We find that SLdisco is more conservative, only moderately less informative and less sensitive towards sample size than existing procedures. We furthermore provide a real epidemiological data application. We use random subsampling to investigate real data performance on small samples and again find that SLdisco is less sensitive towards sample size and hence seems to better utilize the information available in small datasets.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>causal learning</kwd>
<kwd>causality</kwd>
<kwd>deep learning</kwd>
<kwd>neural network</kwd>
<kwd>observational data</kwd>
<kwd>structure learning</kwd>
</kwd-group>
<funding-group><award-group><funding-source xlink:href="https://doi.org/10.13039/100000002">National Institutes of Health</funding-source><award-id>AWD000044521364471</award-id></award-group><funding-statement>This work was funded by the Independent Research Fund Denmark (grant 8020-00031B) and the National Institutes of Health (grant AWD000044521364471). </funding-statement></funding-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1088_reflist_001">
<title>References</title>
<ref id="j_jds1088_ref_001">
<mixed-citation publication-type="book"> <string-name><surname>Allaire</surname> <given-names>J</given-names></string-name>, <string-name><surname>Chollet</surname> <given-names>F</given-names></string-name> (<year>2021</year>). <source><italic>keras: R Interface to ‘Keras’</italic></source>. <comment>R package version 2.4.0</comment>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Chickering</surname> <given-names>DM</given-names></string-name> (<year>2002</year>). <article-title>Optimal structure identification with greedy search</article-title>. <source><italic>Journal of Machine Learning Research</italic></source>, <volume>3</volume>: <fpage>507</fpage>–<lpage>554</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Colombo</surname> <given-names>D</given-names></string-name>, <string-name><surname>Maathuis</surname> <given-names>MH</given-names></string-name>, <etal>et al.</etal> (<year>2014</year>). <article-title>Order-independent constraint-based causal structure learning</article-title>. <source><italic>Journal of Machine Learning Research</italic></source>, <volume>15</volume>(<issue>1</issue>): <fpage>3741</fpage>–<lpage>3782</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>De Stavola</surname> <given-names>BL</given-names></string-name>, <string-name><surname>Nitsch</surname> <given-names>D</given-names></string-name>, <string-name><surname>dos Santos Silva</surname> <given-names>I</given-names></string-name>, <string-name><surname>McCormack</surname> <given-names>V</given-names></string-name>, <string-name><surname>Hardy</surname> <given-names>R</given-names></string-name>, <string-name><surname>Mann</surname> <given-names>V</given-names></string-name>, <etal>et al.</etal> (<year>2006</year>). <article-title>Statistical issues in life course epidemiology</article-title>. <source><italic>American Journal of Epidemiology</italic></source>, <volume>163</volume>(<issue>1</issue>): <fpage>84</fpage>–<lpage>96</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1093/aje/kwj003" xlink:type="simple"> https://doi.org/10.1093/aje/kwj003</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_005">
<mixed-citation publication-type="chapter"> <string-name><surname>Goudet</surname> <given-names>O</given-names></string-name>, <string-name><surname>Kalainathan</surname> <given-names>D</given-names></string-name>, <string-name><surname>Caillou</surname> <given-names>P</given-names></string-name>, <string-name><surname>Guyon</surname> <given-names>I</given-names></string-name>, <string-name><surname>Lopez-Paz</surname> <given-names>D</given-names></string-name>, <string-name><surname>Sebag</surname> <given-names>M</given-names></string-name> (<year>2018</year>). <chapter-title>Learning functional causal models with generative neural networks</chapter-title>. In: <source><italic>Explainable and Interpretable Models in Computer Vision and Machine Learning</italic></source> (<string-name><given-names>HJ</given-names> <surname>Escalante</surname></string-name>, <string-name><given-names>S</given-names> <surname>Escalera</surname></string-name>, <string-name><given-names>I</given-names> <surname>Guyon</surname></string-name>, <string-name><given-names>X</given-names> <surname>Baró</surname></string-name>, <string-name><given-names>Y</given-names> <surname>Güçlütürk</surname></string-name>, <string-name><given-names>U</given-names> <surname>Güçlü</surname></string-name>, <string-name><given-names>M</given-names> <surname>van Gerven</surname></string-name>, <string-name><given-names>R</given-names> <surname>van Lier</surname></string-name>, eds.), <fpage>39</fpage>–<lpage>80</lpage>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Greenland</surname> <given-names>S</given-names></string-name>, <string-name><surname>Pearl</surname> <given-names>J</given-names></string-name>, <string-name><surname>Robins</surname> <given-names>JM</given-names></string-name> (<year>1999</year>). <article-title>Causal diagrams for epidemiologic research</article-title>. <source><italic>Epidemiology</italic></source>, <volume>10</volume>(<issue>1</issue>): <fpage>37</fpage>–<lpage>48</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1097/00001648-199901000-00008" xlink:type="simple"> https://doi.org/10.1097/00001648-199901000-00008</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Hornik</surname> <given-names>K</given-names></string-name>, <string-name><surname>Stinchcombe</surname> <given-names>M</given-names></string-name>, <string-name><surname>White</surname> <given-names>H</given-names></string-name> (<year>1989</year>). <article-title>Multilayer feedforward networks are universal approximators</article-title>. <source><italic>Neural Networks</italic></source>, <volume>2</volume>(<issue>5</issue>): <fpage>359</fpage>–<lpage>366</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1016/0893-6080(89)90020-8" xlink:type="simple"> https://doi.org/10.1016/0893-6080(89)90020-8</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_008">
<mixed-citation publication-type="other"> <string-name><surname>Howard</surname> <given-names>AG</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>M</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>B</given-names></string-name>, <string-name><surname>Kalenichenko</surname> <given-names>D</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Weyand</surname> <given-names>T</given-names></string-name>, et al. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint: <ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1704.04861">https://arxiv.org/abs/1704.04861</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Kalisch</surname> <given-names>M</given-names></string-name>, <string-name><surname>Bühlman</surname> <given-names>P</given-names></string-name> (<year>2007</year>). <article-title>Estimating high-dimensional directed acyclic graphs with the pc-algorithm</article-title>. <source><italic>Journal of Machine Learning Research</italic></source>, <volume>8</volume>(<issue>3</issue>): <fpage>613</fpage>–<lpage>636</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Kalisch</surname> <given-names>M</given-names></string-name>, <string-name><surname>Mächler</surname> <given-names>M</given-names></string-name>, <string-name><surname>Colombo</surname> <given-names>D</given-names></string-name>, <string-name><surname>Maathuis</surname> <given-names>MH</given-names></string-name>, <string-name><surname>Bühlmann</surname> <given-names>P</given-names></string-name> (<year>2012</year>). <article-title>Causal inference using graphical models with the r package pcalg</article-title>. <source><italic>Journal of Statistical Software</italic></source>, <volume>47</volume>(<issue>11</issue>): <fpage>1</fpage>–<lpage>26</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.18637/jss.v047.i11" xlink:type="simple"> https://doi.org/10.18637/jss.v047.i11</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_011">
<mixed-citation publication-type="other"> <string-name><surname>Ke</surname> <given-names>NR</given-names></string-name>, <string-name><surname>Chiappa</surname> <given-names>S</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bornschein</surname> <given-names>J</given-names></string-name>, <string-name><surname>Weber</surname> <given-names>T</given-names></string-name>, <string-name><surname>Goyal</surname> <given-names>A</given-names></string-name>, et al. (2022). Learning to induce causal structure. arXiv preprint: <uri>https://arxiv.org/abs/2204.04875</uri>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_012">
<mixed-citation publication-type="other"> <string-name><surname>Li</surname> <given-names>H</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Tian</surname> <given-names>J</given-names></string-name> (2020). Supervised whole dag causal discovery. arXiv preprint: <ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2006.04697">https://arxiv.org/abs/2006.04697</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_013">
<mixed-citation publication-type="chapter"> <string-name><surname>Meek</surname> <given-names>C</given-names></string-name> (<year>1995</year>). <chapter-title>Causal inference and causal explanation with background knowledge</chapter-title>. In: <source><italic>Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, UAI’95</italic></source>, <fpage>403</fpage>–<lpage>410</lpage>. <publisher-name>Morgan Kaufmann Publishers Inc.</publisher-name>, <publisher-loc>San Francisco, CA, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_014">
<mixed-citation publication-type="chapter"> <string-name><surname>Ogarrio</surname> <given-names>JM</given-names></string-name>, <string-name><surname>Spirtes</surname> <given-names>P</given-names></string-name>, <string-name><surname>Ramsey</surname> <given-names>J</given-names></string-name> (<year>2016</year>). <chapter-title>A hybrid causal search algorithm for latent variable models</chapter-title>. In: <source><italic>Conference on Probabilistic Graphical Models</italic></source>, <fpage>368</fpage>–<lpage>379</lpage>. <publisher-name>PMLR</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Osler</surname> <given-names>M</given-names></string-name>, <string-name><surname>Andersen</surname> <given-names>AMN</given-names></string-name>, <string-name><surname>Lund</surname> <given-names>R</given-names></string-name>, <string-name><surname>Batty</surname> <given-names>GD</given-names></string-name>, <string-name><surname>Hougaard</surname> <given-names>CØ</given-names></string-name>, <string-name><surname>Damsgaard</surname> <given-names>MT</given-names></string-name>, <etal>et al.</etal> (<year>2004</year>). <article-title>Revitalising the metropolit 1953 danish male birth cohort: Background, aims and design</article-title>. <source><italic>Paediatric and Perinatal Epidemiology</italic></source>, <volume>18</volume>(<issue>5</issue>): <fpage>385</fpage>–<lpage>394</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1111/j.1365-3016.2004.00584.x" xlink:type="simple"> https://doi.org/10.1111/j.1365-3016.2004.00584.x</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Osler</surname> <given-names>M</given-names></string-name>, <string-name><surname>Lund</surname> <given-names>R</given-names></string-name>, <string-name><surname>Kriegbaum</surname> <given-names>M</given-names></string-name>, <string-name><surname>Christensen</surname> <given-names>U</given-names></string-name>, <string-name><surname>Andersen</surname> <given-names>AMN</given-names></string-name> (<year>2006</year>). <article-title>Cohort profile: The metropolit 1953 Danish male birth cohort</article-title>. <source><italic>International Journal of Epidemiology</italic></source>, <volume>35</volume>(<issue>3</issue>): <fpage>541</fpage>–<lpage>545</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1093/ije/dyi300" xlink:type="simple"> https://doi.org/10.1093/ije/dyi300</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_017">
<mixed-citation publication-type="book"> <string-name><surname>Pearl</surname> <given-names>J</given-names></string-name> (<year>2009</year>). <source><italic>Causality</italic></source>. <publisher-name>Cambridge university press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_018">
<mixed-citation publication-type="chapter"> <string-name><surname>Perkovic</surname> <given-names>E</given-names></string-name> (<year>2020</year>). <chapter-title>Identifying causal effects in maximally oriented partially directed acyclic graphs</chapter-title>. In: <source><italic>Conference on Uncertainty in Artificial Intelligence</italic></source>, <fpage>530</fpage>–<lpage>539</lpage>. <publisher-name>PMLR</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_019">
<mixed-citation publication-type="book"> <string-name><surname>Peters</surname> <given-names>J</given-names></string-name>, <string-name><surname>Janzing</surname> <given-names>D</given-names></string-name>, <string-name><surname>Schölkopf</surname> <given-names>B</given-names></string-name> (<year>2017</year>). <source><italic>Elements of Causal Inference: Foundations and Learning Algorithms</italic></source>. <publisher-name>The MIT Press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_020">
<mixed-citation publication-type="book"> <string-name><surname>Petersen</surname> <given-names>AH</given-names></string-name> (<year>2022</year>). <source><italic>causalDisco: Tools for Causal Discovery on Observational Data</italic></source>. <comment>R package version 0.9.1</comment>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Petersen</surname> <given-names>AH</given-names></string-name>, <string-name><surname>Osler</surname> <given-names>M</given-names></string-name>, <string-name><surname>Ekstrøm</surname> <given-names>CT</given-names></string-name> (<year>2021</year>). <article-title>Data-driven model building for life-course epidemiology</article-title>. <source><italic>American Journal of Epidemiology</italic></source>, <volume>190</volume>(<issue>9</issue>): <fpage>1898</fpage>–<lpage>1907</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1093/aje/kwab087" xlink:type="simple"> https://doi.org/10.1093/aje/kwab087</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Ramsey</surname> <given-names>J</given-names></string-name>, <string-name><surname>Glymour</surname> <given-names>M</given-names></string-name>, <string-name><surname>Sanchez-Romero</surname> <given-names>R</given-names></string-name>, <string-name><surname>Glymour</surname> <given-names>C</given-names></string-name> (<year>2017</year>). <article-title>A million variables and more: The fast greedy equivalence search algorithm for learning high-dimensional graphical causal models, with an application to functional magnetic resonance images</article-title>. <source><italic>International Journal of Data Science and Analytics</italic></source>, <volume>3</volume>(<issue>2</issue>): <fpage>121</fpage>–<lpage>129</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1007/s41060-016-0032-z" xlink:type="simple"> https://doi.org/10.1007/s41060-016-0032-z</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_023">
<mixed-citation publication-type="chapter"> <string-name><surname>Ramsey</surname> <given-names>JD</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>K</given-names></string-name>, <string-name><surname>Glymour</surname> <given-names>M</given-names></string-name>, <string-name><surname>Romero</surname> <given-names>RS</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>B</given-names></string-name>, <string-name><surname>Ebert-Uphoff</surname> <given-names>I</given-names></string-name>, <etal>et al.</etal> (<year>2018</year>). <chapter-title>Tetrad—a toolbox for causal discovery</chapter-title>. In: <source><italic>8th International Workshop on Climate Informatics</italic></source>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Richardson</surname> <given-names>T</given-names></string-name>, <string-name><surname>Spirtes</surname> <given-names>P</given-names></string-name> (<year>2002</year>). <article-title>Ancestral graph Markov models</article-title>. <source><italic>The Annals of Statistics</italic></source>, <volume>30</volume>(<issue>4</issue>): <fpage>962</fpage>–<lpage>1030</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1214/aos/1031689015" xlink:type="simple"> https://doi.org/10.1214/aos/1031689015</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>Scarselli</surname> <given-names>F</given-names></string-name>, <string-name><surname>Gori</surname> <given-names>M</given-names></string-name>, <string-name><surname>Tsoi</surname> <given-names>AC</given-names></string-name>, <string-name><surname>Hagenbuchner</surname> <given-names>M</given-names></string-name>, <string-name><surname>Monfardini</surname> <given-names>G</given-names></string-name> (<year>2008</year>). <article-title>The graph neural network model</article-title>. <source><italic>IEEE Transactions on Neural Networks</italic></source>, <volume>20</volume>(<issue>1</issue>): <fpage>61</fpage>–<lpage>80</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1109/TNN.2008.2005605" xlink:type="simple"> https://doi.org/10.1109/TNN.2008.2005605</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_026">
<mixed-citation publication-type="journal"> <string-name><surname>Shah</surname> <given-names>RD</given-names></string-name>, <string-name><surname>Peters</surname> <given-names>J</given-names></string-name> (<year>2020</year>). <article-title>The hardness of conditional independence testing and the generalised covariance measure</article-title>. <source><italic>The Annals of Statistics</italic></source>, <volume>48</volume>(<issue>3</issue>): <fpage>1514</fpage>–<lpage>1538</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1214/19-AOS1857" xlink:type="simple"> https://doi.org/10.1214/19-AOS1857</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Spirtes</surname> <given-names>P</given-names></string-name>, <string-name><surname>Glymour</surname> <given-names>C</given-names></string-name> (<year>1991</year>). <article-title>An algorithm for fast recovery of sparse causal graphs</article-title>. <source><italic>Social Science Computer Review</italic></source>, <volume>9</volume>(<issue>1</issue>): <fpage>62</fpage>–<lpage>72</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1177/089443939100900106" xlink:type="simple"> https://doi.org/10.1177/089443939100900106</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_028">
<mixed-citation publication-type="book"> <string-name><surname>Spirtes</surname> <given-names>P</given-names></string-name>, <string-name><surname>Glymour</surname> <given-names>CN</given-names></string-name>, <string-name><surname>Scheines</surname> <given-names>R</given-names></string-name>, <string-name><surname>Heckerman</surname> <given-names>D</given-names></string-name> (<year>2000</year>). <source><italic>Causation, Prediction, and Search</italic></source>. <publisher-name>MIT press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Von Stumm</surname> <given-names>S</given-names></string-name>, <string-name><surname>Plomin</surname> <given-names>R</given-names></string-name> (<year>2015</year>). <article-title>Socioeconomic status and the growth of intelligence from infancy through adolescence</article-title>. <source><italic>Intelligence</italic></source>, <volume>48</volume>: <fpage>30</fpage>–<lpage>36</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1016/j.intell.2014.10.002" xlink:type="simple"> https://doi.org/10.1016/j.intell.2014.10.002</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_030">
<mixed-citation publication-type="journal"> <string-name><surname>Xu</surname> <given-names>C</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>W</given-names></string-name> (<year>2021</year>). <article-title>Causal structure learning with one-dimensional convolutional neural networks</article-title>. <source><italic>IEEE Access</italic></source>, <volume>9</volume>: <fpage>162147</fpage>–<lpage>162155</lpage>. <ext-link ext-link-type="doi" xlink:href=" https://doi.org/10.1109/ACCESS.2021.3133496" xlink:type="simple"> https://doi.org/10.1109/ACCESS.2021.3133496</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_031">
<mixed-citation publication-type="chapter"> <string-name><surname>Yu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>J</given-names></string-name>, <string-name><surname>Gao</surname> <given-names>T</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>M</given-names></string-name> (<year>2019</year>). <chapter-title>Dag-gnn: Dag structure learning with graph neural networks</chapter-title>. In: <source><italic>International Conference on Machine Learning</italic></source>, <fpage>7154</fpage>–<lpage>7163</lpage>. <publisher-name>PMLR</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_032">
<mixed-citation publication-type="other"> <string-name><surname>Zheng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Aragam</surname> <given-names>B</given-names></string-name>, <string-name><surname>Ravikumar</surname> <given-names>P</given-names></string-name>, <string-name><surname>Xing</surname> <given-names>EP</given-names></string-name> (2018). Dags with no tears: Continuous optimization for structure learning. arXiv preprint: <uri>https://arxiv.org/abs/1803.01422</uri>.</mixed-citation>
</ref>
<ref id="j_jds1088_ref_033">
<mixed-citation publication-type="chapter"> <string-name><surname>Zheng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Dan</surname> <given-names>C</given-names></string-name>, <string-name><surname>Aragam</surname> <given-names>B</given-names></string-name>, <string-name><surname>Ravikumar</surname> <given-names>P</given-names></string-name>, <string-name><surname>Xing</surname> <given-names>E</given-names></string-name> (<year>2020</year>). <chapter-title>Learning sparse nonparametric dags</chapter-title>. In: <source><italic>International Conference on Artificial Intelligence and Statistics</italic></source>, <fpage>3414</fpage>–<lpage>3425</lpage>. <publisher-name>PMLR</publisher-name>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
