<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1070</article-id>
<article-id pub-id-type="doi">10.6339/22-JDS1070</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Data Science in Action</subject></subj-group></article-categories>
<title-group>
<article-title>On the Use of Deep Neural Networks for Large-Scale Spatial Prediction</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Gray</surname><given-names>Skyler D.</given-names></name><xref ref-type="aff" rid="j_jds1070_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-4654-9827</contrib-id>
<name><surname>Heaton</surname><given-names>Matthew J.</given-names></name><email xlink:href="mailto:mheaton@stat.byu.edu">mheaton@stat.byu.edu</email><xref ref-type="aff" rid="j_jds1070_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Bolintineanu</surname><given-names>Dan S.</given-names></name><xref ref-type="aff" rid="j_jds1070_aff_002">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Olson</surname><given-names>Aaron</given-names></name><xref ref-type="aff" rid="j_jds1070_aff_003">3</xref>
</contrib>
<aff id="j_jds1070_aff_001"><label>1</label>Department of Statistics, <institution>Brigham Young University</institution>, 2152 WVB, Provo, UT 84602, <country>United States</country></aff>
<aff id="j_jds1070_aff_002"><label>2</label>Fluid and Reactive Processes Department, <institution>Sandia National Laboratories</institution>, Albuquerque, NM 87185, <country>USA</country></aff>
<aff id="j_jds1070_aff_003"><label>3</label>Radiation Effects Theory Department, <institution>Sandia National Laboratories</institution>, Albuquerque, NM 87185, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:mheaton@stat.byu.edu">mheaton@stat.byu.edu</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2022</year></pub-date><pub-date pub-type="epub"><day>3</day><month>10</month><year>2022</year></pub-date><volume>20</volume><issue>4</issue><fpage>493</fpage><lpage>511</lpage><history><date date-type="received"><day>28</day><month>7</month><year>2022</year></date><date date-type="accepted"><day>27</day><month>9</month><year>2022</year></date></history>
<permissions><copyright-statement>2022 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2022</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>For spatial kriging (prediction), the Gaussian process (GP) has been the go-to tool of spatial statisticians for decades. However, the GP is plagued by computational intractability, rendering it infeasible for use on large spatial data sets. Neural networks (NNs), on the other hand, have arisen as a flexible and computationally feasible approach for capturing nonlinear relationships. To date, however, NNs have only been scarcely used for problems in spatial statistics but their use is beginning to take root. In this work, we argue for equivalence between a NN and a GP and demonstrate how to implement NNs for kriging from large spatial data. We compare the computational efficacy and predictive power of NNs with that of GP approximations across a variety of big spatial Gaussian, non-Gaussian and binary data applications of up to size <inline-formula id="j_jds1070_ineq_001"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$n={10^{6}}$]]></tex-math></alternatives></inline-formula>. Our results suggest that fully-connected NNs perform similarly to state-of-the-art, GP-approximated models for short-range predictions but can suffer for longer range predictions.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>big data</kwd>
<kwd>fully-connected neural network</kwd>
<kwd>grid search</kwd>
</kwd-group>
<funding-group><award-group><funding-source xlink:href="https://doi.org/10.13039/100000104">NASA</funding-source><award-id>80NSSC20K1594</award-id></award-group><award-group><funding-source xlink:href="https://doi.org/10.13039/100006234">Sandia</funding-source></award-group><award-group><funding-source xlink:href="https://doi.org/10.13039/100006168">Department of Energy’s National Nuclear Security Administration</funding-source><award-id>DE-NA0003525</award-id></award-group><funding-statement>This research was supported by NASA grant 80NSSC20K1594 and by the Laboratory Directed Research and Development program at Sandia National Laboratories, a multimission laboratory managed and operated by National Technology and Engineering Solutions of Sandia LLC, a wholly owned subsidiary of Honeywell International Inc. for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-NA0003525. </funding-statement></funding-group>
</article-meta>
</front>
<body/>
<back>
<ref-list id="j_jds1070_reflist_001">
<title>References</title>
<ref id="j_jds1070_ref_001">
<mixed-citation publication-type="other"> <string-name><surname>Allaire</surname> <given-names>J</given-names></string-name>, <string-name><surname>Chollet</surname> <given-names>F</given-names></string-name> (2022). <italic>keras: R Interface to ‘Keras’</italic>. R package version 2.9.0.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Banerjee</surname> <given-names>S</given-names></string-name>, <string-name><surname>Gelfand</surname> <given-names>AE</given-names></string-name>, <string-name><surname>Finley</surname> <given-names>AO</given-names></string-name>, <string-name><surname>Sang</surname> <given-names>H</given-names></string-name> (<year>2008</year>). <article-title>Gaussian predictive process models for large spatial data sets</article-title>. <source>Journal of the Royal Statistical Society, Series B, Statistical Methodology</source>, <volume>70</volume>(<issue>4</issue>): <fpage>825</fpage>–<lpage>848</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Chen</surname> <given-names>W</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Reich</surname> <given-names>BJ</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>Y</given-names></string-name> (<year>2022</year>). <article-title>Deepkriging: Spatially dependent deep neural networks for spatial prediction</article-title>. <source>Statistica Sinica</source>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.5705/ss.202021.0277" xlink:type="simple">https://doi.org/10.5705/ss.202021.0277</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Cressie</surname> <given-names>N</given-names></string-name>, <string-name><surname>Johannesson</surname> <given-names>G</given-names></string-name> (<year>2008</year>a). <article-title>Fixed rank Kriging for very large spatial data sets</article-title>. <source>Journal of the Royal Statistical Society, Series B</source>, <volume>70</volume>: <fpage>209</fpage>–<lpage>226</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Cressie</surname> <given-names>N</given-names></string-name>, <string-name><surname>Johannesson</surname> <given-names>G</given-names></string-name> (<year>2008</year>b). <article-title>Fixed rank Kriging for very large spatial data sets</article-title>. <source>Journal of the Royal Statistical Society, Series B, Statistical Methodology</source>, <volume>70</volume>(<issue>1</issue>): <fpage>209</fpage>–<lpage>226</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_006">
<mixed-citation publication-type="book"> <string-name><surname>Cressie</surname> <given-names>N</given-names></string-name>, <string-name><surname>Wikle</surname> <given-names>CK</given-names></string-name> (<year>2015</year>). <source>Statistics for Spatio-Temporal Data</source>. <publisher-name>John Wiley &amp; Sons</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Datta</surname> <given-names>A</given-names></string-name>, <string-name><surname>Banerjee</surname> <given-names>S</given-names></string-name>, <string-name><surname>Finley</surname> <given-names>AO</given-names></string-name>, <string-name><surname>Gelfand</surname> <given-names>AE</given-names></string-name> (<year>2016</year>a). <article-title>Hierarchical nearest-neighbor Gaussian process models for large geostatistical datasets</article-title>. <source>Journal of the American Statistical Association</source>, <volume>111</volume>(<issue>514</issue>): <fpage>800</fpage>–<lpage>812</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Datta</surname> <given-names>A</given-names></string-name>, <string-name><surname>Banerjee</surname> <given-names>S</given-names></string-name>, <string-name><surname>Finley</surname> <given-names>AO</given-names></string-name>, <string-name><surname>Gelfand</surname> <given-names>AE</given-names></string-name> (<year>2016</year>b). <article-title>On nearest-neighbor Gaussian process models for massive spatial data</article-title>. <source>Wiley Interdisciplinary Reviews: Computational Statistics</source>, <volume>8</volume>(<issue>5</issue>): <fpage>162</fpage>–<lpage>171</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_009">
<mixed-citation publication-type="chapter"> <string-name><surname>Diggle</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Ribeiro</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Christensen</surname> <given-names>OF</given-names></string-name> (<year>2003</year>). <chapter-title>An introduction to model-based geostatistics</chapter-title>. In: <source>Spatial Statistics and Computational Methods</source>, <fpage>43</fpage>–<lpage>86</lpage>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Diggle</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Tawn</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Moyeed</surname> <given-names>RA</given-names></string-name> (<year>1998</year>). <article-title>Model-based geostatistics</article-title>. <source>Journal of the Royal Statistical Society. Series C. Applied Statistics</source>, <volume>47</volume>(<issue>3</issue>): <fpage>299</fpage>–<lpage>350</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>El Bannany</surname> <given-names>M</given-names></string-name>, <string-name><surname>Khedr</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Sreedharan</surname> <given-names>M</given-names></string-name>, <string-name><surname>Kanakkayil</surname> <given-names>S</given-names></string-name> (<year>2021</year>). <article-title>Financial distress prediction based on multi-layer perceptron with parameter optimization</article-title>. <source>IAENG International Journal of Computer Science</source>, <volume>48</volume>: <fpage>3</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Furrer</surname> <given-names>R</given-names></string-name>, <string-name><surname>Genton</surname> <given-names>MG</given-names></string-name>, <string-name><surname>Nychka</surname> <given-names>D</given-names></string-name> (<year>2006</year>). <article-title>Covariance tapering for interpolation of large spatial datasets</article-title>. <source>Journal of Computational and Graphical Statistics</source>, <volume>15</volume>(<issue>3</issue>): <fpage>502</fpage>–<lpage>523</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Gelfand</surname> <given-names>AE</given-names></string-name>, <string-name><surname>Schliep</surname> <given-names>EM</given-names></string-name> (<year>2016</year>). <article-title>Spatial statistics and Gaussian processes: A beautiful marriage</article-title>. <source>Spatial Statistics</source>, <volume>18</volume>: <fpage>86</fpage>–<lpage>104</lpage>. <comment>Spatial Statistics Avignon: Emerging Patterns.</comment></mixed-citation>
</ref>
<ref id="j_jds1070_ref_014">
<mixed-citation publication-type="journal"> <string-name><surname>Genton</surname> <given-names>MG</given-names></string-name>, <string-name><surname>Kleiber</surname> <given-names>W</given-names></string-name> (<year>2015</year>). <article-title>Cross-covariance functions for multivariate geostatistics</article-title>. <source>Statistical Science</source>, <volume>30</volume>(<issue>2</issue>): <fpage>147</fpage>–<lpage>163</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Gerber</surname> <given-names>F</given-names></string-name>, <string-name><surname>Nychka</surname> <given-names>D</given-names></string-name> (<year>2021</year>). <article-title>Fast covariance parameter estimation of spatial Gaussian process models using neural networks</article-title>. <source>Stat</source>, <volume>10</volume>(<issue>1</issue>): <fpage>e382</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Heaton</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Datta</surname> <given-names>A</given-names></string-name>, <string-name><surname>Finley</surname> <given-names>AO</given-names></string-name>, <string-name><surname>Furrer</surname> <given-names>R</given-names></string-name>, <string-name><surname>Guinness</surname> <given-names>J</given-names></string-name>, <string-name><surname>Guhaniyogi</surname> <given-names>R</given-names></string-name>, <etal>et al.</etal> (<year>2019</year>). <article-title>A case study competition among methods for analyzing large spatial data</article-title>. <source>Journal of Agricultural, Biological, and Environmental Statistics</source>, <volume>24</volume>(<issue>3</issue>): <fpage>398</fpage>–<lpage>425</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Higdon</surname> <given-names>D</given-names></string-name> (<year>1998</year>). <article-title>A process-convolution approach to modelling temperatures in the North Atlantic Ocean</article-title>. <source>Environmental and Ecological Statistics</source>, <volume>5</volume>(<issue>2</issue>): <fpage>173</fpage>–<lpage>190</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Huang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Abdulah</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Ltaief</surname> <given-names>H</given-names></string-name>, <string-name><surname>Keyes</surname> <given-names>DE</given-names></string-name>, <string-name><surname>Genton</surname> <given-names>MG</given-names></string-name> (<year>2021</year>a). <article-title>Competition on spatial statistics for large datasets</article-title>. <source>Journal of Agricultural, Biological, and Environmental Statistics</source>, <volume>26</volume>(<issue>4</issue>): <fpage>580</fpage>–<lpage>595</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_019">
<mixed-citation publication-type="other"> <string-name><surname>Huang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Blake</surname> <given-names>LR</given-names></string-name>, <string-name><surname>Katzfuss</surname> <given-names>M</given-names></string-name>, <string-name><surname>Hammerling</surname> <given-names>DM</given-names></string-name> (2021b). Nonstationary spatial modeling of massive global satellite data. arXiv preprint: <uri>https://arxiv.org/abs/2111.13428</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_020">
<mixed-citation publication-type="journal"> <string-name><surname>Hughes</surname> <given-names>J</given-names></string-name>, <string-name><surname>Haran</surname> <given-names>M</given-names></string-name> (<year>2013</year>). <article-title>Dimension reduction and alleviation of confounding for spatial generalized linear mixed models</article-title>. <source>Journal of the Royal Statistical Society, Series B, Statistical Methodology</source>, <volume>75</volume>(<issue>1</issue>): <fpage>139</fpage>–<lpage>159</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Jabbar</surname> <given-names>H</given-names></string-name>, <string-name><surname>Khan</surname> <given-names>RZ</given-names></string-name> (<year>2015</year>). <article-title>Methods to avoid over-fitting and under-fitting in supervised machine learning (comparative study)</article-title>. <source>Computer Science, Communication and Instrumentation Devices</source>, <volume>70</volume>: <fpage>163</fpage>–<lpage>172</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Katzfuss</surname> <given-names>M</given-names></string-name> (<year>2017</year>). <article-title>A multi-resolution approximation for massive spatial datasets</article-title>. <source>Journal of the American Statistical Association</source>, <volume>112</volume>(<issue>517</issue>): <fpage>201</fpage>–<lpage>214</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Katzfuss</surname> <given-names>M</given-names></string-name>, <string-name><surname>Guinness</surname> <given-names>J</given-names></string-name> (<year>2021</year>). <article-title>A general framework for Vecchia approximations of Gaussian processes</article-title>. <source>Statistical Science</source>, <volume>36</volume>(<issue>1</issue>): <fpage>124</fpage>–<lpage>141</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Kaufman</surname> <given-names>CG</given-names></string-name>, <string-name><surname>Schervish</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Nychka</surname> <given-names>DW</given-names></string-name> (<year>2008</year>). <article-title>Covariance tapering for likelihood-based estimation in large spatial data sets</article-title>. <source>Journal of the American Statistical Association</source>, <volume>103</volume>(<issue>484</issue>): <fpage>1545</fpage>–<lpage>1555</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_025">
<mixed-citation publication-type="chapter"> <string-name><surname>Lee</surname> <given-names>J</given-names></string-name>, <string-name><surname>Sohl-dickstein</surname> <given-names>J</given-names></string-name>, <string-name><surname>Pennington</surname> <given-names>J</given-names></string-name>, <string-name><surname>Novak</surname> <given-names>R</given-names></string-name>, <string-name><surname>Schoenholz</surname> <given-names>S</given-names></string-name>, <string-name><surname>Bahri</surname> <given-names>Y</given-names></string-name> (<year>2018</year>). <chapter-title>Deep neural networks as Gaussian processes</chapter-title>. In: <source>International Conference on Learning Representations</source>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_026">
<mixed-citation publication-type="other"> <string-name><surname>Lenzi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bessac</surname> <given-names>J</given-names></string-name>, <string-name><surname>Rudi</surname> <given-names>J</given-names></string-name>, <string-name><surname>Stein</surname> <given-names>ML</given-names></string-name> (2021). Neural networks for parameter estimation in intractable models. arXiv preprint: <uri>https://arxiv.org/abs/2107.14346</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Liu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Ong</surname> <given-names>YS</given-names></string-name>, <string-name><surname>Shen</surname> <given-names>X</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>J</given-names></string-name> (<year>2020</year>). <article-title>When Gaussian process meets big data: A review of scalable gps</article-title>. <source>IEEE Transactions on Neural Networks and Learning Systems</source>, <volume>31</volume>(<issue>11</issue>): <fpage>4405</fpage>–<lpage>4423</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_028">
<mixed-citation publication-type="other"> <string-name><surname>Matthews</surname> <given-names>A</given-names></string-name>, <string-name><surname>Rowland</surname> <given-names>M</given-names></string-name>, <string-name><surname>Hron</surname> <given-names>J</given-names></string-name>, <string-name><surname>Turner</surname> <given-names>RE</given-names></string-name>, <string-name><surname>Ghahramani</surname> <given-names>Z</given-names></string-name> (2018). Gaussian process behaviour in wide deep neural networks. arXiv preprint: <uri>https://arxiv.org/abs/1804.11271</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Mesa</surname> <given-names>J</given-names></string-name>, <string-name><surname>Vasquez</surname> <given-names>DB</given-names></string-name>, <string-name><surname>Aguirre</surname> <given-names>JV</given-names></string-name>, <string-name><surname>Valencia</surname> <given-names>JSB</given-names></string-name> (<year>2019</year>). <article-title>Sensor fusion for distance estimation under disturbance with reflective optical sensors using multi layer perceptron (mlp)</article-title>. <source>IEEE Latin America Transactions</source>, <volume>17</volume>(<issue>09</issue>): <fpage>1418</fpage>–<lpage>1423</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_030">
<mixed-citation publication-type="other"> <string-name><surname>Molnar</surname> <given-names>C</given-names></string-name>, <string-name><surname>Freiesleben</surname> <given-names>T</given-names></string-name>, <string-name><surname>König</surname> <given-names>G</given-names></string-name>, <string-name><surname>Casalicchio</surname> <given-names>G</given-names></string-name>, <string-name><surname>Wright</surname> <given-names>MN</given-names></string-name>, <string-name><surname>Bischl</surname> <given-names>B</given-names></string-name> (2021). Relating the partial dependence plot and permutation feature importance to the data generating process. arXiv preprint: <uri>https://arxiv.org/abs/2109.01433</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_031">
<mixed-citation publication-type="other"> <string-name><surname>Neal</surname> <given-names>RM</given-names></string-name> (1994). Priors for infinite networks (tech. rep. no. crg-tr-94-1). <italic>University of Toronto</italic>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_032">
<mixed-citation publication-type="journal"> <string-name><surname>Nuanmeesri</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sriurai</surname> <given-names>W</given-names></string-name> (<year>2021</year>). <article-title>Multi-layer perceptron neural network model development for chili pepper disease diagnosis using filter and wrapper feature selection methods</article-title>. <source>Engineering, Technology &amp; Applied Science Research</source>, <volume>11</volume>(<issue>5</issue>): <fpage>7714</fpage>–<lpage>7719</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_033">
<mixed-citation publication-type="other"> <string-name><surname>Nwankpa</surname> <given-names>C</given-names></string-name>, <string-name><surname>Ijomah</surname> <given-names>W</given-names></string-name>, <string-name><surname>Gachagan</surname> <given-names>A</given-names></string-name>, <string-name><surname>Marshall</surname> <given-names>S</given-names></string-name> (2018). Activation functions: Comparison of trends in practice and research for deep learning. arXiv preprint: <uri>https://arxiv.org/abs/1811.03378</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_034">
<mixed-citation publication-type="journal"> <string-name><surname>Nychka</surname> <given-names>D</given-names></string-name>, <string-name><surname>Bandyopadhyay</surname> <given-names>S</given-names></string-name>, <string-name><surname>Hammerling</surname> <given-names>D</given-names></string-name>, <string-name><surname>Lindgren</surname> <given-names>F</given-names></string-name>, <string-name><surname>Sain</surname> <given-names>S</given-names></string-name> (<year>2015</year>). <article-title>A multiresolution Gaussian process model for the analysis of large spatial datasets</article-title>. <source>Journal of Computational and Graphical Statistics</source>, <volume>24</volume>(<issue>2</issue>): <fpage>579</fpage>–<lpage>599</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_035">
<mixed-citation publication-type="book"> <collab>R Core Team</collab> (<year>2021</year>). <source>R: A Language and Environment for Statistical Computing</source>. <publisher-name>R Foundation for Statistical Computing</publisher-name>, <publisher-loc>Vienna, Austria</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_036">
<mixed-citation publication-type="other"> <string-name><surname>Ramachandran</surname> <given-names>P</given-names></string-name>, <string-name><surname>Zoph</surname> <given-names>B</given-names></string-name>, <string-name><surname>Le</surname> <given-names>QV</given-names></string-name> (2017). Searching for activation functions. arXiv preprint: <uri>https://arxiv.org/abs/1710.05941</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_037">
<mixed-citation publication-type="journal"> <string-name><surname>Sang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>JZ</given-names></string-name> (<year>2012</year>). <article-title>A full scale approximation of covariance functions for large spatial data sets</article-title>. <source>Journal of the Royal Statistical Society, Series B, Statistical Methodology</source>, <volume>74</volume>(<issue>1</issue>): <fpage>111</fpage>–<lpage>132</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_038">
<mixed-citation publication-type="other"> <string-name><surname>Sauer</surname> <given-names>A</given-names></string-name>, <string-name><surname>Cooper</surname> <given-names>A</given-names></string-name>, <string-name><surname>Gramacy</surname> <given-names>RB</given-names></string-name> (2022). Vecchia-approximated deep Gaussian processes for computer experiments. arXiv preprint: <uri>https://arxiv.org/abs/2204.02904</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_039">
<mixed-citation publication-type="journal"> <string-name><surname>Sauer</surname> <given-names>A</given-names></string-name>, <string-name><surname>Gramacy</surname> <given-names>RB</given-names></string-name>, <string-name><surname>Higdon</surname> <given-names>D</given-names></string-name> (<year>2022</year>). <article-title>Active learning for deep Gaussian process surrogates</article-title>. <source>Technometrics</source>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/00401706.2021.2008505" xlink:type="simple">https://doi.org/10.1080/00401706.2021.2008505</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_040">
<mixed-citation publication-type="journal"> <string-name><surname>Victoria</surname> <given-names>AH</given-names></string-name>, <string-name><surname>Maragatham</surname> <given-names>G</given-names></string-name> (<year>2021</year>). <article-title>Automatic tuning of hyperparameters using Bayesian optimization</article-title>. <source>Evolving Systems</source>, <volume>12</volume>(<issue>1</issue>): <fpage>217</fpage>–<lpage>223</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_041">
<mixed-citation publication-type="other"> <string-name><surname>Wikle</surname> <given-names>CK</given-names></string-name>, <string-name><surname>Zammit-Mangion</surname> <given-names>A</given-names></string-name> (2022). Statistical deep learning for spatial and spatio-temporal data. arXiv preprint: <uri>https://arxiv.org/abs/2206.02218</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_042">
<mixed-citation publication-type="other"> <string-name><surname>Xu</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>M</given-names></string-name>, <string-name><surname>Li</surname> <given-names>J</given-names></string-name>, <string-name><surname>SS Kawarabayashi Ki</surname> <given-names>D</given-names></string-name>, <string-name><surname>Jegelka</surname> <given-names>S</given-names></string-name> (2020). How neural networks extrapolate: From feedforward to graph neural networks. arXiv preprint: <uri>https://arxiv.org/abs/2009.11848</uri>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_043">
<mixed-citation publication-type="chapter"> <string-name><surname>Yarotsky</surname> <given-names>D</given-names></string-name> (<year>2018</year>). <chapter-title>Optimal approximation of continuous functions by very deep relu networks</chapter-title>. In: <source>Conference on Learning Theory</source> (<string-name><given-names>S</given-names> <surname>Bubeck</surname></string-name>, <string-name><given-names>V</given-names> <surname>Perchet</surname></string-name>, <string-name><given-names>P</given-names> <surname>Rigollet</surname></string-name>, eds.), <fpage>639</fpage>–<lpage>649</lpage>. <publisher-name>PMLR</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_044">
<mixed-citation publication-type="journal"> <string-name><surname>Zammit-Mangion</surname> <given-names>A</given-names></string-name>, <string-name><surname>Ng</surname> <given-names>TLJ</given-names></string-name>, <string-name><surname>Vu</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Filippone</surname> <given-names>M</given-names></string-name> (<year>2021</year>). <article-title>Deep compositional spatial models</article-title>. <source>Journal of the American Statistical Association</source>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.2021.1887741" xlink:type="simple">https://doi.org/10.1080/01621459.2021.1887741</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_045">
<mixed-citation publication-type="journal"> <string-name><surname>Zammit-Mangion</surname> <given-names>A</given-names></string-name>, <string-name><surname>Wikle</surname> <given-names>CK</given-names></string-name> (<year>2020</year>). <article-title>Deep integro-difference equation models for spatio-temporal forecasting</article-title>. <source>Spatial Statistics</source>, <volume>37</volume>: <fpage>100408</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1070_ref_046">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>P</given-names></string-name>, <string-name><surname>Jia</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Gao</surname> <given-names>J</given-names></string-name>, <string-name><surname>Song</surname> <given-names>W</given-names></string-name>, <string-name><surname>Leung</surname> <given-names>H</given-names></string-name> (<year>2018</year>). <article-title>Short-term rainfall forecasting using multi-layer perceptron</article-title>. <source>IEEE Transactions on Big Data</source>, <volume>6</volume>(<issue>1</issue>): <fpage>93</fpage>–<lpage>106</lpage>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
