<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1127</article-id>
<article-id pub-id-type="doi">10.6339/24-JDS1127</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Interaction Selection and Prediction Performance in High-Dimensional Data: A Comparative Study of Statistical and Tree-Based Methods</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Nzekwe</surname><given-names>Chinedu J.</given-names></name><email xlink:href="mailto:cjnzekwe@ncat.edu">cjnzekwe@ncat.edu</email><xref ref-type="aff" rid="j_jds1127_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Kim</surname><given-names>Seongtae</given-names></name><xref ref-type="aff" rid="j_jds1127_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-8113-702X</contrib-id>
<name><surname>Mostafa</surname><given-names>Sayed A.</given-names></name><xref ref-type="aff" rid="j_jds1127_aff_001">1</xref>
</contrib>
<aff id="j_jds1127_aff_001"><label>1</label>Department of Mathematics and Statistics, <institution>North Carolina Agricultural and Technical State University</institution>, Greensboro, NC, 27411, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:cjnzekwe@ncat.edu">cjnzekwe@ncat.edu</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2024</year></pub-date><pub-date pub-type="epub"><day>22</day><month>5</month><year>2024</year></pub-date><volume>22</volume><issue>2</issue><fpage>259</fpage><lpage>279</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1127_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>The supplementary material includes the following: (1) README: a brief explanation of the supplementary material; (2) application datasets; (3) code files; and (4) the description of the RIT algorithm and additional simulation results.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>1</day><month>8</month><year>2023</year></date><date date-type="accepted"><day>26</day><month>3</month><year>2024</year></date></history>
<permissions><copyright-statement>2024 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2024</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Predictive modeling often ignores interaction effects among predictors in high-dimensional data because of analytical and computational challenges. Recent methodological and computational advances have spurred research on interaction selection. In this study, we investigate the performance of two types of predictive algorithms that can perform interaction selection. Specifically, we compare the predictive performance and interaction selection accuracy of penalty-based and tree-based predictive algorithms. The penalty-based algorithms included in our comparative study are the regularization path algorithm under the marginality principle (RAMP), the least absolute shrinkage and selection operator (LASSO), the smoothly clipped absolute deviation (SCAD), and the minimax concave penalty (MCP). The tree-based algorithms considered are random forest (RF) and iterative random forest (iRF). We evaluate the effectiveness of these algorithms under various regression and classification models with varying structures and dimensions. We assess predictive performance using the mean squared error for regression and accuracy, sensitivity, specificity, balanced accuracy, and F1 score for classification. We use interaction coverage to judge each algorithm’s efficacy for interaction selection. Our findings reveal that the effectiveness of the selected algorithms varies depending on the number of predictors (data dimension) and the structure of the data-generating model, i.e., linear or nonlinear, hierarchical or non-hierarchical. Each algorithm included in this study was favored in at least one scenario. However, from the general pattern, we are able to recommend one or more specific algorithms for particular scenarios.
Our analysis helps clarify each algorithm’s strengths and limitations, offering guidance to researchers and data analysts in choosing an appropriate algorithm for their predictive modeling task based on their data structure.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>interaction selection</kwd>
<kwd>iRF</kwd>
<kwd>LASSO</kwd>
<kwd>predictive modeling</kwd>
<kwd>RAMP</kwd>
<kwd>RF</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1127_reflist_001">
<title>References</title>
<ref id="j_jds1127_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Antoniou</surname> <given-names>A</given-names></string-name>, <string-name><surname>Pharoah</surname> <given-names>P</given-names></string-name>, <string-name><surname>Narod</surname> <given-names>S</given-names></string-name>, <string-name><surname>Risch</surname> <given-names>H</given-names></string-name>, <string-name><surname>Eyfjörd</surname> <given-names>J</given-names></string-name>, <string-name><surname>Hopper</surname> <given-names>J</given-names></string-name>, <etal>et al.</etal> (<year>2003</year>). <article-title>Average risks of breast and ovarian cancer associated with BRCA1 or BRCA2 mutations detected in case series unselected for family history: A combined analysis of 22 studies</article-title>. <source><italic>American Journal of Human Genetics</italic></source>, <volume>72</volume>: <fpage>1117</fpage>–<lpage>1130</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1086/375033" xlink:type="simple">https://doi.org/10.1086/375033</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_002">
<mixed-citation publication-type="other"> <string-name><surname>Basu</surname> <given-names>S</given-names></string-name>, <string-name><surname>Kumbier</surname> <given-names>K</given-names></string-name> (<year>2018</year>). <italic>iRF: Iterative Random Forests</italic>. R package version 3.0.0.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Basu</surname> <given-names>S</given-names></string-name>, <string-name><surname>Kumbier</surname> <given-names>K</given-names></string-name>, <string-name><surname>Brown</surname> <given-names>JB</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>B</given-names></string-name> (<year>2018</year>). <article-title>Iterative random forests to discover predictive and stable high-order interactions</article-title>. <source><italic>Proceedings of the National Academy of Sciences</italic></source>, <volume>115</volume>(<issue>8</issue>): <fpage>1943</fpage>–<lpage>1948</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1073/pnas.1711236115" xlink:type="simple">https://doi.org/10.1073/pnas.1711236115</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Bien</surname> <given-names>J</given-names></string-name>, <string-name><surname>Taylor</surname> <given-names>J</given-names></string-name>, <string-name><surname>Tibshirani</surname> <given-names>R</given-names></string-name> (<year>2013</year>). <article-title>A lasso for hierarchical interactions</article-title>. <source><italic>The Annals of Statistics</italic></source>, <volume>41</volume>(<issue>3</issue>): <fpage>1111</fpage>–<lpage>1141</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Breheny</surname> <given-names>P</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>J</given-names></string-name> (<year>2011</year>). <article-title>Coordinate descent algorithms for nonconvex penalized regression, with applications to biological feature selection</article-title>. <source><italic>Annals of Applied Statistics</italic></source>, <volume>5</volume>(<issue>1</issue>): <fpage>232</fpage>–<lpage>253</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Breheny</surname> <given-names>P</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>J</given-names></string-name> (<year>2015</year>). <article-title>Group descent algorithms for nonconvex penalized linear and logistic regression models with grouped predictors</article-title>. <source><italic>Statistics and Computing</italic></source>, <volume>25</volume>(<issue>2</issue>): <fpage>173</fpage>–<lpage>187</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s11222-013-9424-2" xlink:type="simple">https://doi.org/10.1007/s11222-013-9424-2</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Breiman</surname> <given-names>L</given-names></string-name> (<year>1996</year>). <article-title>Bagging predictors</article-title>. <source><italic>Machine Learning</italic></source>, <volume>24</volume>(<issue>2</issue>): <fpage>123</fpage>–<lpage>140</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Breiman</surname> <given-names>L</given-names></string-name> (<year>2001</year>). <article-title>Random forests</article-title>. <source><italic>Machine Learning</italic></source>, <volume>45</volume>(<issue>1</issue>): <fpage>5</fpage>–<lpage>32</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1023/A:1010933404324" xlink:type="simple">https://doi.org/10.1023/A:1010933404324</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Breiman</surname> <given-names>L</given-names></string-name>, <string-name><surname>Friedman</surname> <given-names>JH</given-names></string-name>, <string-name><surname>Olshen</surname> <given-names>RA</given-names></string-name>, <string-name><surname>Stone</surname> <given-names>CJ</given-names></string-name> (<year>1984</year>). <article-title>Classification and regression trees</article-title>. <source><italic>Biometrics</italic></source>, <volume>40</volume>: <fpage>874</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/2530946" xlink:type="simple">https://doi.org/10.2307/2530946</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Chipman</surname> <given-names>H</given-names></string-name>, <string-name><surname>Hamada</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>CF</given-names></string-name> (<year>1997</year>). <article-title>A Bayesian variable-selection approach for analyzing designed experiments with complex aliasing</article-title>. <source><italic>Technometrics</italic></source>, <volume>39</volume>(<issue>4</issue>): <fpage>372</fpage>–<lpage>381</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/00401706.1997.10485156" xlink:type="simple">https://doi.org/10.1080/00401706.1997.10485156</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Choi</surname> <given-names>N</given-names></string-name>, <string-name><surname>Li</surname> <given-names>W</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>J</given-names></string-name> (<year>2010</year>). <article-title>Variable selection with the strong heredity constraint and its oracle property</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>105</volume>: <fpage>354</fpage>–<lpage>364</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/jasa.2010.tm08281" xlink:type="simple">https://doi.org/10.1198/jasa.2010.tm08281</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Cordell</surname> <given-names>D</given-names></string-name>, <string-name><surname>Drangert</surname> <given-names>JO</given-names></string-name>, <string-name><surname>White</surname> <given-names>S</given-names></string-name> (<year>2009</year>). <article-title>The story of phosphorus: Global food security and food for thought</article-title>. <source><italic>Global Environmental Change</italic></source>, <volume>19</volume>: <fpage>292</fpage>–<lpage>305</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.gloenvcha.2008.10.009" xlink:type="simple">https://doi.org/10.1016/j.gloenvcha.2008.10.009</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Deng</surname> <given-names>CX</given-names></string-name>, <string-name><surname>Brodie</surname> <given-names>SG</given-names></string-name> (<year>2000</year>). <article-title>Roles of BRCA1 and its interacting proteins</article-title>. <source><italic>BioEssays</italic></source>, <volume>22</volume>(<issue>8</issue>): <fpage>728</fpage>–<lpage>737</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1002/1521-1878(200008)22:8&lt;728::AID-BIES6&gt;3.0.CO;2-B" xlink:type="simple">https://doi.org/10.1002/1521-1878(200008)22:8&lt;728::AID-BIES6&gt;3.0.CO;2-B</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_014">
<mixed-citation publication-type="journal"> <string-name><surname>Dong</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>Y</given-names></string-name> (<year>2022</year>). <article-title>Nonparametric interaction selection</article-title>. <source><italic>Statistica Sinica</italic></source>, <volume>32</volume>: <fpage>1563</fpage>–<lpage>1582</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Donoho</surname> <given-names>D</given-names></string-name> (<year>2000</year>). <article-title>High-dimensional data analysis: The curses and blessings of dimensionality</article-title>. <source><italic>AMS Math Challenges Lecture</italic></source>, <volume>1</volume>(<issue>2000</issue>): <fpage>1</fpage>–<lpage>32</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Evans</surname> <given-names>JD</given-names></string-name> (<year>2006</year>). <article-title>Beepath: An ordered quantitative-PCR array for exploring honey bee immunity and disease</article-title>. <source><italic>Journal of Invertebrate Pathology</italic></source>, <volume>93</volume>(<issue>2</issue>): <fpage>135</fpage>–<lpage>139</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.jip.2006.04.004" xlink:type="simple">https://doi.org/10.1016/j.jip.2006.04.004</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Fan</surname> <given-names>J</given-names></string-name>, <string-name><surname>Li</surname> <given-names>R</given-names></string-name> (<year>2001</year>). <article-title>Variable selection via nonconcave penalized likelihood and its oracle properties</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>96</volume>(<issue>456</issue>): <fpage>1348</fpage>–<lpage>1360</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/016214501753382273" xlink:type="simple">https://doi.org/10.1198/016214501753382273</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_018">
<mixed-citation publication-type="chapter"> <string-name><surname>Fan</surname> <given-names>J</given-names></string-name>, <string-name><surname>Li</surname> <given-names>R</given-names></string-name> (<year>2006</year>). <chapter-title>Statistical Challenges with High Dimensionality: Feature Selection in Knowledge Discovery</chapter-title>. In <string-name><given-names>M</given-names> <surname>Sanz-Solé</surname></string-name>, <string-name><given-names>J</given-names> <surname>Soria</surname></string-name>, <string-name><given-names>JL</given-names> <surname>Varona</surname></string-name> &amp; <string-name><given-names>J</given-names> <surname>Verdera</surname></string-name>. <source><italic>Proc. Madrid Int. Congress of Mathematicians</italic></source>, <volume>3</volume>: <fpage>595</fpage>–<lpage>622</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Fan</surname> <given-names>J</given-names></string-name>, <string-name><surname>Lv</surname> <given-names>J</given-names></string-name> (<year>2010</year>). <article-title>A selective overview of variable selection in high dimensional feature space</article-title>. <source><italic>Statistica Sinica</italic></source>, <volume>20</volume>: <fpage>101</fpage>–<lpage>148</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_020">
<mixed-citation publication-type="other"> <string-name><surname>Feng</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Hao</surname> <given-names>N</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>HH</given-names></string-name> (<year>2020</year>). <italic>RAMP: Regularized Generalized Linear Models with Interaction Effects</italic>. R package version 2.0.2.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Friedman</surname> <given-names>J</given-names></string-name>, <string-name><surname>Tibshirani</surname> <given-names>R</given-names></string-name>, <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name> (<year>2010</year>). <article-title>Regularization paths for generalized linear models via coordinate descent</article-title>. <source><italic>Journal of Statistical Software</italic></source>, <volume>33</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>22</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18637/jss.v033.i01" xlink:type="simple">https://doi.org/10.18637/jss.v033.i01</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Hao</surname> <given-names>N</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>HH</given-names></string-name> (<year>2018</year>). <article-title>Model selection for high-dimensional quadratic regression via regularization</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>113</volume>(<issue>522</issue>): <fpage>615</fpage>–<lpage>625</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.2016.1264956" xlink:type="simple">https://doi.org/10.1080/01621459.2016.1264956</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Hao</surname> <given-names>N</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>HH</given-names></string-name> (<year>2014</year>). <article-title>Interaction screening for ultrahigh-dimensional data</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>109</volume>(<issue>507</issue>): <fpage>1285</fpage>–<lpage>1301</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.2014.881741" xlink:type="simple">https://doi.org/10.1080/01621459.2014.881741</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Hao</surname> <given-names>N</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>HH</given-names></string-name> (<year>2017</year>). <article-title>A note on high-dimensional linear regression with interactions</article-title>. <source><italic>American Statistician</italic></source>, <volume>71</volume>(<issue>4</issue>): <fpage>291</fpage>–<lpage>297</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/00031305.2016.1264311" xlink:type="simple">https://doi.org/10.1080/00031305.2016.1264311</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name>, <string-name><surname>Tibshirani</surname> <given-names>R</given-names></string-name> (<year>1990</year>). <article-title>Exploring the nature of covariate effects in the proportional hazards model</article-title>. <source><italic>Biometrics</italic></source>, <volume>46</volume>(<issue>4</issue>): <fpage>1005</fpage>–<lpage>1016</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/2532444" xlink:type="simple">https://doi.org/10.2307/2532444</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_026">
<mixed-citation publication-type="journal"> <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name>, <string-name><surname>Tibshirani</surname> <given-names>R</given-names></string-name>, <string-name><surname>Friedman</surname> <given-names>J</given-names></string-name>, <string-name><surname>Franklin</surname> <given-names>J</given-names></string-name> (<year>2004</year>). <article-title>The elements of statistical learning: Data mining, inference, and prediction</article-title>. <source><italic>The Mathematical Intelligencer</italic></source>, <volume>27</volume>: <fpage>83</fpage>–<lpage>85</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Jain</surname> <given-names>R</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>W</given-names></string-name> (<year>2021</year>). <article-title>HDSI: High dimensional selection with interactions algorithm on feature selection and testing</article-title>. <source><italic>PLoS ONE</italic></source>, <volume>16</volume>(<issue>2</issue>): <fpage>e0246159</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1371/journal.pone.0246159" xlink:type="simple">https://doi.org/10.1371/journal.pone.0246159</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_028">
<mixed-citation publication-type="other"> <string-name><surname>Kong</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Li</surname> <given-names>D</given-names></string-name>, <string-name><surname>Fan</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Lv</surname> <given-names>J</given-names></string-name> (<year>2017</year>). Interaction pursuit in high-dimensional multi-response regression via distance correlation. <italic>ArXiv:Methodology.</italic></mixed-citation>
</ref>
<ref id="j_jds1127_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Kooperberg</surname> <given-names>C</given-names></string-name>, <string-name><surname>Leblanc</surname> <given-names>M</given-names></string-name> (<year>2008</year>). <article-title>Increasing the power of identifying gene-gene interactions in genome-wide association studies</article-title>. <source><italic>Genetic Epidemiology</italic></source>, <volume>32</volume>: <fpage>255</fpage>–<lpage>263</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1002/gepi.20300" xlink:type="simple">https://doi.org/10.1002/gepi.20300</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_030">
<mixed-citation publication-type="journal"> <string-name><surname>Kotsiantis</surname> <given-names>S</given-names></string-name>, <string-name><surname>Kanellopoulos</surname> <given-names>D</given-names></string-name> (<year>2012</year>). <article-title>Combining bagging, boosting, and random subspace ensembles for regression problems</article-title>. <source><italic>International Journal of Innovative Computing, Information &amp; Control: IJICIC.</italic></source> <fpage>3953</fpage>–<lpage>3961</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_031">
<mixed-citation publication-type="journal"> <string-name><surname>Kuchenbaecker</surname> <given-names>K</given-names></string-name>, <string-name><surname>Hopper</surname> <given-names>J</given-names></string-name>, <string-name><surname>Barnes</surname> <given-names>D</given-names></string-name>, <string-name><surname>Phillips</surname> <given-names>KA</given-names></string-name>, <string-name><surname>Mooij</surname> <given-names>T</given-names></string-name>, <string-name><surname>Roos-Blom</surname> <given-names>MJ</given-names></string-name>, <etal>et al.</etal> (<year>2017</year>). <article-title>Risks of breast, ovarian, and contralateral breast cancer for BRCA1 and BRCA2 mutation carriers</article-title>. <source><italic>JAMA</italic></source>, <volume>317</volume>: <fpage>2402</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1001/jama.2017.7112" xlink:type="simple">https://doi.org/10.1001/jama.2017.7112</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_032">
<mixed-citation publication-type="journal"> <string-name><surname>Liaw</surname> <given-names>A</given-names></string-name>, <string-name><surname>Wiener</surname> <given-names>M</given-names></string-name> (<year>2002</year>). <article-title>Classification and regression by randomForest</article-title>. <source><italic>R News</italic></source>, <volume>2</volume>(<issue>3</issue>): <fpage>18</fpage>–<lpage>22</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_033">
<mixed-citation publication-type="journal"> <string-name><surname>Manolio</surname> <given-names>TA</given-names></string-name>, <string-name><surname>Collins</surname> <given-names>FS</given-names></string-name> (<year>2007</year>). <article-title>Genes, environment, health, and disease</article-title>. <source><italic>Human Heredity</italic></source>, <volume>63</volume>(<issue>2</issue>): <fpage>63</fpage>–<lpage>66</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1159/000099178" xlink:type="simple">https://doi.org/10.1159/000099178</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_034">
<mixed-citation publication-type="journal"> <string-name><surname>McCullagh</surname> <given-names>P</given-names></string-name> (<year>2002</year>). <article-title>What is a statistical model?</article-title> <source><italic>The Annals of Statistics</italic></source>, <volume>30</volume>(<issue>5</issue>): <fpage>1225</fpage>–<lpage>1267</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/aos/1035844977" xlink:type="simple">https://doi.org/10.1214/aos/1035844977</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_035">
<mixed-citation publication-type="journal"> <string-name><surname>Meinshausen</surname> <given-names>N</given-names></string-name> (<year>2010</year>). <article-title>Node harvest</article-title>. <source><italic>Annals of Applied Statistics</italic></source>, <volume>4</volume>(<issue>4</issue>): <fpage>2049</fpage>–<lpage>2072</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/10-AOAS367" xlink:type="simple">https://doi.org/10.1214/10-AOAS367</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_036">
<mixed-citation publication-type="journal"> <string-name><surname>Nelder</surname> <given-names>JA</given-names></string-name> (<year>1977</year>). <article-title>A reformulation of linear models</article-title>. <source><italic>Journal of the Royal Statistical Society. Series A. General</italic></source>, <volume>140</volume>(<issue>1</issue>): <fpage>48</fpage>–<lpage>77</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/2344517" xlink:type="simple">https://doi.org/10.2307/2344517</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_037">
<mixed-citation publication-type="journal"> <string-name><surname>Shah</surname> <given-names>RD</given-names></string-name>, <string-name><surname>Meinshausen</surname> <given-names>N</given-names></string-name> (<year>2014</year>). <article-title>Random intersection trees</article-title>. <source><italic>Journal of Machine Learning Research</italic></source>, <volume>15</volume>(<issue>1</issue>): <fpage>629</fpage>–<lpage>654</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_038">
<mixed-citation publication-type="journal"> <string-name><surname>Tibshirani</surname> <given-names>R</given-names></string-name> (<year>1996</year>). <article-title>Regression shrinkage and selection via the lasso</article-title>. <source><italic>Journal of the Royal Statistical Society, Series B, Methodological</italic></source>, <volume>58</volume>(<issue>1</issue>): <fpage>267</fpage>–<lpage>288</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/j.2517-6161.1996.tb02080.x" xlink:type="simple">https://doi.org/10.1111/j.2517-6161.1996.tb02080.x</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_039">
<mixed-citation publication-type="journal"> <string-name><surname>Ho</surname> <given-names>TK</given-names></string-name> (<year>1998</year>). <article-title>The random subspace method for constructing decision forests</article-title>. <source><italic>IEEE Transactions on Pattern Analysis and Machine Intelligence</italic></source>, <volume>20</volume>(<issue>8</issue>): <fpage>832</fpage>–<lpage>844</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1109/34.709601" xlink:type="simple">https://doi.org/10.1109/34.709601</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_040">
<mixed-citation publication-type="journal"> <string-name><surname>Van der Laan</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Polley</surname> <given-names>EC</given-names></string-name>, <string-name><surname>Hubbard</surname> <given-names>AE</given-names></string-name> (<year>2007</year>). <article-title>Super learner</article-title>. <source><italic>Statistical Applications in Genetics and Molecular Biology</italic></source>, <volume>6</volume>(<issue>2007</issue>): <fpage>25</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1127_ref_041">
<mixed-citation publication-type="book"> <string-name><surname>Wolberg</surname> <given-names>W</given-names></string-name>, <string-name><surname>Mangasarian</surname> <given-names>O</given-names></string-name>, <string-name><surname>Street</surname> <given-names>N</given-names></string-name>, <string-name><surname>Street</surname> <given-names>W</given-names></string-name> (<year>1995</year>). <source><italic>Breast Cancer Wisconsin (Diagnostic). UCI Machine Learning Repository</italic></source>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.24432/C5DW2B" xlink:type="simple">https://doi.org/10.24432/C5DW2B</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_042">
<mixed-citation publication-type="journal"> <string-name><surname>Yuan</surname> <given-names>M</given-names></string-name>, <string-name><surname>Joseph</surname> <given-names>VR</given-names></string-name>, <string-name><surname>Zou</surname> <given-names>H</given-names></string-name> (<year>2009</year>). <article-title>Structured variable selection and estimation</article-title>. <source><italic>Annals of Applied Statistics</italic></source>, <volume>3</volume>(<issue>4</issue>): <fpage>1738</fpage>–<lpage>1757</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/09-AOAS254" xlink:type="simple">https://doi.org/10.1214/09-AOAS254</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_043">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>CH</given-names></string-name> (<year>2010</year>). <article-title>Nearly unbiased variable selection under minimax concave penalty</article-title>. <source><italic>The Annals of Statistics</italic></source>, <volume>38</volume>(<issue>2</issue>): <fpage>894</fpage>–<lpage>942</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/09-AOS729" xlink:type="simple">https://doi.org/10.1214/09-AOS729</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_044">
<mixed-citation publication-type="journal"> <string-name><surname>Zhao</surname> <given-names>P</given-names></string-name>, <string-name><surname>Rocha</surname> <given-names>G</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>B</given-names></string-name> (<year>2009</year>). <article-title>The composite absolute penalties family for grouped and hierarchical variable selection</article-title>. <source><italic>The Annals of Statistics</italic></source>, <volume>37</volume>(<issue>6A</issue>): <fpage>3468</fpage>–<lpage>3497</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/07-AOS584" xlink:type="simple">https://doi.org/10.1214/07-AOS584</ext-link></mixed-citation>
</ref>
<ref id="j_jds1127_ref_045">
<mixed-citation publication-type="journal"> <string-name><surname>Zou</surname> <given-names>H</given-names></string-name> (<year>2006</year>). <article-title>The adaptive lasso and its oracle properties</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>101</volume>(<issue>476</issue>): <fpage>1418</fpage>–<lpage>1429</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/016214506000000735" xlink:type="simple">https://doi.org/10.1198/016214506000000735</ext-link></mixed-citation>
</ref>
</ref-list>
</back>
</article>
