<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1152</article-id>
<article-id pub-id-type="doi">10.6339/24-JDS1152</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Variable Importance Measures for Multivariate Random Forests</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0001-5123-2512</contrib-id>
<name><surname>Sikdar</surname><given-names>Sharmistha</given-names></name><email xlink:href="mailto:sharmistha.sikdar@tuck.dartmouth.edu">sharmistha.sikdar@tuck.dartmouth.edu</email><xref ref-type="aff" rid="j_jds1152_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Hooker</surname><given-names>Giles</given-names></name><xref ref-type="aff" rid="j_jds1152_aff_002">2</xref><xref ref-type="fn" rid="j_jds1152_fn_001">†</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Kadiyali</surname><given-names>Vrinda</given-names></name><xref ref-type="aff" rid="j_jds1152_aff_003">3</xref><xref ref-type="fn" rid="j_jds1152_fn_001">†</xref>
</contrib>
<aff id="j_jds1152_aff_001"><label>1</label><institution>Tuck School of Business at Dartmouth, Marketing Department</institution>, Hanover, NH,<country>USA</country></aff>
<aff id="j_jds1152_aff_002"><label>2</label>Wharton School of Business, Department of Statistics and Data Science, <institution>U. Pennsylvania</institution>, Philadephia, PA, <country>USA</country></aff>
<aff id="j_jds1152_aff_003"><label>3</label>SC Johnson College of Business, Marketing Department, <institution>Cornell University</institution>, Ithaca, NY, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:sharmistha.sikdar@tuck.dartmouth.edu">sharmistha.sikdar@tuck.dartmouth.edu</ext-link>.</corresp><fn id="j_jds1152_fn_001"><label>†</label>
<p>This work is part of the first author’s dissertation research with the second and third authors as dissertation advisors.</p></fn>
</author-notes>
<pub-date pub-type="ppub"><year>2025</year></pub-date><pub-date pub-type="epub"><day>18</day><month>9</month><year>2024</year></pub-date><volume>23</volume><issue>1</issue><fpage>243</fpage><lpage>263</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1152_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>In our Online Supplement, we have included pseudo-codes on the MVRF ensemble build using sub-bagging procedure, proposed SI-based VIMs with significant splits, and the proposed RFE strategy of our iterative variable selection method. We have also included the variable choices in the simulation design; box plots and confidence intervals of top features selected by our proposed VIMs from the Amazon application on Luggage category.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>31</day><month>10</month><year>2023</year></date><date date-type="accepted"><day>24</day><month>8</month><year>2024</year></date></history>
<permissions><copyright-statement>2025 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2025</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Multivariate random forests (or MVRFs) are an extension of tree-based ensembles to examine multivariate responses. MVRF can be particularly helpful where some of the responses exhibit sparse (e.g., zero-inflated) distributions, making borrowing strength from correlated features attractive. Tree-based algorithms select features using variable importance measures (VIMs) that score each covariate based on the strength of dependence of the model on that variable. In this paper, we develop and propose new VIMs for MVRFs. Specifically, we focus on the variable’s ability to achieve split improvement, i.e., the difference in the responses between the left and right nodes obtained after splitting the parent node, for a multivariate response. Our proposed VIMs are an improvement over the default naïve VIM in existing software and allow us to investigate the strength of dependence both globally and on a per-response basis. Our simulation studies show that our proposed VIM recovers the true predictors better than naïve measures. We demonstrate usage of the VIMs for variable selection in two empirical applications; the first is on Amazon Marketplace data to predict Buy Box prices of multiple brands in a category, and the second is on ecology data to predict co-occurrence of multiple, rare bird species. A feature of both data sets is that some outcomes are sparse — exhibiting a substantial proportion of zeros or fixed values. In both cases, the proposed VIMs when used for variable screening give superior predictive accuracy over naïve measures.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>multivariate response problems</kwd>
<kwd>multivariate tree-based ensembles</kwd>
<kwd>split improvement</kwd>
<kwd>variable selection</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1152_reflist_001">
<title>References</title>
<ref id="j_jds1152_ref_001">
<mixed-citation publication-type="chapter"> <string-name><surname>Adler</surname> <given-names>P</given-names></string-name>, <string-name><surname>Kleinhesselink</surname> <given-names>AR</given-names></string-name>, <string-name><surname>Hooker</surname> <given-names>G</given-names></string-name>, <string-name><surname>Teller</surname> <given-names>BJ</given-names></string-name>, <string-name><surname>Ellner</surname> <given-names>S</given-names></string-name>, <string-name><surname>Taylor</surname> <given-names>JB</given-names></string-name> (<year>2017</year>). <chapter-title>Weak interspecific interactions in a sagebrush steppe: Evidence from observations, models, and experiments</chapter-title>. In: <source><italic>2017 ESA Annual Meeting (August 6–11)</italic></source>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_002">
<mixed-citation publication-type="other"> Amazon - Price Matching (no date). Amazon - price matching. <uri>https://www.amazon.com/gp/help/customer/display.html?nodeId=G9EAYKPV5YYDB8P7</uri>. Accessed: 25 August 2021.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_003">
<mixed-citation publication-type="chapter"> <string-name><surname>Andonova</surname> <given-names>S</given-names></string-name>, <string-name><surname>Elisseeff</surname> <given-names>A</given-names></string-name>, <string-name><surname>Evgeniou</surname> <given-names>T</given-names></string-name>, <string-name><surname>Pontil</surname> <given-names>M</given-names></string-name> (<year>2002</year>). <chapter-title>A simple algorithm for learning stable machines</chapter-title>. In: <source><italic>ECAI</italic></source>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Breiman</surname> <given-names>L</given-names></string-name> (<year>2001</year>). <article-title>Random forests</article-title>. <source><italic>Machine Learning</italic></source>, <volume>45</volume>: <fpage>5</fpage>–<lpage>32</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Chen</surname> <given-names>KH</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>WL</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>SM</given-names></string-name> (<year>2022</year>). <article-title>Competition between the black-winged kite and Eurasian kestrel led to population turnover at a subtropical sympatric site</article-title>. <source><italic>Journal of Avian Biology</italic></source>, <volume>10</volume>: <fpage>e03040</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_006">
<mixed-citation publication-type="chapter"> <string-name><surname>Chen</surname> <given-names>L</given-names></string-name>, <string-name><surname>Mislove</surname> <given-names>A</given-names></string-name>, <string-name><surname>Wilson</surname> <given-names>C</given-names></string-name> (<year>2016</year>). <chapter-title>An empirical analysis of algorithmic pricing on Amazon marketplace</chapter-title>. In: <source><italic>Proceedings of the 25th International Conference on World Wide Web</italic></source>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Coleman</surname> <given-names>T</given-names></string-name>, <string-name><surname>Mentch</surname> <given-names>L</given-names></string-name>, <string-name><surname>Fink</surname> <given-names>D</given-names></string-name>, <string-name><surname>Sorte</surname> <given-names>F</given-names></string-name>, <string-name><surname>Hooker</surname> <given-names>G</given-names></string-name>, <string-name><surname>Hochachka</surname> <given-names>W</given-names></string-name>, <etal>et al.</etal> (<year>2020</year>). <article-title>Statistical inference on tree swallow migrations with random forests</article-title>. <source><italic>Journal of the Royal Statistical Society. Series C. Applied Statistics</italic></source>, <volume>69</volume>(<issue>4</issue>): <fpage>973</fpage>–<lpage>989</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Covert</surname> <given-names>I</given-names></string-name>, <string-name><surname>Lundberg</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>SI</given-names></string-name> (<year>2020</year>). <article-title>Understanding global feature contributions with additive importance measures</article-title>. <source><italic>Advances in Neural Information Processing Systems</italic></source>, <volume>33</volume>: <fpage>17212</fpage>–<lpage>17223</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Danaher</surname> <given-names>PJ</given-names></string-name> (<year>2007</year>). <article-title>Modeling page views across multiple websites with an application to Internet reach and frequency prediction</article-title>. <source><italic>Marketing Science</italic></source>, <volume>26</volume>(<issue>3</issue>): <fpage>422</fpage>–<lpage>437</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Danaher</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Smith</surname> <given-names>MS</given-names></string-name> (<year>2011</year>). <article-title>Modeling multivariate distributions using copulas: Applications in marketing</article-title>. <source><italic>Marketing Science</italic></source>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>De’Ath</surname> <given-names>G</given-names></string-name> (<year>2002</year>). <article-title>Multivariate regression trees: A new technique for modeling species–environment relationships</article-title>. <source><italic>Ecology</italic></source>, <volume>83</volume>(<issue>4</issue>): <fpage>1105</fpage>–<lpage>1117</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Efron</surname> <given-names>B</given-names></string-name> (<year>2014</year>). <article-title>Estimation and accuracy after model selection</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>109</volume>(<issue>507</issue>): <fpage>991</fpage>–<lpage>1007</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Fink</surname> <given-names>D</given-names></string-name>, <string-name><surname>Auer</surname> <given-names>T</given-names></string-name>, <string-name><surname>Johnston</surname> <given-names>A</given-names></string-name>, <string-name><surname>Ruiz-Gutierrez</surname> <given-names>V</given-names></string-name>, <string-name><surname>Hochachka</surname> <given-names>WM</given-names></string-name>, <string-name><surname>Kelling</surname> <given-names>S</given-names></string-name> (<year>2020</year>). <article-title>Modeling avian full annual cycle distribution and population trends with citizen science data</article-title>. <source><italic>Ecological Applications</italic></source>, <volume>30</volume>(<issue>3</issue>): <fpage>e02056</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_014">
<mixed-citation publication-type="other"> <string-name><surname>Fink</surname> <given-names>D</given-names></string-name>, <string-name><surname>Auer</surname> <given-names>T</given-names></string-name>, <string-name><surname>Johnston</surname> <given-names>A</given-names></string-name>, <string-name><surname>Strimas-Mackey</surname> <given-names>M</given-names></string-name>, <string-name><surname>Iliff</surname> <given-names>M</given-names></string-name>, <string-name><surname>Kelling</surname> <given-names>S</given-names></string-name> (<year>2021</year>). ebird status and trends. cornell lab of ornithology, ithaca, new york.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Fink</surname> <given-names>D</given-names></string-name>, <string-name><surname>Hochachka</surname> <given-names>WM</given-names></string-name>, <string-name><surname>Zuckerberg</surname> <given-names>B</given-names></string-name>, <string-name><surname>Winkler</surname> <given-names>DW</given-names></string-name>, <string-name><surname>Shaby</surname> <given-names>B</given-names></string-name>, <string-name><surname>Munson</surname> <given-names>MA</given-names></string-name>, <etal>et al.</etal> (<year>2010</year>). <article-title>Spatiotemporal exploratory models for broad-scale survey data</article-title>. <source><italic>Ecological Applications</italic></source>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Friedman</surname> <given-names>JH</given-names></string-name> (<year>2001</year>). <article-title>Greedy function approximation: A gradient boosting machine</article-title>. <source><italic>The Annals of Statistics</italic></source>, <fpage>1189</fpage>–<lpage>1232</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_017">
<mixed-citation publication-type="chapter"> <string-name><surname>Á Gómez-Losada</surname></string-name>, <string-name><surname>Duch-Brown</surname> <given-names>N</given-names></string-name> (<year>2019</year>). <chapter-title>Competing for Amazon’s buy box: A machine-learning approach</chapter-title>. In: <source><italic>Business Information Systems Workshops: BIS 2019 International Workshops, Seville, Spain, June 26–28, 2019, Revised Papers 22</italic></source> (<string-name><given-names>W</given-names> <surname>Abramowicz</surname></string-name>, <string-name><given-names>R</given-names> <surname>Corchuelo</surname></string-name>, eds.), <fpage>445</fpage>–<lpage>456</lpage>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Guyon</surname> <given-names>I</given-names></string-name>, <string-name><surname>Weston</surname> <given-names>J</given-names></string-name>, <string-name><surname>Barnhill</surname> <given-names>S</given-names></string-name>, <string-name><surname>Vapnik</surname> <given-names>V</given-names></string-name> (<year>2002</year>). <article-title>Gene selection for cancer classification using support vector machines</article-title>. <source><italic>Machine Learning</italic></source>, <volume>46</volume>: <fpage>389</fpage>–<lpage>422</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1023/A:1012487302797" xlink:type="simple">https://doi.org/10.1023/A:1012487302797</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Hooker</surname> <given-names>G</given-names></string-name>, <string-name><surname>Mentch</surname> <given-names>L</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>S</given-names></string-name> (<year>2021</year>). <article-title>Unrestricted permutation forces extrapolation: Variable importance requires at least one more model, or there is no free variable importance</article-title>. <source><italic>Statistics and Computing</italic></source>, <volume>31</volume>: <fpage>1</fpage>–<lpage>16</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s11222-021-10057-z" xlink:type="simple">https://doi.org/10.1007/s11222-021-10057-z</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_020">
<mixed-citation publication-type="journal"> <string-name><surname>Ishwaran</surname> <given-names>H</given-names></string-name> (<year>2007</year>). <article-title>Variable importance in binary regression trees and forests</article-title>. <source><italic>Electronic Journal of Statistics</italic></source>, <fpage>519</fpage>–<lpage>537</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_021">
<mixed-citation publication-type="book"> <string-name><surname>Joe</surname> <given-names>H</given-names></string-name> (<year>1997</year>). <source><italic>Multivariate Models and Multivariate Dependence Concepts</italic></source>. <publisher-name>CRC press</publisher-name>, <publisher-loc>Florida</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Johnston</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hochachka</surname> <given-names>WM</given-names></string-name>, <string-name><surname>Strimas-Mackey</surname> <given-names>ME</given-names></string-name>, <string-name><surname>Ruiz Gutierrez</surname> <given-names>V</given-names></string-name>, <string-name><surname>Robinson</surname> <given-names>OJ</given-names></string-name>, <string-name><surname>Miller</surname> <given-names>ET</given-names></string-name>, <etal>et al.</etal> (<year>2021</year>). <article-title>Analytical guidelines to increase the value of community science data: An example using ebird data to estimate species distributions</article-title>. <source><italic>Diversity and Distributions</italic></source>, <volume>27</volume>(<issue>7</issue>): <fpage>1265</fpage>–<lpage>1277</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/ddi.13271" xlink:type="simple">https://doi.org/10.1111/ddi.13271</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Mentch</surname> <given-names>L</given-names></string-name>, <string-name><surname>Hooker</surname> <given-names>G</given-names></string-name> (<year>2016</year>). <article-title>Quantifying uncertainty in random forests via confidence intervals and hypothesis tests</article-title>. <source><italic>Journal of Machine Learning Research</italic></source>, <volume>17</volume>(<issue>1</issue>): <fpage>841</fpage>–<lpage>881</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Miller</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Lubke</surname> <given-names>GH</given-names></string-name>, <string-name><surname>McArtor</surname> <given-names>DB</given-names></string-name>, <string-name><surname>Bergeman</surname> <given-names>C</given-names></string-name> (<year>2016</year>). <article-title>Finding structure in data using multivariate tree boosting</article-title>. <source><italic>Psychological Methods</italic></source>, <volume>21</volume>(<issue>4</issue>): <fpage>583</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1037/met0000087" xlink:type="simple">https://doi.org/10.1037/met0000087</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>Ng</surname> <given-names>WH</given-names></string-name>, <string-name><surname>Fink</surname> <given-names>D</given-names></string-name>, <string-name><surname>LaSorte</surname> <given-names>FA</given-names></string-name>, <string-name><surname>Auer</surname> <given-names>T</given-names></string-name>, <string-name><surname>Hochachka</surname> <given-names>WM</given-names></string-name>, <string-name><surname>Johnston</surname> <given-names>A</given-names></string-name>, <etal>et al.</etal> (<year>2022</year>). <article-title>Continental-scale biomass redistribution by migratory birds in response to seasonal variation in productivity</article-title>. <source><italic>Global Ecology and Biogeography</italic></source>, <volume>31</volume>(<issue>4</issue>): <fpage>727</fpage>–<lpage>739</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/geb.13460" xlink:type="simple">https://doi.org/10.1111/geb.13460</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_026">
<mixed-citation publication-type="journal"> <string-name><surname>Pierdzioch</surname> <given-names>C</given-names></string-name>, <string-name><surname>Risse</surname> <given-names>M</given-names></string-name> (<year>2020</year>). <article-title>Forecasting precious metal returns with multivariate random forests</article-title>. <source><italic>Empirical Economics</italic></source>, <volume>58</volume>(<issue>3</issue>): <fpage>1167</fpage>–<lpage>1184</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s00181-018-1558-9" xlink:type="simple">https://doi.org/10.1007/s00181-018-1558-9</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Rahman</surname> <given-names>R</given-names></string-name>, <string-name><surname>Otridge</surname> <given-names>J</given-names></string-name>, <string-name><surname>Pal</surname> <given-names>R</given-names></string-name> (<year>2017</year>). <article-title>Integratedmrf: Random forest-based framework for integrating prediction from different data types</article-title>. <source><italic>Bioinformatics</italic></source>, <volume>33</volume>(<issue>9</issue>): <fpage>1407</fpage>–<lpage>1410</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/bioinformatics/btw765" xlink:type="simple">https://doi.org/10.1093/bioinformatics/btw765</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_028">
<mixed-citation publication-type="chapter"> <string-name><surname>Ribeiro</surname> <given-names>MT</given-names></string-name>, <string-name><surname>Singh</surname> <given-names>S</given-names></string-name>, <string-name><surname>Guestrin</surname> <given-names>C</given-names></string-name> (<year>2016</year>). <chapter-title>“why should I trust you?” explaining the predictions of any classifier</chapter-title>. In: <source><italic>Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</italic></source>, <fpage>1135</fpage>–<lpage>1144</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Rosenberg</surname> <given-names>KV</given-names></string-name>, <string-name><surname>Dokter</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Blancher</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Sauer</surname> <given-names>JR</given-names></string-name>, <string-name><surname>Smith</surname> <given-names>AC</given-names></string-name>, <string-name><surname>Smith</surname> <given-names>PA</given-names></string-name>, <etal>et al.</etal> (<year>2019</year>). <article-title>Decline of the North American avifauna</article-title>. <source><italic>Science</italic></source>, <volume>366</volume>(<issue>6461</issue>): <fpage>120</fpage>–<lpage>124</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1126/science.aaw1313" xlink:type="simple">https://doi.org/10.1126/science.aaw1313</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_030">
<mixed-citation publication-type="journal"> <string-name><surname>Segal</surname> <given-names>M</given-names></string-name> (<year>1992</year>). <article-title>Tree-structured methods for longitudinal data</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>87</volume>(<issue>418</issue>): <fpage>407</fpage>–<lpage>418</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_031">
<mixed-citation publication-type="journal"> <string-name><surname>Segal</surname> <given-names>M</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>Y</given-names></string-name> (<year>2011</year>). <article-title>Multivariate random forests</article-title>. <source><italic>Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery</italic></source>, <volume>1</volume>(<issue>1</issue>): <fpage>80</fpage>–<lpage>87</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_032">
<mixed-citation publication-type="other"> <string-name><surname>Sikdar</surname> <given-names>S</given-names></string-name>, <string-name><surname>Hooker</surname> <given-names>G</given-names></string-name>, <string-name><surname>Kadiyali</surname> <given-names>V</given-names></string-name> (<year>2021</year>). Multivariate random forest variable importance measures r package. <uri>https://github.com/Megatvini/VIM/</uri>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_033">
<mixed-citation publication-type="other"> <string-name><surname>Sikdar</surname> <given-names>S</given-names></string-name>, <string-name><surname>Kadiyali</surname> <given-names>V</given-names></string-name>, <string-name><surname>Hooker</surname> <given-names>G</given-names></string-name> (<year>2022</year>). Price dynamics on amazon marketplace: A multivariate random forest variable selection approach. Tuck School of Business Working Paper, (3518690).</mixed-citation>
</ref>
<ref id="j_jds1152_ref_034">
<mixed-citation publication-type="journal"> <string-name><surname>Strobl</surname> <given-names>C</given-names></string-name>, <string-name><surname>Boulesteix</surname> <given-names>AL</given-names></string-name>, <string-name><surname>Zeileis</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hothorn</surname> <given-names>T</given-names></string-name> (<year>2007</year>). <article-title>Bias in random forest variable importance measures: Illustrations, sources and a solution</article-title>. <source><italic>BMC Bioinformatics</italic></source>, <volume>8</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>21</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1186/1471-2105-8-1" xlink:type="simple">https://doi.org/10.1186/1471-2105-8-1</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_035">
<mixed-citation publication-type="journal"> <string-name><surname>Sullivan</surname> <given-names>BL</given-names></string-name>, <string-name><surname>Wood</surname> <given-names>CL</given-names></string-name>, <string-name><surname>Iliff</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Bonney</surname> <given-names>RE</given-names></string-name>, <string-name><surname>Fink</surname> <given-names>D</given-names></string-name>, <string-name><surname>Kelling</surname> <given-names>S</given-names></string-name> (<year>2009</year>). <article-title>ebird: A citizen-based bird observation network in the biological sciences</article-title>. <source><italic>Biological Conservation</italic></source>, <volume>142</volume>(<issue>10</issue>): <fpage>2282</fpage>–<lpage>2292</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.biocon.2009.05.006" xlink:type="simple">https://doi.org/10.1016/j.biocon.2009.05.006</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_036">
<mixed-citation publication-type="other"> <string-name><surname>Verdinelli</surname> <given-names>I</given-names></string-name>, <string-name><surname>Wasserman</surname> <given-names>L</given-names></string-name> (<year>2023</year>). Feature importance: A closer look at shapley values and loco. arXiv preprint: <uri>https://arxiv.org/abs/2303.05981</uri>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_037">
<mixed-citation publication-type="journal"> <string-name><surname>Wager</surname> <given-names>S</given-names></string-name>, <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name>, <string-name><surname>Efron</surname> <given-names>B</given-names></string-name> (<year>2014</year>). <article-title>Confidence intervals for random forests: The jackknife and the infinitesimal jackknife</article-title>. <source><italic>Journal of Machine Learning Research</italic></source>, <volume>15</volume>(<issue>1</issue>): <fpage>1625</fpage>–<lpage>1651</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_038">
<mixed-citation publication-type="chapter"> <string-name><surname>Zaman</surname> <given-names>F</given-names></string-name>, <string-name><surname>Hirose</surname> <given-names>H</given-names></string-name> (<year>2009</year>). <chapter-title>Effect of subsampling rate on subbagging and related ensembles of stable classifiers</chapter-title>. In: <source><italic>Pattern Recognition and Machine Intelligence: Third International Conference, PReMI 2009 New Delhi, India, December 16-20, 2009 Proceedings 3</italic></source> (<string-name><given-names>S</given-names> <surname>Chaudhury</surname></string-name>, <string-name><given-names>S</given-names> <surname>Mitra</surname></string-name>, <string-name><given-names>CA</given-names> <surname>Murthy</surname></string-name>, <string-name><given-names>PS</given-names> <surname>Sastry</surname></string-name>, <string-name><given-names>SK</given-names> <surname>Pal</surname></string-name>, eds.), <fpage>44</fpage>–<lpage>49</lpage>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1152_ref_039">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>H</given-names></string-name> (<year>1998</year>). <article-title>Classification trees for multiple binary responses</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>93</volume>(<issue>441</issue>): <fpage>180</fpage>–<lpage>193</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.1998.10474100" xlink:type="simple">https://doi.org/10.1080/01621459.1998.10474100</ext-link></mixed-citation>
</ref>
<ref id="j_jds1152_ref_040">
<mixed-citation publication-type="journal"> <string-name><surname>Zhou</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Hooker</surname> <given-names>G</given-names></string-name> (<year>2021</year>). <article-title>Unbiased measurement of feature importance in tree-based methods</article-title>. <source><italic>ACM Transactions on Knowledge Discovery from Data</italic></source>, <volume>15</volume>(<issue>2</issue>): <fpage>1</fpage>–<lpage>21</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1145/3425637" xlink:type="simple">https://doi.org/10.1145/3425637</ext-link></mixed-citation>
</ref>
</ref-list>
</back>
</article>
