<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1102</article-id>
<article-id pub-id-type="doi">10.6339/23-JDS1102</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Computing in Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Efficient Bayesian High-Dimensional Classification via Random Projection with Application to Gene Expression Data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Chakraborty</surname><given-names>Abhisek</given-names></name><email xlink:href="mailto:cabhisek@stat.tamu.edu">cabhisek@stat.tamu.edu</email><email xlink:href="mailto:zovialpapai@gmail.com">zovialpapai@gmail.com</email><xref ref-type="aff" rid="j_jds1102_aff_001">1</xref><xref ref-type="fn" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds1102_aff_001"><label>1</label>Department of Statistics, <institution>Texas A&amp;M University</institution>, 3143 TAMU, College Station, TX 77843, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Email: <ext-link ext-link-type="uri" xlink:href="mailto:cabhisek@stat.tamu.edu">cabhisek@stat.tamu.edu</ext-link> or <ext-link ext-link-type="uri" xlink:href="mailto:zovialpapai@gmail.com">zovialpapai@gmail.com</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2024</year></pub-date><pub-date pub-type="epub"><day>12</day><month>6</month><year>2023</year></pub-date><volume>22</volume><issue>1</issue><fpage>152</fpage><lpage>172</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1102_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>Software implementation of the methodologies developed in the article is available for use at <ext-link ext-link-type="uri" xlink:href="https://github.com/zovialpapai/Bayesian-classification-with-random-projection">zovialpapai/Bayesian-classification-with-random-projection</ext-link>. Here, we present a short description about the directories in the repository, as follows: <bold>(1) functions:</bold> The directory contains utility functions in two R scripts, that are utilised in the repeated simulations and real data analysis conducted in the paper. (a) “BCC_Functions.R” contains functions for compression matrix generation; Probit regression via Albert &amp; Chib and Holmes &amp; Held data augmentation schemes; Logit regression via Polya-Gamma data augmentation scheme; hyper-parameter tuning; and associated helper functions. (b) Probit_HH_cpp.R contains Probit regression via Holmes &amp; Held data augmentation scheme, written in Rcpp. <bold>(2) repeated simulations</bold>: The directory contains three R scripts, named BCC_sims.R, Weakleaners.R, and time_comparison.R. (a) BCC_sims.R can be utilised to carry out the simulations presented in Section 3 on High-dimensional Probit regression, and Section 5 on High-dimensional Logit regression, along with hyper-parameter tuning. (b) Weakleaners.R can be utilized to study the effect of number of replications of compression matrix (or number of weak classifiers) on the accuracy of classifiers AC, AC+, HH, HH+. The results are presented in Section 3. (c) time_comparison.R can be utilised to study comparative computional time of our classifiers. The results are presented in Section 3. <bold>(3) data</bold>: Micro-array gene expression cancer data sets utilized in the article is freely available on the website <ext-link ext-link-type="uri" xlink:href="https://data.mendeley.com/">data.mendeley.com</ext-link>. Copies of the data sets are available in the data directory in the our repository. <bold>(4) real data analysis</bold>: The directory contains the a R script named BCC_data.R that can be utilised to carry out the analysis of micro-array gene expression cancer data sets (Leukemia, Lung Cancer, Prostate cancer), presented in Section 4 of the paper.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>10</day><month>1</month><year>2023</year></date><date date-type="accepted"><day>26</day><month>4</month><year>2023</year></date></history>
<permissions><copyright-statement>2024 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2024</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Inspired by the impressive successes of compress sensing-based machine learning algorithms, data augmentation-based efficient Gibbs samplers for Bayesian high-dimensional classification models are developed by compressing the design matrix to a much lower dimension. Ardent care is exercised in the choice of the projection mechanism, and an adaptive voting rule is employed to reduce sensitivity to the random projection matrix. Focusing on the high-dimensional Probit regression model, we note that the naive implementation of the data augmentation-based Gibbs sampler is not robust to the presence of co-linearity in the design matrix – a setup ubiquitous in <inline-formula id="j_jds1102_ineq_001"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi></mml:math><tex-math><![CDATA[$n\lt p$]]></tex-math></alternatives></inline-formula> problems. We demonstrate that a simple fix based on joint updates of parameters in the latent space circumnavigates this issue. With a computationally efficient MCMC scheme in place, we introduce an ensemble classifier by creating <italic>R</italic> (<inline-formula id="j_jds1102_ineq_002"><alternatives><mml:math>
<mml:mo stretchy="false">∼</mml:mo>
<mml:mn>25</mml:mn></mml:math><tex-math><![CDATA[$\sim 25$]]></tex-math></alternatives></inline-formula>–50) projected copies of the design matrix, and subsequently running <italic>R</italic> classification models with the <italic>R</italic> projected design matrix in parallel. We combine the output from the <italic>R</italic> replications via an adaptive voting scheme. Our scheme is inherently parallelizable and capable of taking advantage of modern computing environments often equipped with multiple cores. The empirical success of our methodology is illustrated in elaborate simulations and gene expression data applications. We also extend our methodology to a high-dimensional logistic regression model and carry out numerical studies to showcase its efficacy.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>collapsed Gibbs sampler</kwd>
<kwd>data augmentation</kwd>
<kwd>dimensionality reduction</kwd>
<kwd>ensemble learning</kwd>
<kwd>parallel processing</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1102_reflist_001">
<title>References</title>
<ref id="j_jds1102_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Achlioptas</surname> <given-names>D</given-names></string-name> (<year>2003</year>). <article-title>Database-friendly random projections: Johnson-lindenstrauss with binary coins</article-title>. <source><italic>Journal of Computer and System Sciences</italic></source>, <volume>66</volume>(<issue>4</issue>): <fpage>671</fpage>–<lpage>687</lpage>. <comment>Special Issue on PODS 2001</comment>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/S0022-0000(03)00025-4" xlink:type="simple">https://doi.org/10.1016/S0022-0000(03)00025-4</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Adragni</surname> <given-names>KP</given-names></string-name>, <string-name><surname>Cook</surname> <given-names>RD</given-names></string-name> (<year>2014</year>). <article-title>Sufficient dimension reduction and prediction in regression</article-title>. <source><italic>Philosophical Transactions of Royal Society A</italic></source>, <volume>367</volume>: <fpage>1</fpage>–<lpage>21</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Albert</surname> <given-names>JH</given-names></string-name>, <string-name><surname>Chib</surname> <given-names>S</given-names></string-name> (<year>1993</year>). <article-title>Bayesian analysis of binary and polychotomous response data</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>88</volume>(<issue>422</issue>): <fpage>669</fpage>–<lpage>679</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.1993.10476321" xlink:type="simple">https://doi.org/10.1080/01621459.1993.10476321</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Armagan</surname> <given-names>A</given-names></string-name>, <string-name><surname>Dunson</surname> <given-names>D</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>J</given-names></string-name> (<year>2013</year>). <article-title>Generalized double pareto shrinkage</article-title>. <source><italic>Statistica Sinica</italic></source>, <volume>23</volume>(<issue>1</issue>): <fpage>119</fpage>–<lpage>143</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_005">
<mixed-citation publication-type="book"> <string-name><surname>Banerjee</surname> <given-names>S</given-names></string-name>, <string-name><surname>Roy</surname> <given-names>A</given-names></string-name> (<year>2014</year>). <source><italic>Linear Algebra and Matrix Analysis for Statistics</italic></source>. <publisher-name>Chapman and Hall/CRC</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Bhadra</surname> <given-names>A</given-names></string-name>, <string-name><surname>Datta</surname> <given-names>J</given-names></string-name>, <string-name><surname>Polson</surname> <given-names>NG</given-names></string-name>, <string-name><surname>Willard</surname> <given-names>B</given-names></string-name> (<year>2017</year>). <article-title>The horseshoe+ estimator of ultra-sparse signals</article-title>. <source><italic>Bayesian Analysis</italic></source>, <volume>12</volume>(<issue>4</issue>): <fpage>1105</fpage>–<lpage>1131</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/16-BA1028" xlink:type="simple">https://doi.org/10.1214/16-BA1028</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Bhattacharya</surname> <given-names>A</given-names></string-name>, <string-name><surname>Chakraborty</surname> <given-names>A</given-names></string-name>, <string-name><surname>Mallick</surname> <given-names>BK</given-names></string-name> (<year>2016</year>). <article-title>Fast sampling with Gaussian scale mixture priors in high-dimensional regression</article-title>. <source><italic>Biometrika</italic></source>, <volume>103</volume>(<issue>4</issue>): <fpage>985</fpage>–<lpage>991</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/asw042" xlink:type="simple">https://doi.org/10.1093/biomet/asw042</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Bhattacharya</surname> <given-names>A</given-names></string-name>, <string-name><surname>Pati</surname> <given-names>D</given-names></string-name>, <string-name><surname>Pillai</surname> <given-names>NS</given-names></string-name>, <string-name><surname>Dunson</surname> <given-names>DB</given-names></string-name> (<year>2015</year>). <article-title>Dirichlet–laplace priors for optimal shrinkage</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>110</volume>(<issue>512</issue>): <fpage>1479</fpage>–<lpage>1490</lpage> <comment>PMID: 27019543</comment>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.2014.960967" xlink:type="simple">https://doi.org/10.1080/01621459.2014.960967</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_009">
<mixed-citation publication-type="chapter"> <string-name><surname>Biswas</surname> <given-names>N</given-names></string-name>, <string-name><surname>Mackey</surname> <given-names>L</given-names></string-name>, <string-name><surname>Meng</surname> <given-names>XL</given-names></string-name> (<year>2022</year>). <chapter-title>Scalable spike-and-slab</chapter-title>. In: <source><italic>Proceedings of the 39th International Conference on Machine Learning</italic></source> (<string-name><given-names>K</given-names> <surname>Chaudhuri</surname></string-name>, <string-name><given-names>S</given-names> <surname>Jegelka</surname></string-name>, <string-name><given-names>L</given-names> <surname>Song</surname></string-name>, <string-name><given-names>C</given-names> <surname>Szepesvari</surname></string-name>, <string-name><given-names>G</given-names> <surname>Niu</surname></string-name>, <string-name><given-names>S</given-names> <surname>Sabato</surname></string-name>, eds.), volume <volume>162</volume> of <series><italic>Proceedings of Machine Learning Research</italic></series>, <fpage>2021</fpage>–<lpage>2040</lpage>. <publisher-name>PMLR</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Brown</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Griffin</surname> <given-names>JE</given-names></string-name> (<year>2010</year>). <article-title>Inference with normal-gamma prior distributions in regression problems</article-title>. <source><italic>Bayesian Analysis</italic></source>, <volume>5</volume>(<issue>1</issue>): <fpage>171</fpage>–<lpage>188</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/10-BA507" xlink:type="simple">https://doi.org/10.1214/10-BA507</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Candes</surname> <given-names>EJ</given-names></string-name>, <string-name><surname>Romberg</surname> <given-names>JK</given-names></string-name>, <string-name><surname>Tao</surname> <given-names>T</given-names></string-name> (<year>2006</year>). <article-title>Stable signal recovery from incomplete and inaccurate measurements</article-title>. <source><italic>Communications on Pure and Applied Mathematics: A Journal Issued by the Courant Institute of Mathematical Sciences</italic></source>, <volume>59</volume>(<issue>8</issue>): <fpage>1207</fpage>–<lpage>1223</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1002/cpa.20124" xlink:type="simple">https://doi.org/10.1002/cpa.20124</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Cannings</surname> <given-names>TI</given-names></string-name>, <string-name><surname>Samworth</surname> <given-names>RJ</given-names></string-name> (<year>2017</year>). <article-title>Random-projection ensemble classification</article-title>. <source><italic>Journal of the Royal Statistical Society Series B</italic></source>, <volume>79</volume>(<issue>4</issue>): <fpage>959</fpage>–<lpage>1035</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/rssb.12228" xlink:type="simple">https://doi.org/10.1111/rssb.12228</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Cao</surname> <given-names>J</given-names></string-name>, <string-name><surname>Durante</surname> <given-names>D</given-names></string-name>, <string-name><surname>Genton</surname> <given-names>MG</given-names></string-name> (<year>2022</year>). <article-title>Scalable computation of predictive probabilities in probit models with Gaussian process priors</article-title>. <source><italic>Journal of Computational and Graphical Statistics</italic></source>, <volume>31</volume>(<issue>3</issue>): <fpage>709</fpage>–<lpage>720</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/10618600.2022.2036614" xlink:type="simple">https://doi.org/10.1080/10618600.2022.2036614</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_014">
<mixed-citation publication-type="chapter"> <string-name><surname>Carvalho</surname> <given-names>CM</given-names></string-name>, <string-name><surname>Polson</surname> <given-names>NG</given-names></string-name>, <string-name><surname>Scott</surname> <given-names>JG</given-names></string-name> (<year>2009</year>). <chapter-title>Handling sparsity via the horseshoe</chapter-title>. In: <source><italic>Proceedings of the Twelth International Conference on Artificial Intelligence and Statistics</italic></source> (<string-name><given-names>D</given-names> <surname>van Dyk</surname></string-name>, <string-name><given-names>M</given-names> <surname>Welling</surname></string-name>, eds.), volume <volume>5</volume> of <series><italic>Proceedings of Machine Learning Research</italic></series>, <fpage>73</fpage>–<lpage>80</lpage>. <publisher-name>PMLR, Hilton</publisher-name>, <publisher-loc>Clearwater Beach Resort, Clearwater Beach, Florida USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Carvalho</surname> <given-names>CM</given-names></string-name>, <string-name><surname>Polson</surname> <given-names>NG</given-names></string-name>, <string-name><surname>Scott</surname> <given-names>JG</given-names></string-name> (<year>2010</year>). <article-title>The horseshoe estimator for sparse signals</article-title>. <source><italic>Biometrika</italic></source>, <volume>97</volume>(<issue>2</issue>): <fpage>465</fpage>–<lpage>480</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/asq017" xlink:type="simple">https://doi.org/10.1093/biomet/asq017</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_016">
<mixed-citation publication-type="chapter"> <string-name><surname>Chipman</surname> <given-names>H</given-names></string-name>, <string-name><surname>George</surname> <given-names>E</given-names></string-name>, <string-name><surname>Mcculloch</surname> <given-names>R</given-names></string-name> (<year>2006</year>). <chapter-title>Bayesian ensemble learning</chapter-title>. In: <source><italic>Advances in Neural Information Processing Systems</italic></source> (<string-name><given-names>B</given-names> <surname>Schölkopf</surname></string-name>, <string-name><given-names>J</given-names> <surname>Platt</surname></string-name>, <string-name><given-names>T</given-names> <surname>Hoffman</surname></string-name>, eds.), volume <volume>19</volume>, <fpage>1</fpage>–<lpage>8</lpage>. <publisher-name>MIT Press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Chipman</surname> <given-names>HA</given-names></string-name>, <string-name><surname>George</surname> <given-names>EI</given-names></string-name>, <string-name><surname>McCulloch</surname> <given-names>RE</given-names></string-name> (<year>1998</year>). <article-title>Bayesian cart model search</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>93</volume>(<issue>443</issue>): <fpage>935</fpage>–<lpage>948</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.1998.10473750" xlink:type="simple">https://doi.org/10.1080/01621459.1998.10473750</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_018">
<mixed-citation publication-type="chapter"> <string-name><surname>Clyde</surname> <given-names>M</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>H</given-names></string-name> (<year>2001</year>). <chapter-title>Bagging and the bayesian bootstrap</chapter-title>. In: <source><italic>Proceedings of the Eighth International Workshop on Artificial Intelligence and Statistics</italic></source> (<string-name><given-names>TS</given-names> <surname>Richardson</surname></string-name>, <string-name><given-names>TS</given-names> <surname>Jaakkola</surname></string-name>, eds.), volume <volume>R3</volume> of <series><italic>Proceedings of Machine Learning Research</italic></series>, <fpage>57</fpage>–<lpage>62</lpage>. <comment>PMLR. Reissued by PMLR on 31 March 2021</comment>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_019">
<mixed-citation publication-type="other"> <string-name><surname>Corrêa</surname> <given-names>RF</given-names></string-name>, <string-name><surname>Ludermir</surname> <given-names>TB</given-names></string-name> (2007). Dimensionality reduction of very large document collections by semantic mapping. In: Proceedings of the 6th International Workshop on Self-Organizing Maps. volume 6. 1–6.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_020">
<mixed-citation publication-type="book"> <string-name><surname>Cox</surname> <given-names>T</given-names></string-name>, <string-name><surname>Cox</surname> <given-names>M</given-names></string-name> (<year>2001</year>). <source><italic>Multidimensional Scaling</italic></source>. <publisher-name>Chapman and Hall/CRC</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_021">
<mixed-citation publication-type="other"> <string-name><surname>Dasgupta</surname> <given-names>S</given-names></string-name> (2013). Experiments with random projection. arXiv preprint: <uri>https://arxiv.org/abs/1301.3849</uri>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Donoho</surname> <given-names>D</given-names></string-name> (<year>2006</year>). <article-title>Compressed sensing</article-title>. <source><italic>IEEE Transactions on Information Theory</italic></source>, <volume>52</volume>(<issue>4</issue>): <fpage>1289</fpage>–<lpage>1306</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1109/TIT.2006.871582" xlink:type="simple">https://doi.org/10.1109/TIT.2006.871582</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_023">
<mixed-citation publication-type="other"> <string-name><surname>DuMouchel</surname> <given-names>W</given-names></string-name> (2002). Data Squashing: Constructing Summary Data Sets. 579–591. Springer US, Boston, MA.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Faes</surname> <given-names>C</given-names></string-name>, <string-name><surname>Ormerod</surname> <given-names>JT</given-names></string-name>, <string-name><surname>Wand</surname> <given-names>MP</given-names></string-name> (<year>2011</year>). <article-title>Variational bayesian inference for parametric and nonparametric regression with missing data</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>106</volume>(<issue>495</issue>): <fpage>959</fpage>–<lpage>971</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/jasa.2011.tm10301" xlink:type="simple">https://doi.org/10.1198/jasa.2011.tm10301</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>George</surname> <given-names>EI</given-names></string-name>, <string-name><surname>McCulloch</surname> <given-names>RE</given-names></string-name> (<year>1993</year>). <article-title>Variable selection via Gibbs sampling</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>88</volume>(<issue>423</issue>): <fpage>881</fpage>–<lpage>889</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.1993.10476353" xlink:type="simple">https://doi.org/10.1080/01621459.1993.10476353</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_026">
<mixed-citation publication-type="journal"> <string-name><surname>Girolami</surname> <given-names>M</given-names></string-name>, <string-name><surname>Rogers</surname> <given-names>S</given-names></string-name> (<year>2006</year>). <article-title>Variational Bayesian multinomial probit regression with gaussian process priors</article-title>. <source><italic>Neural Computation</italic></source>, <volume>18</volume>(<issue>8</issue>): <fpage>1790</fpage>–<lpage>1817</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1162/neco.2006.18.8.1790" xlink:type="simple">https://doi.org/10.1162/neco.2006.18.8.1790</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Golub</surname> <given-names>TR</given-names></string-name>, <string-name><surname>Slonim</surname> <given-names>DK</given-names></string-name>, <string-name><surname>Tamayo</surname> <given-names>P</given-names></string-name>, <string-name><surname>Huard</surname> <given-names>C</given-names></string-name>, <string-name><surname>Gaasenbeek</surname> <given-names>M</given-names></string-name>, <string-name><surname>Mesirov</surname> <given-names>JP</given-names></string-name>, <etal>et al.</etal> (<year>1999</year>). <article-title>Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring</article-title>. <source><italic>Science</italic></source>, <volume>286</volume>: <fpage>531</fpage>–<lpage>537</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1126/science.286.5439.531" xlink:type="simple">https://doi.org/10.1126/science.286.5439.531</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_028">
<mixed-citation publication-type="journal"> <string-name><surname>Gordon</surname> <given-names>M</given-names></string-name>, <string-name><surname>Beiser</surname> <given-names>J</given-names></string-name>, <string-name><surname>Brandt</surname> <given-names>J</given-names></string-name>, <etal>et al.</etal> (<year>2002</year>). <article-title>The ocular hypertension treatment study: Baseline factors that predict the onset of primary open-angle glaucoma</article-title>. <source><italic>Archives of Ophthalmology</italic></source>, <volume>120</volume>: <fpage>714</fpage>–<lpage>34</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1001/archopht.120.6.714" xlink:type="simple">https://doi.org/10.1001/archopht.120.6.714</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Guhaniyogi</surname> <given-names>R</given-names></string-name>, <string-name><surname>Dunson</surname> <given-names>DB</given-names></string-name> (<year>2015</year>). <article-title>Bayesian compressed regression</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>110</volume>(<issue>512</issue>): <fpage>1500</fpage>–<lpage>1514</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.2014.969425" xlink:type="simple">https://doi.org/10.1080/01621459.2014.969425</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_030">
<mixed-citation publication-type="journal"> <string-name><surname>Hans</surname> <given-names>C</given-names></string-name> (<year>2009</year>). <article-title>Bayesian lasso regression</article-title>. <source><italic>Biometrika</italic></source>, <volume>96</volume>(<issue>4</issue>): <fpage>835</fpage>–<lpage>845</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/asp047" xlink:type="simple">https://doi.org/10.1093/biomet/asp047</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_031">
<mixed-citation publication-type="journal"> <string-name><surname>Held</surname> <given-names>L</given-names></string-name>, <string-name><surname>Holmes</surname> <given-names>CC</given-names></string-name> (<year>2006</year>). <article-title>Bayesian auxiliary variable models for binary and multinomial regression</article-title>. <source><italic>Bayesian Analysis</italic></source>, <volume>1</volume>(<issue>1</issue>): <fpage>145</fpage>–<lpage>168</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/06-BA105" xlink:type="simple">https://doi.org/10.1214/06-BA105</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_032">
<mixed-citation publication-type="chapter"> <string-name><surname>Hinton</surname> <given-names>GE</given-names></string-name>, <string-name><surname>Roweis</surname> <given-names>S</given-names></string-name> (<year>2002</year>). <chapter-title>Stochastic neighbor embedding</chapter-title>. In: <source><italic>Advances in Neural Information Processing Systems</italic></source> (<string-name><given-names>S</given-names> <surname>Becker</surname></string-name>, <string-name><given-names>S</given-names> <surname>Thrun</surname></string-name>, <string-name><given-names>K</given-names> <surname>Obermayer</surname></string-name>, eds.), volume <volume>15</volume>. <publisher-name>MIT Press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_033">
<mixed-citation publication-type="journal"> <string-name><surname>Hoeting</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Madigan</surname> <given-names>D</given-names></string-name>, <string-name><surname>Raftery</surname> <given-names>AE</given-names></string-name>, <string-name><surname>Volinsky</surname> <given-names>CT</given-names></string-name> (<year>1999</year>). <article-title>Bayesian model averaging: A tutorial</article-title>. <source><italic>Statistical Science</italic></source>, <volume>14</volume>(<issue>4</issue>): <fpage>382</fpage>–<lpage>401</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/ss/1009212519" xlink:type="simple">https://doi.org/10.1214/ss/1009212519</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_034">
<mixed-citation publication-type="journal"> <string-name><surname>Hotelling</surname> <given-names>H</given-names></string-name> (<year>1933</year>). <article-title>Analysis of a complex of statistical variables into principal components</article-title>. <source><italic>Journal of Educational Psychology</italic></source>, <volume>6</volume>: <fpage>417</fpage>–<lpage>441</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1037/h0071325" xlink:type="simple">https://doi.org/10.1037/h0071325</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_035">
<mixed-citation publication-type="journal"> <string-name><surname>Johnson</surname> <given-names>WB</given-names></string-name>, <string-name><surname>Lindenstraus</surname> <given-names>J</given-names></string-name> (<year>1984</year>). <article-title>Extensions of lipschitz mappings into hilbert space</article-title>. <source><italic>Contemporary Mathematics</italic></source>, <volume>26</volume>: <fpage>189</fpage>–<lpage>206</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1090/conm/026/737400" xlink:type="simple">https://doi.org/10.1090/conm/026/737400</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_036">
<mixed-citation publication-type="journal"> <string-name><surname>Jolliffe</surname> <given-names>I</given-names></string-name>, <string-name><surname>Cadima</surname> <given-names>J</given-names></string-name> (<year>2016</year>). <article-title>Principal component analysis: A review and recent developments</article-title>. <source><italic>Philosophical Transactions of the Royal Society A</italic></source>, <volume>374</volume>: <fpage>1</fpage>–<lpage>16</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_037">
<mixed-citation publication-type="chapter"> <string-name><surname>Kim</surname> <given-names>HC</given-names></string-name>, <string-name><surname>Ghahramani</surname> <given-names>Z</given-names></string-name> (<year>2012</year>). <chapter-title>Bayesian classifier combination</chapter-title>. In: <source><italic>Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics</italic></source> (<string-name><given-names>ND</given-names> <surname>Lawrence</surname></string-name>, <string-name><given-names>M</given-names> <surname>Girolami</surname></string-name>, eds.), volume <volume>22</volume> of <series><italic>Proceedings of Machine Learning Research</italic></series>, <fpage>619</fpage>–<lpage>627</lpage>. <publisher-name>PMLR, La</publisher-name>, <publisher-loc>Palma, Canary Islands</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_038">
<mixed-citation publication-type="journal"> <string-name><surname>Lee</surname> <given-names>HKH</given-names></string-name>, <string-name><surname>Taddy</surname> <given-names>M</given-names></string-name>, <string-name><surname>Gray</surname> <given-names>GA</given-names></string-name> (<year>2010</year>). <article-title>Selection of a representative sample</article-title>. <source><italic>Journal of Classification</italic></source>, <volume>27</volume>: <fpage>41</fpage>–<lpage>53</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s00357-010-9044-x" xlink:type="simple">https://doi.org/10.1007/s00357-010-9044-x</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_039">
<mixed-citation publication-type="other"> <string-name><surname>Li</surname> <given-names>G</given-names></string-name>, <string-name><surname>Japkowicz</surname> <given-names>N</given-names></string-name>, <string-name><surname>Stocki</surname> <given-names>TJ</given-names></string-name>, <string-name><surname>Ungar</surname> <given-names>RK</given-names></string-name> (2010). Cascading Customized Naïve Bayes Couple. 147–160. Springer, Berlin Heidelberg, Berlin, Heidelberg.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_040">
<mixed-citation publication-type="other"> <string-name><surname>Li</surname> <given-names>P</given-names></string-name>, <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name>, <string-name><surname>Church</surname> <given-names>K</given-names></string-name> (2006a). Improving random projections using marginal information. In: Conference on Learning Theory. 635–649. 2006.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_041">
<mixed-citation publication-type="other"> <string-name><surname>Li</surname> <given-names>P</given-names></string-name>, <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name>, <string-name><surname>Church</surname> <given-names>K</given-names></string-name> (2006b). Very sparse random projections. In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. 287–296. 2006.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_042">
<mixed-citation publication-type="journal"> <string-name><surname>Loaiza-Maya</surname> <given-names>R</given-names></string-name>, <string-name><surname>Nibbering</surname> <given-names>D</given-names></string-name> (<year>2022</year>). <article-title>Fast variational bayes methods for multinomial probit models</article-title>. <source><italic>Journal of Business &amp; Economic Statistics</italic></source>, <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/07350015.2022.2139267" xlink:type="simple">https://doi.org/10.1080/07350015.2022.2139267</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_043">
<mixed-citation publication-type="other"> <string-name><surname>Lorbert</surname> <given-names>A</given-names></string-name>, <string-name><surname>Blei</surname> <given-names>DM</given-names></string-name>, <string-name><surname>Schapire</surname> <given-names>RE</given-names></string-name>, <string-name><surname>Ramadge</surname> <given-names>PJ</given-names></string-name> (2012). A bayesian boosting model. arXiv preprint: <uri>https://arxiv.org/abs/1209.1996</uri>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_044">
<mixed-citation publication-type="journal"> <string-name><surname>Madigan</surname></string-name> (<year>2004</year>). <article-title>Likelihood-based data squashing: A modeling approach to instance construction</article-title>. <source><italic>Data Mining and Knowledge Discovery</italic></source>, <volume>6</volume>: <fpage>173</fpage>–<lpage>190</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1023/A:1014095614948" xlink:type="simple">https://doi.org/10.1023/A:1014095614948</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_045">
<mixed-citation publication-type="chapter"> <string-name><surname>Mika</surname> <given-names>S</given-names></string-name>, <string-name><surname>Schölkopf</surname> <given-names>B</given-names></string-name>, <string-name><surname>Smola</surname> <given-names>A</given-names></string-name>, <string-name><surname>Müller</surname> <given-names>KR</given-names></string-name>, <string-name><surname>Scholz</surname> <given-names>M</given-names></string-name>, <string-name><surname>Rätsch</surname> <given-names>G</given-names></string-name> (<year>1998</year>). <chapter-title>Kernel pca and de-noising in feature spaces</chapter-title>. In: <source><italic>Advances in Neural Information Processing Systems</italic></source> (<string-name><given-names>M</given-names> <surname>Kearns</surname></string-name>, <string-name><given-names>S</given-names> <surname>Solla</surname></string-name>, <string-name><given-names>D</given-names> <surname>Cohn</surname></string-name>, eds.), volume <volume>1</volume>, <fpage>8</fpage>. <publisher-name>MIT Press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_046">
<mixed-citation publication-type="journal"> <string-name><surname>Mitchell</surname> <given-names>TJ</given-names></string-name>, <string-name><surname>Beauchamp</surname> <given-names>JJ</given-names></string-name> (<year>1988</year>). <article-title>Bayesian variable selection in linear regression</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>83</volume>(<issue>404</issue>): <fpage>1023</fpage>–<lpage>1032</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.1988.10478694" xlink:type="simple">https://doi.org/10.1080/01621459.1988.10478694</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_047">
<mixed-citation publication-type="other"> <string-name><surname>Mukherjee</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sen</surname> <given-names>S</given-names></string-name> (2021). Variational inference in high-dimensional linear regression. arXiv preprint: <uri>https://arxiv.org/abs/2104.12232</uri>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_048">
<mixed-citation publication-type="journal"> <string-name><surname>Owen</surname> <given-names>A</given-names></string-name> (<year>2003</year>). <article-title>Data squashing empirical likelihood</article-title>. <source><italic>Data Mining and Knowledge Discovery</italic></source>, <volume>7</volume>: <fpage>101</fpage>–<lpage>113</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1023/A:1021568920107" xlink:type="simple">https://doi.org/10.1023/A:1021568920107</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_049">
<mixed-citation publication-type="journal"> <string-name><surname>Park</surname> <given-names>T</given-names></string-name>, <string-name><surname>Casella</surname> <given-names>G</given-names></string-name> (<year>2008</year>). <article-title>The bayesian lasso</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>103</volume>(<issue>482</issue>): <fpage>681</fpage>–<lpage>686</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/016214508000000337" xlink:type="simple">https://doi.org/10.1198/016214508000000337</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_050">
<mixed-citation publication-type="journal"> <string-name><surname>Piironen</surname> <given-names>J</given-names></string-name>, <string-name><surname>Vehtari</surname> <given-names>A</given-names></string-name> (<year>2017</year>). <article-title>Sparsity information and regularization in the horseshoe and other shrinkage priors</article-title>. <source><italic>Electronic Journal of Statistics</italic></source>, <volume>11</volume>(<issue>2</issue>): <fpage>5018</fpage>–<lpage>5051</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/17-EJS1337SI" xlink:type="simple">https://doi.org/10.1214/17-EJS1337SI</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_051">
<mixed-citation publication-type="book"> <string-name><surname>Polson</surname> <given-names>NG</given-names></string-name>, <string-name><surname>Scott</surname> <given-names>JG</given-names></string-name> (<year>2011</year>). <source><italic>Shrink Globally, Act Locally: Sparse Bayesian Regularization and Prediction</italic></source>. <publisher-name>Oxford University Press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_052">
<mixed-citation publication-type="journal"> <string-name><surname>Polson</surname> <given-names>NG</given-names></string-name>, <string-name><surname>Scott</surname> <given-names>JG</given-names></string-name>, <string-name><surname>Windle</surname> <given-names>J</given-names></string-name> (<year>2013</year>). <article-title>Bayesian inference for logistic models using pólya–gamma latent variables</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>108</volume>(<issue>504</issue>): <fpage>1339</fpage>–<lpage>1349</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.2013.829001" xlink:type="simple">https://doi.org/10.1080/01621459.2013.829001</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_053">
<mixed-citation publication-type="journal"> <string-name><surname>Roweis</surname> <given-names>ST</given-names></string-name>, <string-name><surname>Saul</surname> <given-names>LK</given-names></string-name> (<year>2000</year>). <article-title>Nonlinear dimensionality reduction by locally linear embedding</article-title>. <source><italic>Science</italic></source>, <volume>290</volume>: <fpage>2323</fpage>–<lpage>2326</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1126/science.290.5500.2323" xlink:type="simple">https://doi.org/10.1126/science.290.5500.2323</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_054">
<mixed-citation publication-type="journal"> <string-name><surname>Shin</surname> <given-names>M</given-names></string-name>, <string-name><surname>Bhattacharya</surname> <given-names>A</given-names></string-name>, <string-name><surname>Johnson</surname> <given-names>VE</given-names></string-name> (<year>2015</year>). <article-title>Scalable bayesian variable selection using nonlocal prior densities in ultrahigh-dimensional settings</article-title>. <source><italic>Statistica Sinica</italic></source>, <volume>28</volume>: <fpage>1053</fpage>–<lpage>1078</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_055">
<mixed-citation publication-type="journal"> <string-name><surname>Singh</surname> <given-names>D</given-names></string-name>, <string-name><surname>Febbo</surname> <given-names>P</given-names></string-name>, <string-name><surname>Ross</surname> <given-names>K</given-names></string-name>, <etal>et al.</etal> (<year>2002</year>). <article-title>Gene expression correlates of clinical prostate cancer behavior</article-title>. <source><italic>Genome Biology</italic></source>, <volume>1</volume>: <fpage>203</fpage>–<lpage>212</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_056">
<mixed-citation publication-type="chapter"> <string-name><surname>Sra</surname> <given-names>S</given-names></string-name>, <string-name><surname>Dhillon</surname> <given-names>I</given-names></string-name> (<year>2005</year>). <chapter-title>Generalized nonnegative matrix approximations with bregman divergences</chapter-title>. In: <source><italic>Advances in Neural Information Processing Systems</italic></source> (<string-name><given-names>Y</given-names> <surname>Weiss</surname></string-name>, <string-name><given-names>B</given-names> <surname>Schölkopf</surname></string-name>, <string-name><given-names>J</given-names> <surname>Platt</surname></string-name>, eds.), volume <volume>18</volume>. <publisher-name>MIT Press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_057">
<mixed-citation publication-type="journal"> <string-name><surname>Tanner</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Wong</surname> <given-names>WH</given-names></string-name> (<year>1987</year>). <article-title>The calculation of posterior distributions by data augmentation</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>82</volume>(<issue>398</issue>): <fpage>528</fpage>–<lpage>540</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.1987.10478458" xlink:type="simple">https://doi.org/10.1080/01621459.1987.10478458</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_058">
<mixed-citation publication-type="journal"> <string-name><surname>Tibshirani</surname> <given-names>R</given-names></string-name> (<year>1996</year>). <article-title>Regression shrinkage and selection via the lasso</article-title>. <source><italic>Journal of the Royal Statistical Society, Series B, Methodological</italic></source>, <volume>58</volume>(<issue>1</issue>): <fpage>267</fpage>–<lpage>288</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/j.2517-6161.1996.tb02080.x" xlink:type="simple">https://doi.org/10.1111/j.2517-6161.1996.tb02080.x</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_059">
<mixed-citation publication-type="chapter"> <string-name><surname>Titsias</surname> <given-names>M</given-names></string-name>, <string-name><surname>Lawrence</surname> <given-names>ND</given-names></string-name> (<year>2010</year>). <chapter-title>Bayesian gaussian process latent variable model</chapter-title>. In: <source><italic>Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics</italic></source> (<string-name><given-names>YW</given-names> <surname>Teh</surname></string-name>, <string-name><given-names>M</given-names> <surname>Titterington</surname></string-name>, eds.), volume <volume>9</volume> of <series><italic>Proceedings of Machine Learning Research</italic></series>, <fpage>844</fpage>–<lpage>851</lpage>. <publisher-name>PMLR, Chia</publisher-name>, <publisher-loc>Laguna Resort, Sardinia, Italy</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_060">
<mixed-citation publication-type="journal"> <string-name><surname>van der Maaten</surname> <given-names>L</given-names></string-name>, <string-name><surname>Hinton</surname> <given-names>G</given-names></string-name> (<year>2008</year>). <article-title>Visualizing data using t-sne</article-title>. <source><italic>Journal of Machine Learning Research</italic></source>, <volume>9</volume>: <fpage>1</fpage>–<lpage>27</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1102_ref_061">
<mixed-citation publication-type="journal"> <string-name><surname>Xie</surname> <given-names>H</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>J</given-names></string-name> (<year>2009</year>). <article-title>SCAD-penalized regression in high-dimensional partially linear models</article-title>. <source><italic>The Annals of Statistics</italic></source>, <volume>37</volume>(<issue>2</issue>): <fpage>673</fpage>–<lpage>696</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/07-AOS580" xlink:type="simple">https://doi.org/10.1214/07-AOS580</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_062">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>CH</given-names></string-name> (<year>2010</year>). <article-title>Nearly unbiased variable selection under minimax concave penalty</article-title>. <source><italic>The Annals of Statistics</italic></source>, <volume>38</volume>(<issue>2</issue>): <fpage>894</fpage>–<lpage>942</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/09-AOS729" xlink:type="simple">https://doi.org/10.1214/09-AOS729</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_063">
<mixed-citation publication-type="journal"> <string-name><surname>Zou</surname> <given-names>H</given-names></string-name> (<year>2006</year>). <article-title>The adaptive lasso and its oracle properties</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>101</volume>(<issue>476</issue>): <fpage>1418</fpage>–<lpage>1429</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/016214506000000735" xlink:type="simple">https://doi.org/10.1198/016214506000000735</ext-link></mixed-citation>
</ref>
<ref id="j_jds1102_ref_064">
<mixed-citation publication-type="journal"> <string-name><surname>Zou</surname> <given-names>H</given-names></string-name>, <string-name><surname>Hastie</surname> <given-names>T</given-names></string-name> (<year>2005</year>). <article-title>Regularization and variable selection via the elastic net</article-title>. <source><italic>Journal of the Royal Statistical Society, Series B, Statistical Methodology</italic></source>, <volume>67</volume>(<issue>2</issue>): <fpage>301</fpage>–<lpage>320</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/j.1467-9868.2005.00503.x" xlink:type="simple">https://doi.org/10.1111/j.1467-9868.2005.00503.x</ext-link></mixed-citation>
</ref>
</ref-list>
</back>
</article>
