Journal of Data Science (JDS)

ISSN 1683-8602, 1680-743X

School of Statistics, Renmin University of China

Article ID: JDS1138

DOI: 10.6339/24-JDS1138

Computing in Data Science

Unified Robust Boosting

Zhu Wang 1,∗ (ORCID: https://orcid.org/0000-0002-0773-0052)

1 Department of Preventive Medicine, The University of Tennessee Health Science Center, Memphis, TN, United States

∗Email: zwang145@uthsc.edu

2025, 23(1): 90–108

28 June 2024

Supplementary Material

The R code necessary to reproduce the analysis presented in the manuscript is provided.

18 March 2024; 22 April 2024

2025 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.

Open access article under the CC BY license.

Boosting is a popular algorithm in supervised machine learning with wide applications in regression and classification. It combines weak learners, such as regression trees, to obtain accurate predictions. In the presence of outliers, however, traditional boosting may yield inferior results because it optimizes a convex loss function. Recent literature has proposed boosting algorithms that optimize robust nonconvex loss functions, but these methods do not provide weighted estimation that indicates the outlier status of observations. This article introduces the iteratively reweighted boosting (IRBoost) algorithm, which combines robust loss optimization with weighted estimation and can be conveniently constructed with existing software. Its output includes observation weights, which serve as diagnostics for outlier status. For practitioners interested in boosting, the new method can be interpreted as a way to tune robust observation weights. IRBoost is implemented in the R package irboost and is demonstrated on publicly available data in generalized linear models, classification, and survival data analysis.

Keywords: boosting; CC-family; IRBoost; IRCO; machine learning; robust method
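The reweighting idea summarized in the abstract can be illustrated with a small, self-contained sketch. It is not the irboost implementation and does not reproduce the paper's algorithm in detail. It assumes a robust loss formed by composing a concave function g(z) = sigma^2 (1 - exp(-z / sigma^2)) with the squared-error term z = r^2 / 2, so that the derivative g'(z) = exp(-r^2 / (2 sigma^2)) serves as the observation weight in the next weighted fit. The stump learner, the toy data, and the names `fit_stump`, `boost`, and `reweighted_boost` are all hypothetical choices made for this illustration.

```python
import math

def fit_stump(x, r, w):
    """Weighted least-squares regression stump on one feature.

    Returns (split, left_value, right_value), or None if no valid split exists.
    """
    order = sorted(range(len(x)), key=lambda i: x[i])
    sw = sum(w)
    swr = sum(wi * ri for wi, ri in zip(w, r))
    best_gain, best = -1.0, None
    wl = sl = 0.0
    for pos in range(len(order) - 1):
        i, j = order[pos], order[pos + 1]
        wl += w[i]
        sl += w[i] * r[i]
        wr, sr = sw - wl, swr - sl
        if x[i] == x[j] or wl <= 0.0 or wr <= 0.0:
            continue
        # Maximizing this quantity minimizes the weighted squared error.
        gain = sl * sl / wl + sr * sr / wr
        if gain > best_gain:
            best_gain = gain
            best = ((x[i] + x[j]) / 2, sl / wl, sr / wr)
    return best

def boost(x, y, w, n_rounds=20, nu=0.1):
    """Weighted squared-error gradient boosting with stumps; returns fitted values."""
    f0 = sum(wi * yi for wi, yi in zip(w, y)) / sum(w)
    fitted = [f0] * len(y)
    for _ in range(n_rounds):
        resid = [yi - fi for yi, fi in zip(y, fitted)]
        stump = fit_stump(x, resid, w)
        if stump is None:
            break
        split, left, right = stump
        fitted = [fi + nu * (left if xi <= split else right)
                  for xi, fi in zip(x, fitted)]
    return fitted

def reweighted_boost(x, y, sigma=1.0, n_outer=5):
    """Alternate between a weighted boosting fit and a robust weight update."""
    w = [1.0] * len(y)
    for _ in range(n_outer):
        fitted = boost(x, y, w)
        # Assumed weight rule: derivative of the concave component at r^2 / 2.
        w = [math.exp(-((yi - fi) ** 2) / (2 * sigma ** 2))
             for yi, fi in zip(y, fitted)]
    return fitted, w

# Toy data (made up for this sketch): a step function with one contaminated response.
x = [i / 10 for i in range(30)]
y = [1.0 if xi > 1.5 else 0.0 for xi in x]
y[5] = 10.0

fitted, weights = reweighted_boost(x, y)
# Index of the observation with the smallest weight (the suspected outlier).
print(min(range(len(weights)), key=weights.__getitem__))
```

In this sketch the weights play the diagnostic role described in the abstract: observations with large residuals under the current fit receive weights near zero and have little influence on the next fit, while well-fitted observations keep weights near one. The inner `boost` step could be replaced by any boosting routine that accepts observation weights.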

This work was partially supported by the National Institute of Diabetes and Digestive and Kidney Diseases of the National Institutes of Health under Award Number R21DK130006.
