Transfer Learning for Individualized Treatment Rules with Application to Sepsis Patients Data
Pub. online: 7 August 2025
Type: Statistical Data Science
Open Access
†
Equal Contribution.
Received
24 November 2024
24 November 2024
Accepted
23 June 2025
23 June 2025
Published
7 August 2025
7 August 2025
Abstract
Modern precision medicine aims to utilize real-world data to provide the best treatment for an individual patient. An individualized treatment rule (ITR) maps each patient’s characteristics to a recommended treatment scheme that maximizes the expected outcome of the patient. A challenge precision medicine faces is population heterogeneity, as studies on treatment effects are often conducted on source populations that differ from the populations of interest in terms of the distribution of patient characteristics. Our research goal is to explore a transfer learning algorithm that aims to address the population heterogeneity problem and obtain targeted, optimal, and interpretable ITRs. The algorithm incorporates a calibrated augmented inverse probability weighting estimator for the average treatment effect and employs value function maximization for the target population using Genetic Algorithm to produce our desired ITR. To demonstrate its practical utility, we apply this transfer learning algorithm to two large medical databases, eICU Collaborative Research Database and Medical Information Mart for Intensive Care III. We first identify the important covariates, treatment options, and outcomes of interest based on the two databases, and then estimate the optimal linear ITRs for patients with sepsis. Our research introduces and applies new techniques for data fusion to obtain data-driven ITRs that cater to patients’ individual medical needs in a population of interest. By emphasizing generalizability and personalized decision-making, this methodology extends its potential application beyond medicine to fields such as marketing, technology, social sciences, and education.
Supplementary material
Supplementary MaterialSupplementary materials include pre-processed eICU-CRD and MIMIC-III data files used in the medical application, an R script containing the R functions and R Markdown files for both the simulation study and the medical application.
References
Azur M, Stuart E, Frangakis C, Leaf P (2011). Multiple imputation by chained equations: What is it and how does it work? International Journal of Methods in Psychiatric Research, 20(1): 40–49. PMID: 21499542; PMCID, PMC3074241. https://doi.org/10.1002/mpr.329
Bang H, Robins JM (2005). Doubly robust estimation in missing data and causal inference models. Biometrics, 61(4): 962–973. https://doi.org/10.1111/j.1541-0420.2005.00377.x
Chu J, Lu W, Yang S (2023). Targeted optimal treatment regime learning using summary statistics. Biometrika, 110(4): 913–931. https://doi.org/10.1093/biomet/asad020
Colnet B, Mayer I, Chen G, Dieng A, Li R, Varoquaux G, et al. (2024). Causal inference methods for combining randomized trials and observational studies: A review. Statistical Science, 39(1): 165–191. https://doi.org/10.1214/23-STS889
Glynn A, Quinn KM (2010). An introduction to the augmented inverse propensity weighted estimator. Political Analysis, 18(1): 36–56. https://doi.org/10.1093/pan/mpp036
Hainmueller J (2012). Entropy balancing for causal effects: A multivariate reweighting method to produce balanced samples in observational studies. Political Analysis, 20(1): 25–46. https://doi.org/10.1093/pan/mpr025
Imbens G, Rubin D (2015). Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. Cambridge University Press. https://doi.org/10.1017/CBO9781139025751.
Katoch S, Chauhan S, Kumar V (2021). A review on genetic algorithm: Past, present, and future. Multimedia Tools and Applications, 80: 8091–8126. https://doi.org/10.1007/s11042-020-10139-6
Kosorok MR, Laber EB (2019). Precision medicine. Annual Review of Statistics and Its Application, 6: 263–286. https://doi.org/10.1146/annurev-statistics-030718-105251
Mebane Jr WR, Sekhon JS (2011). Genetic optimization using derivatives: The rgenoud package for R. Journal of Statistical Software, 42(11): 1–26. https://doi.org/10.18637/jss.v042.i11
Nagin D, Paternoster R (2000). Population heterogeneity and state dependence: State of the evidence and directions for future research. Journal of Quantitative Criminology, 16(2): 117–144. https://doi.org/10.1023/A:1007502804941
Rothwell PM (2005). External validity of randomised controlled trials: “to whom do the results of this trial apply?”. The Lancet, 365(9453): 82–93. https://doi.org/10.1016/S0140-6736(04)17670-8
Ryall B, Eydallin G, Ferenci T (2012). Culture history and population heterogeneity as determinants of bacterial adaptation: The adaptomics of a single environmental transition. Microbiology and Molecular Biology Reviews, 76(3): 597–625. https://doi.org/10.1128/mmbr.05028-11. https://doi.org/10.1128/MMBR.05028-11
Wu L, Yang S (2023). Transfer learning of individualized treatment rules from experimental to real-world data. Journal of Computational and Graphical Statistics, 32(3): 1036–1045. https://doi.org/10.1080/10618600.2022.2141752
Zampieri FG, Mazza B (2017). Mechanical ventilation in sepsis: A reappraisal. Shock, 47(1S): 41–46. PMID: 27454388. https://doi.org/10.1097/SHK.0000000000000702
Zhang B, Tsiatis AA, Laber EB, Davidian M (2012). A robust method for estimating optimal treatment regimes. Biometrics, 68(4): 1010–1018. https://doi.org/10.1111/j.1541-0420.2012.01763.x
Zhao Y, Zeng D, Rush AJ, Kosorok MR (2012). Estimating individualized treatment rules using outcome weighted learning. Journal of the American Statistical Association, 107(499): 1106–1118. https://doi.org/10.1080/01621459.2012.695674
Zhou X, Mayer-Hamblett N, Kosorok MR (2017). Residual weighted learning for estimating individualized treatment rules. Journal of the American Statistical Association, 112(517): 169–187. https://doi.org/10.1080/01621459.2015.1093947
Zubizarreta JR (2015). Stable weights that balance covariates for estimation with incomplete outcome data. Journal of the American Statistical Association, 110(511): 910–922. https://doi.org/10.1080/01621459.2015.1023805