Journal of Data Science logo


Login Register

  1. Home
  2. To appear
  3. Transfer Learning for Individualized Tre ...

Journal of Data Science

Submit your article Information
  • Article info
  • Related articles
  • More
    Article info Related articles

Transfer Learning for Individualized Treatment Rules with Application to Sepsis Patients Data
Andong Wang   Kelly Wentzlof †   Johnny Rajala †     All authors (6)

Authors

 
Placeholder
https://doi.org/10.6339/25-JDS1193
Pub. online: 7 August 2025      Type: Statistical Data Science      Open accessOpen Access

† Equal Contribution.

Received
24 November 2024
Accepted
23 June 2025
Published
7 August 2025

Abstract

Modern precision medicine aims to utilize real-world data to provide the best treatment for an individual patient. An individualized treatment rule (ITR) maps each patient’s characteristics to a recommended treatment scheme that maximizes the expected outcome of the patient. A challenge precision medicine faces is population heterogeneity, as studies on treatment effects are often conducted on source populations that differ from the populations of interest in terms of the distribution of patient characteristics. Our research goal is to explore a transfer learning algorithm that aims to address the population heterogeneity problem and obtain targeted, optimal, and interpretable ITRs. The algorithm incorporates a calibrated augmented inverse probability weighting estimator for the average treatment effect and employs value function maximization for the target population using Genetic Algorithm to produce our desired ITR. To demonstrate its practical utility, we apply this transfer learning algorithm to two large medical databases, eICU Collaborative Research Database and Medical Information Mart for Intensive Care III. We first identify the important covariates, treatment options, and outcomes of interest based on the two databases, and then estimate the optimal linear ITRs for patients with sepsis. Our research introduces and applies new techniques for data fusion to obtain data-driven ITRs that cater to patients’ individual medical needs in a population of interest. By emphasizing generalizability and personalized decision-making, this methodology extends its potential application beyond medicine to fields such as marketing, technology, social sciences, and education.

Supplementary material

 Supplementary Material
Supplementary materials include pre-processed eICU-CRD and MIMIC-III data files used in the medical application, an R script containing the R functions and R Markdown files for both the simulation study and the medical application.

References

 
Azur M, Stuart E, Frangakis C, Leaf P (2011). Multiple imputation by chained equations: What is it and how does it work? International Journal of Methods in Psychiatric Research, 20(1): 40–49. PMID: 21499542; PMCID, PMC3074241. https://doi.org/10.1002/mpr.329
 
Bang H, Robins JM (2005). Doubly robust estimation in missing data and causal inference models. Biometrics, 61(4): 962–973. https://doi.org/10.1111/j.1541-0420.2005.00377.x
 
Chu J, Lu W, Yang S (2023). Targeted optimal treatment regime learning using summary statistics. Biometrika, 110(4): 913–931. https://doi.org/10.1093/biomet/asad020
 
Colnet B, Mayer I, Chen G, Dieng A, Li R, Varoquaux G, et al. (2024). Causal inference methods for combining randomized trials and observational studies: A review. Statistical Science, 39(1): 165–191. https://doi.org/10.1214/23-STS889
 
Glynn A, Quinn KM (2010). An introduction to the augmented inverse propensity weighted estimator. Political Analysis, 18(1): 36–56. https://doi.org/10.1093/pan/mpp036
 
Hainmueller J (2012). Entropy balancing for causal effects: A multivariate reweighting method to produce balanced samples in observational studies. Political Analysis, 20(1): 25–46. https://doi.org/10.1093/pan/mpr025
 
Imbens G, Rubin D (2015). Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. Cambridge University Press. https://doi.org/10.1017/CBO9781139025751.
 
Katoch S, Chauhan S, Kumar V (2021). A review on genetic algorithm: Past, present, and future. Multimedia Tools and Applications, 80: 8091–8126. https://doi.org/10.1007/s11042-020-10139-6
 
Kosorok MR, Laber EB (2019). Precision medicine. Annual Review of Statistics and Its Application, 6: 263–286. https://doi.org/10.1146/annurev-statistics-030718-105251
 
Mebane Jr WR, Sekhon JS (2011). Genetic optimization using derivatives: The rgenoud package for R. Journal of Statistical Software, 42(11): 1–26. https://doi.org/10.18637/jss.v042.i11
 
Nagin D, Paternoster R (2000). Population heterogeneity and state dependence: State of the evidence and directions for future research. Journal of Quantitative Criminology, 16(2): 117–144. https://doi.org/10.1023/A:1007502804941
 
Rothwell PM (2005). External validity of randomised controlled trials: “to whom do the results of this trial apply?”. The Lancet, 365(9453): 82–93. https://doi.org/10.1016/S0140-6736(04)17670-8
 
Ryall B, Eydallin G, Ferenci T (2012). Culture history and population heterogeneity as determinants of bacterial adaptation: The adaptomics of a single environmental transition. Microbiology and Molecular Biology Reviews, 76(3): 597–625. https://doi.org/10.1128/mmbr.05028-11. https://doi.org/10.1128/MMBR.05028-11
 
Wu L, Yang S (2023). Transfer learning of individualized treatment rules from experimental to real-world data. Journal of Computational and Graphical Statistics, 32(3): 1036–1045. https://doi.org/10.1080/10618600.2022.2141752
 
Zampieri FG, Mazza B (2017). Mechanical ventilation in sepsis: A reappraisal. Shock, 47(1S): 41–46. PMID: 27454388. https://doi.org/10.1097/SHK.0000000000000702
 
Zhang B, Tsiatis AA, Laber EB, Davidian M (2012). A robust method for estimating optimal treatment regimes. Biometrics, 68(4): 1010–1018. https://doi.org/10.1111/j.1541-0420.2012.01763.x
 
Zhao Y, Zeng D, Rush AJ, Kosorok MR (2012). Estimating individualized treatment rules using outcome weighted learning. Journal of the American Statistical Association, 107(499): 1106–1118. https://doi.org/10.1080/01621459.2012.695674
 
Zhou X, Mayer-Hamblett N, Kosorok MR (2017). Residual weighted learning for estimating individualized treatment rules. Journal of the American Statistical Association, 112(517): 169–187. https://doi.org/10.1080/01621459.2015.1093947
 
Zubizarreta JR (2015). Stable weights that balance covariates for estimation with incomplete outcome data. Journal of the American Statistical Association, 110(511): 910–922. https://doi.org/10.1080/01621459.2015.1023805

Related articles PDF XML
Related articles PDF XML

Copyright
2025 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.
by logo by logo
Open access article under the CC BY license.

Keywords
augmented Inverse Probability Weighting causal inference generalizability genetic algorithm optimization population heterogeneity precision medicine

Funding
The authors gratefully acknowledge the generous support from the National Science Foundation (NSF) grant DMS2051010 and National Security Agency (NSA) grant H98230-22-1-0006. This research is also supported in part by the National Institute of Environmental Health Sciences (NIEHS) training grant T32ES007018.

Metrics
since February 2021
10

Article info
views

2

PDF
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy