Journal of Data Science logo


Login Register

  1. Home
  2. Issues
  3. Volume 21, Issue 1 (2023)
  4. Subpopulation Treatment Effect Pattern P ...

Journal of Data Science

Submit your article Information
  • Article info
  • Related articles
  • More
    Article info Related articles

Subpopulation Treatment Effect Pattern Plot (STEPP) Methods with R and Stata
Volume 21, Issue 1 (2023), pp. 106–126
Sergio Venturini   Marco Bonetti   Ann A. Lazar     All authors (7)

Authors

 
Placeholder
https://doi.org/10.6339/22-JDS1060
Pub. online: 9 August 2022      Type: Computing In Data Science      Open accessOpen Access

Received
5 May 2022
Accepted
3 July 2022
Published
9 August 2022

Abstract

We introduce the stepp packages for R and Stata that implement the subpopulation treatment effect pattern plot (STEPP) method. STEPP is a nonparametric graphical tool aimed at examining possible heterogeneous treatment effects in subpopulations defined on a continuous covariate or composite score. More pecifically, STEPP considers overlapping subpopulations defined with respect to a continuous covariate (or risk index) and it estimates a treatment effect for each subpopulation. It also produces confidence regions and tests for treatment effect heterogeneity among the subpopulations. The original method has been extended in different directions such as different survival contexts, outcome types, or more efficient procedures for identifying the overlapping subpopulations. In this paper, we also introduce a novel method to determine the number of subjects within the subpopulations by minimizing the variability of the sizes of the subpopulations generated by a specific parameter combination. We illustrate the packages using both synthetic data and publicly available data sets. The most intensive computations in R are implemented in Fortran, while the Stata version exploits the powerful Mata language.

Supplementary material

 Supplementary Material
The R and Stata scripts containing the code related to the examples discussed in the paper and a second application that involves a binary outcome are available in the supplementary material on the journal website.

References

 
Bonetti M, Gelber RD (2000). A graphical method to assess treatment–covariate interactions using the Cox model on subsets of the data. Statistics in Medicine, 19(19): 2595–2609.
 
Bonetti M, Gelber RD (2004). Patterns of treatment effects in subsets of patients in clinical trials. Biostatistics, 5(3): 465–481.
 
Bonetti M, Zahrieh D, Cole BF, Gelber RD (2009). A small sample study of the STEPP approach to assessing treatment–covariate interactions in survival data. Statistics in Medicine, 28(8): 1255–1268.
 
Thürlimann B, Keshaviah A, Coates AS, Mouridsen H, Mauriac L, et al. (Breast International Group (BIG) 1-98 Collaborative Group) (2005). A comparison of letrozole and tamoxifen in postmenopausal women with early breast cancer. The New England Journal of Medicine, 353(26): 2747–2757.
 
Clahsen P, van de Velde C, Duval C, Pallud C, Mandard AM, Delobelle-Deroide A, et al. (1999). The utility of mitotic index, oestrogen receptor and Ki-67 measurements in the creation of novel prognostic indices fornode-negative breast cancer. European Journal of Surgical Oncology, 25(4): 356–363.
 
Coates AS, Keshaviah A, Thürlimann B, Mouridsen H, Mauriac L, Forbes JF, et al. (2007). Five years of letrozole compared with tamoxifen as initial adjuvant therapy for postmenopausal women with endocrine-responsive early breast cancer: Update of study BIG 1-98. Journal of Clinical Oncology, 25(5): 486–492.
 
Collett D (2015). Modelling Survival Data in Medical Research. CRC Press.
 
Cox DR (1972). Regression models and life-tables. Journal of the Royal Statistical Society B, 34(2): 187–202.
 
Fine JP, Gray RJ (1999). A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association, 94(446): 496–509.
 
Foster JC, Taylor JMG, Rubert SJ (2011). Subgroup identification from randomized clinical trial data. Statistics in Medicine, 30: 2867–2880.
 
Gelber RD, Wang XV, Cole BF, Cameron D, Cardoso F, Tjan-Heijnen V, et al. (2022). Six-year absolute invasive disease-free survival benefit of adding adjuvant pertuzumab to trastuzumab and chemotherapy for patients with early her2-positive breast cancer: a subpopulation treatment effect pattern plot (stepp) analysis of the aphinity (big 4-11) trial. European Journal of Cancer, 166: 219–228.
 
Gerdes J, Schwab U, Lemke H, Stein H (1983). Production of a mouse monoclonal antibody reactive with a human nuclear antigen associated with cell proliferation. International Journal of Cancer, 31(1): 13–20.
 
Gray RJ (1988). A class of K-sample tests for comparing the cumulative incidence of a competing risk. The Annals of Statistics, 16(3): 1141–1154.
 
Hayes AF (2017). Introduction to mediation, moderation, and conditional process analysis. Guilford Press.
 
Hyndman RJ, Bashtannyk DM, Grunwald GK (1996). Estimating and cvisualizing conditional densities. Journal of Computational and Graphical Statistics, 5: 315–336.
 
Hyndman RJ, Einbeck J, Wand MP (2022). hdrcde: Highest density regions and conditional density estimation. R package version 3.4.
 
Imai K (2017). Quantitative social science. An introduction. Princeton University Press.
 
Lagakos S (2006). The challenge of subgroup analysis – Reporting without distorting. The New England Journal of Medicine, 354: 1667–1669.
 
Lazar A, Bonetti M, Cole BF, Yip WK, Gelber RD (2016). Identifying treatment effect heterogeneity in clinical trials using subpopulations of events: STEPP. Clinical Trials, 13(2): 169–179.
 
Lazar A, Cole BF, Bonetti M, Gelber RD (2010). Evaluation of treatment-effect heterogeneity using biomarkers measured on a continuous scale: Subpopulation treatment effect pattern plot. Journal of Clinical Oncology, 28(29): 4539–4544. PMID: 20837942.
 
Li J, Zhao L, Tian L, Cai T, Claggett B, Callegaro A, et al. (2015). A predictive enrichment procedure to identify potential responders to a new therapy for randomized, comparative, controlled clinical studies. Biometrics, 72: 877–887.
 
Pesarin F (2001). Multivariate permutation tests. John Wiley & Sons.
 
Peto R, Pike MC, Armitage P, Breslow NE, Cox DR, Howard SV, et al. (1977). Design and analysis of randomized clinical trials requiring prolonged observation of each patient. II. Analysis and examples. British Journal of Cancer, 35(1): 1–39.
 
Pocock S (2008). More on subgroup analysis in clinical trials. The New England Journal of Medicine, 358: 2076–2077.
 
Regan MM, Francis PA, Pagani O, Fleming GF, Walley BA, Viale G, et al. (2016). Absolute benefit of adjuvant endocrine therapies for premenopausal women with hormone receptor–positive, human epidermal growth factor receptor 2–negative early breast cancer: text and soft trials. Journal of Clinical Oncology, 34(19): 2221–2231. PMID: 27044936.
 
Royston P, Altman DG (1994). Regression using fractional polynomials of continuous covariates: parsimonious parametric modelling. Applied Statistics, 43: 429–467.
 
Royston P, Ambler G (1998). sg81: Multivariable fractional polynomials. Stata Technical Bulletin, 43: 24–32.
 
Royston P, Sauerbrei W (2004). A new approach to modelling interactions between treatment and continuous covariates in clinical trials by using fractional polynomials. Statistics in Medicine, 23: 2509–2525.
 
Royston P, Sauerbrei W (2007). Multivariable modeling with cubic regression splines: A principled approach. Stata Journal, 7(1): 45–70.
 
Royston P, Sauerbrei W (2008). Multivariable model-building. John Wiley & Sons.
 
Royston P, Sauerbrei W (2009). Two techniques for investigating interactions between treatment and continuous covariates in clinical trials. Stata Journal, 9(2): 230–251.
 
Simon R (2002). Bayesian subset analysis: Application to studying treatment-by-gender interactions. Statistics in Medicine, 21 2909–2916.
 
Simon R, Dixon DO, Freidlin BA (1995). A Bayesian model for evaluating specificity of treatment effects in clinical trials. Kluwer Academic Publications.
 
VanderWeele TJ (2015). Explanation in causal inference. Oxford University Press.
 
Viale G, Giobbie-Hurder A, Regan MM, Coates AS, Mastropasqua MG, Dell’Orto P, et al. (2008). Prognostic and predictive value of centrally reviewed ki-67 labeling index in postmenopausal women with endocrine-responsive breast cancer: Results from breast international group trial 1-98 comparing adjuvant tamoxifen with letrozole. Journal of Clinical Oncology, 26(34): 5569–5575.
 
Viale G, Regan M, Dell’Orto P, Mastropasqua M, Maiorano E, Rasmussen B, et al. (2011). Which patients benefit most from adjuvant aromatase inhibitors? Results using a composite measure of prognostic risk in the big 1-98 randomized trial. Annals of Oncology, 22(10): 2201–2207.
 
Wang R, Lagakos SW, Ware JH, Hunter DJ, Drazen JM (2007). Statistics in medicine—Reporting of subgroup analyses in clinical trials. The New England Journal of Medicine, 357(21): 2189–2194.
 
Yip WK, Bonetti M, Cole BF, Barcella W, Wang XV, Lazar A, et al. (2016). Subpopulation treatment effect pattern plot (STEPP) analysis for continuous, binary, and count outcomes. Clinical Trials, 13(4): 382–390.
 
Zhao L, Tian L, Cai T, Claggett B, Wei LJ (2013). Effectively selecting a target population for a future comparative study. Journal of the American Statistical Association, 108(502):527–539.

Related articles PDF XML
Related articles PDF XML

Copyright
2023 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.
by logo by logo
Open access article under the CC BY license.

Keywords
clinical trial interaction subgroup analysis subpopulation treatment-covariate interaction

Metrics
since February 2021
1213

Article info
views

895

PDF
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy