Journal of Data Science logo


Login Register

  1. Home
  2. To appear
  3. HIMA: An R Package for High-Dimensional ...

Journal of Data Science

Submit your article Information
  • Article info
  • Related articles
  • More
    Article info Related articles

HIMA: An R Package for High-Dimensional Mediation Analysis
Haixiang Zhang †   Yinan Zheng †   Lifang Hou     All authors (4)

Authors

 
Placeholder
https://doi.org/10.6339/25-JDS1192
Pub. online: 9 July 2025      Type: Computing In Data Science      Open accessOpen Access

† Co-first authors.

Received
30 August 2024
Accepted
23 June 2025
Published
9 July 2025

Abstract

Mediation analysis plays an important role in many research fields, yet it is very challenging to perform estimation and hypothesis testing for high-dimensional mediation effects. We develop a user-friendly $\mathsf{R}$ package HIMA for high-dimensional mediation analysis with varying mediator and outcome specifications. The HIMA package is a comprehensive tool that accommodates various types of high-dimensional mediation models. This paper offers an overview of the functions within HIMA and demonstrates the practical utility of HIMA through simulated datasets. The HIMA package is publicly available from the Comprehensive $\mathsf{R}$ Archive Network at https://CRAN.R-project.org/package=HIMA.

Supplementary material

 Supplementary Material
In the Supplementary Material, we provide the R code implementations corresponding to the illustrations presented in Section 4.

References

 
An M, Zhang H (2023). High-dimensional mediation analysis for time-to-event outcomes with additive hazards model. Mathematics, 11: 1–11.
 
Bai X, Zheng Y, Hou L, Zheng C, Liu L, Zhang H (2024). An efficient testing procedure for high-dimensional mediators with FDR control. Statistics in Biosciences. https://doi.org/10.1007/s12561-024-09447-4
 
Baron RM, Kenny DA (1986). The moderator–mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of Personality and Social Psychology, 51(6): 1173–1182. https://doi.org/10.1037/0022-3514.51.6.1173
 
Bell B, Rose CL, Damon A (1966). The veterans administration longitudinal study of healthy aging. The Gerontologist, 6: 179–184. https://doi.org/10.1093/geront/6.4.179
 
Fan J, Lv J (2008). Sure independence screening for ultrahigh dimensional feature space. Journal of the Royal Statistical Society, Series B, Statistical Methodology, 70: 849–911. https://doi.org/10.1111/j.1467-9868.2008.00674.x
 
James Y, Dai JLS, LeBlanc M (2022). A multiple-testing procedure for high-dimensional mediation hypotheses. Journal of the American Statistical Association, 117: 198–213. https://doi.org/10.1080/01621459.2020.1765785
 
Luo C, Fa B, Yan Y, Wang Y, Zhou Y, Zhang Y, et al. (2020). High-dimensional mediation analysis in survival models. PLoS Computational Biology, 16(4): e1007768. https://doi.org/10.1371/journal.pcbi.1007768
 
Pearson C, Bartell T, Wang G, et al. (2022). Boston birth cohort profile: Rationale and study design. Precision Nutrition, 1: 1–12.
 
Perera C, Zhang H, Zheng Y, Hou L, Qu A, Zheng C, et al. (2022). HIMA2: High-dimensional mediation analysis and its application in epigenome-wide DNA methylation data. BMC Bioinformatics, 23: 1–14. https://doi.org/10.1186/s12859-021-04477-x
 
Shen E, Chou CP, Pentz MA, Berhane K (2014). Quantile mediation models: A comparison of methods for assessing mediation across the outcome distribution. Multivariate Behavioral Research, 49: 471–485. https://doi.org/10.1080/00273171.2014.904221
 
Storey JD, Bass AJ, Dabney A, Robinson D (2024). qvalue: Q-value estimation for false discovery rate control. R package version 2.38.0.
 
Sun R, Zhou X, Song X (2021). Bayesian causal mediation analysis with latent mediators and survival outcome. Structural Equation Modeling, 28: 778–790. https://doi.org/10.1080/10705511.2020.1863154
 
Tomczak K, Czerwiska P, Wiznerowicz M (2015). Review the cancer genome atlas (TCGA): An immeasurable source of knowledge. Contemporary Oncology, 19: 68–77.
 
Wang C, Hu J, Blaser MJ, Li H (2020). Estimating and testing the microbial causal mediation effect with high-dimensional and compositional microbiome data. Bioinformatics, 36: 347–355. https://doi.org/10.1093/bioinformatics/btz565
 
Wu D, Yang H, Winham SJ, Natanzon Y, Koestler DC, Luo T, et al. (2018). Mediation analysis of alcohol consumption, DNA methylation, and epithelial ovarian cancer. Journal of Human Genetics, 63: 339–348. https://doi.org/10.1038/s10038-017-0385-8
 
Yimer BB, Lunt M, Beasley M, Macfarlane GJ, McBeth J (2023). Bayesgmed: An R-package for Bayesian causal mediation analysis. PLoS ONE, 18: 1–14.
 
Yu Q, Li B (2017). mma: An R package for mediation analysis with multiple mediators. Journal of Open Research Software, 5: 1–11. https://doi.org/10.5334/jors.160
 
Zhang CH (2010). Nearly unbiased variable selection under minimax concave penalty. The Annals of Statistics, 38: 894–942.
 
Zhang H (2025). Efficient adjusted joint significance test and Sobel-type confidence interval for mediation effect. Structural Equation Modeling, 32: 93–104. https://doi.org/10.1080/10705511.2024.2392139
 
Zhang H, Chen J, Feng Y, Wang C, Li H, Liu L (2021a). Mediation effect selection in high-dimensional and compositional microbiome data. Statistics in Medicine, 40: 885–896. https://doi.org/10.1002/sim.8808
 
Zhang H, Chen J, Li Z, Liu L (2021b). Testing for mediation effect with application to human microbiome data. Statistics in Biosciences, 13: 313–328. https://doi.org/10.1007/s12561-019-09253-3
 
Zhang H, Hong X, Zheng Y, Hou L, Zheng C, Wang X, et al. (2024). High-dimensional quantile mediation analysis with application to a birth cohort study of mother-newborn pairs. Bioinformatics, 40: 1–8.
 
Zhang H, Hou L, Liu L (2022). A review of high-dimensional mediation analyses in DNA methylation studies. In: Epigenome-Wide Association Studies: Methods and Protocols (W Guan, ed.), 2432.
 
Zhang H, Li X (2023). A framework for mediation analysis with massive data. Statistics and Computing, 33: 1–16. https://doi.org/10.1007/s11222-022-10178-z
 
Zhang H, Zheng Y, Hou L, Zheng C, Liu L (2021). Mediation analysis for survival data with high-dimensional mediators. Bioinformatics, 37: 3815–3821. https://doi.org/10.1093/bioinformatics/btab564
 
Zhang H, Zheng Y, Zhang Z, Gao T, Joyce B, Yoon G, et al. (2016). Estimating and testing high-dimensional mediation effects in epigenetic studies. Bioinformatics, 32(20): 3150–3154. https://doi.org/10.1093/bioinformatics/btw351
 
Zhang J, Wei Z, Chen J (2018). A distance-based approach for testing the mediation effect of the human microbiome. Bioinformatics, 34(11): 1875–1883. https://doi.org/10.1093/bioinformatics/bty014

Related articles PDF XML
Related articles PDF XML

Copyright
2025 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.
by logo by logo
Open access article under the CC BY license.

Keywords
DNA methylation mediator selection penalized estimate quantile regression

Metrics
since February 2021
25

Article info
views

11

PDF
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy