A Practical Guide to Differentially Private Deep Learning Using the Pseudo Posterior Mechanism
Pub. online: 9 June 2026
Type: Statistical Data Science
Open Access
Received
9 September 2025
9 September 2025
Accepted
2 June 2026
2 June 2026
Published
9 June 2026
9 June 2026
Abstract
Privacy-preserving machine learning methods seek to train useful models that do not disclose information about the data on which they were trained. Such methods are vital when organizations train neural networks on sensitive individual-level data and seek to release the models publicly. Their goal poses a trade-off between predictive performance (utility) and privacy protection. That trade-off makes privacy-preserving machine learning methods difficult to apply in practice, usually requiring extensive iteration and hyperparameter tuning. Yet, practitioners often have little guidance for navigating competing statistical, computational, and privacy demands. We present an implementation algorithm for the Stochastic Weight Averaging–Gaussian Pseudo Posterior Mechanism (SWAG-PPM), a Bayesian differentially private deep learning method. The implementation algorithm focuses on the joint tuning of two key hyperparameters whose interaction governs model convergence and the privacy–utility trade-off. We introduce novel diagnostic tools to evaluate convergence and guide hyperparameter adjustments. Using a transformer model for occupational injury classification, we demonstrate that diagnostic-guided tuning with SWAG-PPM can achieve strong privacy protection and utility. While our case study uses a specific dataset and model architecture, all methodological steps can apply to other settings where privacy risk is heterogeneously distributed.
References
Chew R (2025). OSHA Severe Injury Reports: Jan 2015 - Sep 2023. https://doi.org/10.6084/m9.figshare.28669604.v1
Chew R, Williams MR, Segarra EA, Preiss AJ, Konet A, Savitsky TD (2025). Bayesian pseudo posterior mechanism for differentially private machine learning. arXiv preprint: arXiv:2503.21528.
Dwork C, Kohli N, Mulligan D (2019). Differential privacy in practice: Expose your epsilons! Journal of Privacy and Confidentiality, 9(2). https://doi.org/10.29012/jpc.689
Hu J, Williams MR, Savitsky TD (2022). Mechanisms for global differential privacy under bayesian data synthesis. arXiv preprint: arXiv:2205.05003.
Rigaki M, Garcia S (2024). A survey of privacy attacks in machine learning. ACM Computing Surveys, 56(4): 1–34. https://doi.org/10.1145/3624010