A Designed Look at Artificial Intelligence from the Lens of Fairness
Pub. online: 4 February 2026
Type: Statistical Data Science
Open Access
Received
30 November 2025
30 November 2025
Accepted
24 January 2026
24 January 2026
Published
4 February 2026
4 February 2026
Abstract
As the use of Artificial Intelligence (AI), especially Generative AI, becomes ubiquitous, we take a look at the performance of these methods. We specifically focus on concept of fairness element of trustworthiness. We use Statistical Parity Difference and Equalized Odds Difference to mathematically measure fairness. To systematically study how various factors like bias, access to protected categories, types of intervention affect fairness and accuracy, we performed a simulation as a multi-factor experiment. Our results indicate that accuracy and fairness (in terms of statistical parity and equalized odds) tend to go in opposite directions. This opens up the question of whether we can look at methods that can consider both accuracy and fairness simultaneously.
Supplementary material
Supplementary MaterialThe supplementary materials include Data generation process described in 3.2 as well as the full Python code. The Python implementation is also available at Https://github.com/borhan-stat/fairness-simulation-paper.
References
Chouldechova A (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2): 153–163. https://doi.org/10.1089/big.2016.0047
Darlington RB (1971). Another look at “cultural fairness”. Journal of Educational Measurement, 8(2): 71–82. https://doi.org/10.1111/j.1745-3984.1971.tb00908.x
Doshi-Velez F, Kortz M, Budish R, Bavitz C, Gershman S, O’Brien D, et al. (2017). Accountability of ai under the law: The role of explanation. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3064761
Dutta S, Wei D, Yueksel H, Chen PY, Liu S, Varshney K (2020). Is there a trade-off between fairness and accuracy? A perspective using mismatched hypothesis testing. In: Proceedings of the 37th International Conference on Machine Learning, (H Daumé III, A Singh, eds.), volume 119 of ICML, 2803–2813. PMLR.
Gupta N, Khatri K, Malik Y, Lakhani A, Kanwal A, Aggarwal S, et al. (2024). Exploring prospects, hurdles, and road ahead for generative artificial intelligence in orthopedic education and training. BMC Medical Education, 24: 1544. https://doi.org/10.1186/s12909-024-06592-8
Haltaufderheide J, Ranisch R (2024). The ethics of ChatGPT in medicine and healthcare: A systematic review on large language models (LLMs). npj Digital Medicine, 7: 183. https://doi.org/10.1038/s41746-024-01157-x
Hardt M, Price E, Srebro N (2016). Equality of opportunity in supervised learning. In: Proceedings of the 30th International Conference on Neural Information Processing Systems, NIPS’16, (DD Lee, M Sugiyama, U von Luxburg, I Guyon, R Garnett, eds.), 3323–3331. Curran Associates Inc., Red, Hook, NY, USA.
Ioannidis JPA (2005). Why most published research findings are false. PLoS Medicine, 2(8): e124. https://doi.org/10.1371/journal.pmed.0020124
Kleinberg J, Mullainathan S, Raghavan M (2017). Inherent trade-offs in the fair determination of risk scores. In: 8th Innovations in Theoretical Computer Science Conference (ITCS 2017) (CH Papadimitriou, ed.), volume 67 of Leibniz International Proceedings in Informatics (LIPIcs), 43:1–43:23. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik.
Koh PW, Sagawa S, Marklund H, Xie SM, Zhang M, Balsubramani A, et al. (2021). WILDS: A benchmark of in-the-wild distribution shifts. In: Proceedings of the 38th International Conference on Machine Learning (ICML) (M Meila, T Zhang, eds.), volume 139 of Proceedings of Machine Learning Research, 5637–5664. PMLR.
Tabassum A, Elmahjub E, Padela AI, Zwitter A, Qadir J (2025). Generative ai and the metaverse: A scoping review of ethical and legal challenges. IEEE Open Journal of the Computer Society, 6: 1–15. https://doi.org/10.1109/OJCS.2025.3536082
Weerts H, Pfisterer F, Feurer M, Eggensperger K, Bergman E, Awad N, et al. (2024). Can fairness be automated? Guidelines and opportunities for fairness-aware automl. Journal of Artificial Intelligence Research, 79: 639–677. https://doi.org/10.1613/jair.1.14747
World Medical Association (2013). Declaration of Helsinki: Ethical principles for medical research involving human subjects. JAMA, 310(20): 2191–2194. https://doi.org/10.1001/jama.2013.281053