Journal of Data Science


Traditional and GenAI Text Analysis of COVID-19 Pandemic Trends in Hospital Community Benefits IRS Documentation
Volume 22, Issue 3 (2024): Special issue: The Government Advances in Statistical Programming (GASP) 2023 conference, pp. 393–408
Emily Hadley, Laura Marcial, Wes Quattrone, and one additional author (4 authors in total)

https://doi.org/10.6339/24-JDS1144
Pub. online: 23 July 2024. Type: Data Science In Action. Open Access.

Received: 1 December 2023
Accepted: 14 June 2024
Published: 23 July 2024

Abstract

The coronavirus disease 2019 (COVID-19) pandemic presented unique challenges to the U.S. healthcare system, particularly for nonprofit U.S. hospitals that are obligated to provide community benefits in exchange for federal tax exemptions. We sought to examine how hospitals initiated, modified, or disbanded community benefits programming in response to the COVID-19 pandemic. We used the free-response text in Part IV of Internal Revenue Service (IRS) Form 990 Schedule H (F990H) to assess health equity and disparities. We combined traditional key term frequency and Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) clustering approaches with a novel Generative Pre-trained Transformer (GPT) 3.5 summarization approach. Our research reveals shifts in community benefits programming. We observed an increase in COVID-related terms starting in the 2019 tax year, indicating a pivot in community focus and efforts toward pandemic-related activities such as telehealth services and COVID-19 testing and prevention. The clustering analysis identified themes related to COVID-19 and community benefits. Generative Artificial Intelligence (GenAI) summarization with GPT3.5 contextualized these changes, revealing examples of healthcare system adaptations and program cancellations. However, GPT3.5 also encountered some accuracy and validation challenges. This multifaceted text analysis underscores the adaptability of hospitals in maintaining community health support during crises and suggests the potential of advanced AI tools in evaluating large-scale qualitative data for policy and public health research.
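
The key term frequency step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' code (that is in the supplementary material). The term list, the `key_term_share_by_year` function name, and the toy records are hypothetical, and it assumes the free-response text has already been extracted as (tax year, text) pairs.

```python
import re
from collections import defaultdict

# Hypothetical key terms; the term list used in the article is not given on this page.
KEY_TERMS = ("covid", "coronavirus", "pandemic", "telehealth")


def key_term_share_by_year(records, terms=KEY_TERMS):
    """Return {tax_year: {term: share of that year's documents mentioning the term}}."""
    doc_counts = defaultdict(int)
    hits = defaultdict(lambda: defaultdict(int))
    for year, text in records:
        doc_counts[year] += 1
        # Lowercase tokens; hyphens are kept so "covid-19" stays one token.
        tokens = set(re.findall(r"[a-z0-9\-]+", text.lower()))
        for term in terms:
            # Prefix match so "covid" also counts "covid-19".
            if any(tok.startswith(term) for tok in tokens):
                hits[year][term] += 1
    return {
        year: {term: hits[year][term] / n for term in terms}
        for year, n in sorted(doc_counts.items())
    }


# Made-up toy records (tax_year, free-response text), for illustration only.
records = [
    (2018, "The hospital offered free health screenings at community fairs."),
    (2019, "Community fairs were cancelled due to the COVID-19 pandemic."),
    (2019, "We expanded telehealth visits and opened a coronavirus testing site."),
]
shares = key_term_share_by_year(records)
for year, by_term in shares.items():
    print(year, by_term)
```

Counting the share of documents per year, rather than raw term counts, keeps years with different numbers of filings comparable; a real analysis would likely also need stemming and a curated term list.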

Supplementary material

The zipped supplementary material file includes code and output for this analysis.


Copyright
2024 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.
Open access article under the CC BY license.

Keywords
generative artificial intelligence; hospital administration; natural language processing; text mining

Funding
Funding for this work was provided by the Robert Wood Johnson Foundation under grants 77387 and 80508.


Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X
