Skip to Main Content

An official website of the United States government

About this Publication
Title
EEC-GIFT: a fairness-aware machine learning framework for lung cancer screening eligibility using real-world data.
Pubmed ID
40111867 (View this publication on the PubMed website)
Digital Object Identifier
Publication
JNCI Cancer Spectr. 2025 Mar 20
Authors
Conahan P, Robinson LA, Le T, Valdes G, Schabath MB, Byrne MM, Green L, El Naqa I, Luo Y
Affiliations
  • Department of Machine Learning, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA.
  • Division of Thoracic Oncology (Surgery), H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA.
  • Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, Florida, USA.
  • Department of Health Outcomes and Behavior, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA.
Abstract

OBJECTIVE: We use real-world data to develop a lung cancer screening (LCS) eligibility mechanism that is both accurate and free from racial bias.

METHODS: Our data came from the Prostate, Lung, Colorectal, and Ovarian (PLCO) cancer screening trial. We built a systematic fairness-aware machine learning framework by integrating a Group and Intersectional Fairness and Threshold (GIFT) strategy with an easy ensemble classifier- (EEC-) or logistic regression- (LR-) based model. The best LCS eligibility mechanism EEC-GIFT* and LR-GIFT* were applied to the testing dataset and their performances were compared to the 2021 US Preventive Services Task Force (USPSTF) criteria and PLCOM2012 model. The equal opportunity difference (EOD) of developing lung cancer between Black and White smokers was used to evaluate mechanism fairness.

RESULTS: The fairness of LR-GIFT* or EEC-GIFT* during training was notably greater than that of the LR or EEC models without greatly reducing their accuracy. During testing, the EEC-GIFT* (85.16% vs 78.08%, P < .001) and LR-GIFT* (85.98% vs 78.08%, P < .001) models significantly improved sensitivity without sacrificing specificity compared to the 2021 USPSTF criteria. The EEC-GIFT* (0.785 vs 0.788, P = .28) and LR-GIFT* (0.785 vs 0.788, P = .30) showed similar area under receiver operating characteristic curve (AUC) values compared to the PLCOM2012 model. While the average EODs between Blacks and Whites were significant for the 2021 USPSTF criteria (0.0673, P < .001), PLCOM2012 (0.0566, P < .001), and LR-GIFT* (0.0081, P < .001), the EEC-GIFT* model was unbiased (0.0034, P = .07).

CONCLUSION: Our EEC-GIFT* LCS eligibility mechanism can significantly mitigate racial biases in eligibility determination without compromising its predictive performance.

Related CDAS Studies
Related CDAS Projects