Skip to Main Content

An official website of the United States government

About this Publication
Cancer Risk Estimation Combining Lung Screening CT with Clinical Data Elements.
Pubmed ID
34870220 (View this publication on the PubMed website)
Digital Object Identifier
Radiol Artif Intell. 2021 Nov; Volume 3 (Issue 6): Pages e210032
Gao R, Tang Y, Khan MS, Xu K, Paulson AB, Sullivan S, Huo Y, Deppen S, Massion PP, Sandler KL, Landman BA
  • Departments of Computer Science (R.G., K.X., Y.H., B.A.L.) and Electrical and Computer Engineering (Y.T., Y.H., B.A.L.), Vanderbilt University, 400 24th Ave S, Featheringill Hall, Room 371, Nashville, TN 37235; and Departments of Radiology and Radiological Sciences (A.B.P., K.L.S.), Thoracic Surgery (S.S., S.D.), General Internal Medicine and Public Health (M.S.K.), Biomedical Informatics (M.S.K.), and Medicine, Division of Allergy, Pulmonary and Critical Care Medicine (P.P.M.), Vanderbilt University Medical Center, Nashville, Tenn.

PURPOSE: To develop a model to estimate lung cancer risk using lung cancer screening CT and clinical data elements (CDEs) without manual reading efforts.

MATERIALS AND METHODS: Two screening cohorts were retrospectively studied: the National Lung Screening Trial (NLST; participants enrolled between August 2002 and April 2004) and the Vanderbilt Lung Screening Program (VLSP; participants enrolled between 2015 and 2018). Fivefold cross-validation using the NLST dataset was used for initial development and assessment of the co-learning model using whole CT scans and CDEs. The VLSP dataset was used for external testing of the developed model. Area under the receiver operating characteristic curve (AUC) and area under the precision-recall curve were used to measure the performance of the model. The developed model was compared with published risk-prediction models that used only CDEs or imaging data alone. The Brock model was also included for comparison by imputing missing values for patients without a dominant pulmonary nodule.

RESULTS: A total of 23 505 patients from the NLST (mean age, 62 years ± 5 [standard deviation]; 13 838 men, 9667 women) and 147 patients from the VLSP (mean age, 65 years ± 5; 82 men, 65 women) were included. Using cross-validation on the NLST dataset, the AUC of the proposed co-learning model (AUC, 0.88) was higher than the published models predicted with CDEs only (AUC, 0.69; P < .05) and with images only (AUC, 0.86; P < .05). Additionally, using the external VLSP test dataset, the co-learning model had a higher performance than each of the published individual models (AUC, 0.91 [co-learning] vs 0.59 [CDE-only] and 0.88 [image-only]; P < .05 for both comparisons).

CONCLUSION: The proposed co-learning predictive model combining chest CT images and CDEs had a higher performance for lung cancer risk prediction than models that contained only CDE or only image data; the proposed model also had a higher performance than the Brock model.Keywords: Computer-aided Diagnosis (CAD), CT, Lung, Thorax Supplemental material is available for this article. © RSNA, 2021.

Related CDAS Studies
Related CDAS Projects