Development and Validation of a Prediction Model for Identifying Small Malignant Pulmonary Nodules
being lung cancer would be of great value to patients, clinicians and researchers.
The proposed study will prepare logistic regression prediction models to estimate the probability that a CT-detected pulmonary nodule (>4mm to <15 mm) is lung cancer. Potential predcitors will include sociodemographic, exposure, medical history and imaging variables. Model building will attempt to maximize goodness-of-fit, calibration (do predicted probabilities correspond to observed probabilities) and discrimination (classification accuracy). Internal validation of the model will be carried out using bootstrap resampling methods to correct estimates of predictive performance for overfitting or optimism. The study sample used for modeling will include all available National Lung Screening Trial (NLST) participants in the CT-screening arm who had one or more pulmonary nodules on a CT screening. External validation will use Early Detection of Lung Cancer - a Pan-Canadian Study data.
(1) To develop a prediction model that will discriminate true lung cancers from false positive CT-detected nodules (>4-<15 mm) detected on chest screening, using sociodemographic, exposure, medical history and imaging data.
(2) To internally validate the predictive performance of the model, including discrimination and calibration, using bootstrap methods.
(3) To externally validate the predictive accuracy of the model using data from the Early Detection of Lung Cancer – A Pan-Canadian Study and the British Columbia Cancer Agency Screening Study (PI's Drs. Stephen Lam and Ming Tsao).