Statistical models to predict future subject’s lung cancer risk: application to NLST and PLCO data: Extended Incidence and Mortality
Principal Investigator
Name
Ping Hu
Degrees
ScD
Institution
NCI
Position Title
Mathematical Statistician
Email
About this CDAS Project
Study
NLST
(Learn more about this study)
Project ID
NLST-666
Initial CDAS Request Approval
Apr 23, 2020
Title
Statistical models to predict future subject’s lung cancer risk: application to NLST and PLCO data: Extended Incidence and Mortality
Summary
The National Lung Screening Trial (NLST) compared two ways of detecting lung cancer: low-dose helical computed tomography (CT) and standard chest X-ray. The lung component of the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial was undertaken to determine whether there is a reduction in lung cancer mortality from screening using chest X-ray. Using the data from these two large randomized screening trials with well-defined groups of healthy people, we utilize methods developed by Yong, Wei, etc (2014) to create an optimal stratified prediction procedure to estimate potential lung cancer risk for individuals.
We fitted the NLST data relating to the lung cancer outcome with its baseline covariates. For each fitted model, we create a scoring system for predicting potential lung cancer risk and obtain a corresponding optimal stratification rule. The subpopulation of participants satisfying any given level of risk score can be identified accordingly. Then, all the resulting stratification strategies are evaluated via a conventional cross-validation process.
We illustrate the proposed methods using NLST chest X-ray group as the training and test set, and PLCO lung component as the independent validation set.
We fitted the NLST data relating to the lung cancer outcome with its baseline covariates. For each fitted model, we create a scoring system for predicting potential lung cancer risk and obtain a corresponding optimal stratification rule. The subpopulation of participants satisfying any given level of risk score can be identified accordingly. Then, all the resulting stratification strategies are evaluated via a conventional cross-validation process.
We illustrate the proposed methods using NLST chest X-ray group as the training and test set, and PLCO lung component as the independent validation set.
Aims
The aim of this study is to develop a quantitative stratification procedure for predicting potential cancer risk to identify individuals at higher risk of specific cancers.
Collaborators
SuChun Cheng, ScD, Dana-Farber Cancer Institute
Lu Tian, PhD, Standford University
L.J. Wei, PhD, Harvard University