Skip to Main Content

An official website of the United States government

About this Publication
Penalized deep partially linear cox models with application to CT scans of lung cancer patients.
Pubmed ID
38412302 (View this publication on the PubMed website)
Digital Object Identifier
Biometrics. 2024 Jan 29; Volume 80 (Issue 1)
Sun Y, Kang J, Haridas C, Mayne N, Potter A, Yang CF, Christiani DC, Li Y
  • Department of Mathematics, William & Mary, Williamsburg, VA 23185, United States.
  • Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, United States.
  • Division of Thoracic Surgery, Department of Surgery, Massachusetts General Hospital, Boston, MA 02114, United States.
  • Department of Medicine, Duke University, Durham, NC 27710, United States.
  • Department of Environmental Health and Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, United States.

Lung cancer is a leading cause of cancer mortality globally, highlighting the importance of understanding its mortality risks to design effective patient-centered therapies. The National Lung Screening Trial (NLST) employed computed tomography texture analysis, which provides objective measurements of texture patterns on CT scans, to quantify the mortality risks of lung cancer patients. Partially linear Cox models have gained popularity for survival analysis by dissecting the hazard function into parametric and nonparametric components, allowing for the effective incorporation of both well-established risk factors (such as age and clinical variables) and emerging risk factors (eg, image features) within a unified framework. However, when the dimension of parametric components exceeds the sample size, the task of model fitting becomes formidable, while nonparametric modeling grapples with the curse of dimensionality. We propose a novel Penalized Deep Partially Linear Cox Model (Penalized DPLC), which incorporates the smoothly clipped absolute deviation (SCAD) penalty to select important texture features and employs a deep neural network to estimate the nonparametric component of the model. We prove the convergence and asymptotic properties of the estimator and compare it to other methods through extensive simulation studies, evaluating its performance in risk prediction and feature selection. The proposed method is applied to the NLST study dataset to uncover the effects of key clinical and imaging risk factors on patients' survival. Our findings provide valuable insights into the relationship between these factors and survival outcomes.

Related CDAS Studies
Related CDAS Projects