Applying deep learning models to predict nodal metastasis from pathology imaging
Traditionally, patients without LNM would receive either surgery or radiation therapy alone. However, the ADAURA recently demonstrated a survival benefit for adjuvant systemic therapy after surgery even to early-stage NSCLC patients with an EGFR mutation. This suggests that there may be a high risk subset of early stage patients that could benefit from treatment intensification. Deep learning methods have been developed to predict EGFR mutation status of NSCLC from primary histology and CT imaging. A tool to identify subsets of early-stage patients could improve patient outcomes and the development of personalized cancer care.
To the best of our knowledge, there does not exist a deep learning method of predicting LNM from purely the primary tumor histology imaging in lung cancer. Other studies have been done to predict the presence of LNM in cervical, prostate, and colorectal cancer from primary pathology . A deep learning tool that utilizes primary pathology images to predict LNM would serve as an additional screening tool for LNM and potentially aid in accurate staging and potentially help to identify subsets of patients that would benefit from treatment intensification. Additionally, some patients may find it challenging to endure staging procedures like EBUS, and a screening tool to detect nodal metastasis from the primary biopsy could assist in identifying those who would benefit from undergoing additional diagnostic efforts.
A variety of deep learning techniques have been published and made publicly available for pathology whole slide image analysis. Due to the large image size of pathology slides, techniques often partition the image into patches to be used in the deep learning models. Patches can also be assigned attention scores to highlight important regions for classification.
We aim to use the NLST dataset to predict nodal metastasis from whole slide images.
Aim 1: Using whole slide images, we will train a deep learning model to predict nodal metastasis from the images of primary pathology.
Aim 2: We will analyze various methods of data augmentation including geometric transformation and stain augmentation to determine if certain data augmentation techniques can improve prediction accuracy.
Aim 3: We will create a multimodal model that uses input from chest CT imaging and pathology imaging to study if a multimodal model improves prediction.
Sanjay Aneja, Yale School of Medicine
Victor Lee, Yale School of Medicine
Amber Loren Ong King, Yale School of Medicine