Automatic detection and classification of lung abnormalities via deep learning and machine learning
Aim 1: Create a deep neural network of human lungs. One limitation of deep learning is that it generally requires a large number of images. Digitized chest radiographs from the PLCO trial provide the largest, curated chest X-ray image database, and thus we believe will be an essential component for generating deep learning models. We plan to create a convolutional neural network (CNN) for chest X-rays to initially detect and then classify detected lesions (benign vs. malignant). The networks will serve multiple goals (see Aims 2 and 3), and also can be correlated with secondary data sources (e.g. clinical outcomes) to serve those goals. We also plan to visualize the networks to learn the hierarchy of features and compare it to human radiologist features.
Aim 2: Automatic identification of suspect regions and estimation of likelihood of malignancy . In order to accomplish this, we require annotations on a subset of the images. If these annotations are unavailable from PLCO, we may create our own for the purposes of this study. In addition to generating our own annotations, we also plan to use annotated images to create deep learning features using unsupervised training. We will then analyse these features using machine learning.
Aim 3: Determination of malignancy likelihood. We will use the patient diagnoses and outcome data to estimate likelihood that detected lesion represents a malignant vs. benign etiology. We will use a technique similar to that in Beck et al (2011), but using deep learning features in addition to domain specific features.
Kevin Lyman (Enlitic, Inc.)
Brian Basham (Enlitic, Inc.)
Diogo Almeida (Enlitic, Inc.)
Alan Liu (Enlitic, Inc.)
Scott McKinney (Enlitic, Inc.)