Using Big Data for Computer-Aided Diagnosis of Chest X-Rays
Given the prevalence of lung cancer, and the relative ease of obtaining x-rays, as compared to CT scans, developing algorithms for automated chest x-ray screening represents an important societal aim. As such, the PLCO Lung dataset provides a uniquely rich source, as PLCO chest X-rays are accompanied by structured annotations, risk factors, demographics, and outcomes.
The Imaging Biomarkers and Computer-Aided Diagnosis Laboratory is an internationally recognized group with extensive experience in medical imaging and machine learning to extract knowledge from complex multi-factorial data. We aim to use the PLCO Lung dataset, in concert with other data sources, to develop large-scale and reliable computer-aided diagnosis tools for chest x-rays. With this, we hope to develop clinically relevant tools that can be used for automated or computer-aided screening.
-Develop machine learning algorithms, trained on PLCO data, to automatically screen for lung cancer and other diseases
-Develop machine learning algorithms to roughly localize nodules, masses, and other abnormalities
-Investigate how to best use PLCO data in concert with other, less structured datasets, in order to further improve performance
Adam P. Harrison, NIH
Le Lu, NIH
Ke Yan, NIH
Xiaosong Wang, NIH
Yuxing Tang, NIH