Skip to Main Content

An official website of the United States government

About this Publication
Title
Deep hiearchical multi-label classification applied to chest X-ray abnormality taxonomies.
Pubmed ID
32937229 (View this publication on the PubMed website)
Digital Object Identifier
Publication
Med Image Anal. 2020 Sep 5; Volume 66: Pages 101811
Authors
Chen H, Miao S, Xu D, Hager GD, Harrison AP
Affiliations
  • Johns Hopkins University, Baltimore, MD, United States. Electronic address: hchen135@jhu.edu.
  • PAII Inc., Bethesda, MD, United States.
  • NVIDIA AI-Infra, Bethesda, MD, United States.
  • Johns Hopkins University, Baltimore, MD, United States.
  • PAII Inc., Bethesda, MD, United States. Electronic address: adam.p.harrison@gmail.com.
Abstract

Chest X-rays (CXRs) are a crucial and extraordinarily common diagnostic tool, leading to heavy research for computer-aided diagnosis (CAD) solutions. However, both high classification accuracy and meaningful model predictions that respect and incorporate clinical taxonomies are crucial for CAD usability. To this end, we present a deep hierarchical multi-label classification (HMLC) approach for CXR CAD. Different than other hierarchical systems, we show that first training the network to model conditional probability directly and then refining it with unconditional probabilities is key in boosting performance. In addition, we also formulate a numerically stable cross-entropy loss function for unconditional probabilities that provides concrete performance improvements. Finally, we demonstrate that HMLC can be an effective means to manage missing or incomplete labels. To the best of our knowledge, we are the first to apply HMLC to medical imaging CAD. We extensively evaluate our approach on detecting abnormality labels from the CXR arm of the Prostate, Lung, Colorectal and Ovarian (PLCO) dataset, which comprises over 198,000 manually annotated CXRs. When using complete labels, we report a mean area under the curve (AUC) of 0.887, the highest yet reported for this dataset. These results are supported by ancillary experiments on the PadChest dataset, where we also report significant improvements, 1.2% and 4.1% in AUC and average precision, respectively over strong "flat" classifiers. Finally, we demonstrate that our HMLC approach can much better handle incompletely labelled data. These performance improvements, combined with the inherent usefulness of taxonomic predictions, indicate that our approach represents a useful step forward for CXR CAD.

Related CDAS Studies
Related CDAS Projects