Skip to Main Content

An official website of the United States government

About this Publication
Title
MGS-Fast: Metagenomic shotgun data fast annotation using microbial gene catalogs.
Pubmed ID
30942867 (View this publication on the PubMed website)
Publication
Gigascience. 2019 Apr; Volume 8 (Issue 4)
Authors
Brown SM, Chen H, Hao Y, Laungani BP, Ali TA, Dong C, Lijeron C, Kim B, Wultsch C, Pei Z, Krampis K
Affiliations
  • New York University Langonne Medical Center, 333 E 38th St, New York, NY, 10016, USA.
  • Department of Biological Sciences and Center for Translational and Basic Research, Belfer Research Building, Hunter College of The City University of New York, 333 E 38th St, New York, NY, 10016, US.
  • Research Foundation of The City University of New York, 333 E 38th St, New York, NY, 10016, USA.
  • Department of Veterans Affairs New York Harbor Healthcare System, 333 E 38th St, New York, NY, 10016, USA.
Abstract

BACKGROUND: Current methods used for annotating metagenomics shotgun sequencing (MGS) data rely on a computationally intensive and low-stringency approach of mapping each read to a generic database of proteins or reference microbial genomes.

RESULTS: We developed MGS-Fast, an analysis approach for shotgun whole-genome metagenomic data utilizing Bowtie2 DNA-DNA alignment of reads that is an alternative to using the integrated catalog of reference genes database of well-annotated genes compiled from human microbiome data. This method is rapid and provides high-stringency matches (>90% DNA sequence identity) of the metagenomics reads to genes with annotated functions. We demonstrate the use of this method with data from a study of liver disease and synthetic reads, and Human Microbiome Project shotgun data, to detect differentially abundant Kyoto Encyclopedia of Genes and Genomes gene functions in these experiments. This rapid annotation method is freely available as a Galaxy workflow within a Docker image.

CONCLUSIONS: MGS-Fast can confidently transfer functional annotations from gene databases to metagenomic reads, with speed and accuracy.

Related CDAS Studies
Related CDAS Projects