Abstract
Beyond bacteria, the taxonomic diversity of the human fecal metagenome has not been widely studied, despite the potential importance of viruses and eukaryotes. Widely used bioinformatic tools contain limited numbers of non-bacterial species in their databases compared to available genomic sequences, and their methodologies do not favour classification of rare sequences, which may represent only a small fraction of their parent genome. In seeking to optimise identification of non-bacterial species, we evaluated five widely used metagenome classifier programs (BURST, Kraken2, Centrifuge, MetaPhlAn2 and CCMetagen) for their ability to correctly assign and count simulated bacterial, viral and eukaryotic DNA sequence reads. We also assessed the effect of the order in which bacteria, viruses and eukaryotes were analysed and the effect of sequencing depth. We found that the precision of metagenome classifiers varied significantly between programs and between taxonomic groups. When classifying viruses and eukaryotes, ordering the analysis such that bacteria were classified first significantly improved classification precision. Increasing sequencing depth decreased classification precision and did not improve recall of rare species. Choic...
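The precision and recall figures discussed above can be illustrated with a minimal sketch. It assumes read-level definitions (precision as correct assignments over classified reads, recall as correct assignments over all simulated reads); the abstract does not state the exact definitions used in the study, and the function name, read identifiers and taxon labels below are hypothetical.

```python
def precision_recall(truth, assigned):
    """Read-level precision and recall for one classifier run.

    truth:    dict mapping read ID -> true taxon of the simulated read
    assigned: dict mapping read ID -> taxon reported by the classifier,
              or None when the read was left unclassified
    Assumed definitions (not taken from the paper):
      precision = correct assignments / classified reads
      recall    = correct assignments / all simulated reads
    """
    classified = [r for r, taxon in assigned.items() if taxon is not None]
    correct = sum(1 for r in classified if truth.get(r) == assigned[r])
    precision = correct / len(classified) if classified else 0.0
    recall = correct / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical toy data: four simulated reads from three taxonomic groups.
truth = {
    "r1": "Escherichia coli",
    "r2": "Escherichia coli",
    "r3": "Norovirus GII",
    "r4": "Candida albicans",
}
assigned = {
    "r1": "Escherichia coli",     # correct
    "r2": "Salmonella enterica",  # misclassified
    "r3": "Norovirus GII",        # correct
    "r4": None,                   # unclassified
}

precision, recall = precision_recall(truth, assigned)
print(f"precision={precision:.3f} recall={recall:.3f}")
```

In this toy case two of three classified reads are correct and two of four simulated reads are recovered, so a misclassified read lowers precision while an unclassified read lowers only recall.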