Recovering complete and draft population genomes from metagenome datasets

Naseer Sangwan, Jack A Gilbert


Assembling metagenomic sequence data into microbial genomes is fundamental to improving our understanding of microbial ecology and metabolism, because it reveals the functional potential of hard-to-culture microorganisms. Here, we provide a synthesis of available methods to bin metagenomic contigs into species-level groups and highlight how genetic diversity, sequencing depth, and coverage influence binning success. Covarying patterns of contig coverage across multiple datasets significantly improve the binning process, despite the computational cost of applying this approach to deeply sequenced complex metagenomes (e.g., soil). We also discuss and compare current genome validation methods and show how these methods tackle the problem of chimeric genome bins, i.e., bins that contain sequences from multiple species. Finally, we explore how population genome assembly can be used to uncover biogeographic trends and to characterize the effect of in situ functional constraints on genome-wide evolution.
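
The idea of binning by coverage covariation can be illustrated with a minimal sketch. It assumes each contig already has a per-sample mean coverage value (for example, from mapping reads of several metagenomes back to a co-assembly) and groups contigs whose log-transformed coverage profiles are strongly correlated across samples. The function names, the contig names, the toy coverage values, and the `min_corr` threshold are all hypothetical and chosen only for illustration; this is not the algorithm of any published binning tool, which typically also use sequence composition and more robust clustering.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat profile carries no covariation signal
    return cov / math.sqrt(var_x * var_y)


def bin_by_coverage(coverage, min_corr=0.95):
    """Group contigs whose coverage profiles covary across samples.

    coverage: dict mapping contig name -> list of mean coverage per sample.
    min_corr: hypothetical correlation threshold for linking two contigs.
    Returns a sorted list of bins, each a sorted list of contig names.
    """
    names = sorted(coverage)
    # Log-transform so that high-coverage samples do not dominate.
    profiles = {n: [math.log(c + 1.0) for c in coverage[n]] for n in names}

    # Union-find: linked contigs end up sharing one root (single linkage).
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if pearson(profiles[a], profiles[b]) >= min_corr:
                parent[find(b)] = find(a)

    bins = {}
    for n in names:
        bins.setdefault(find(n), []).append(n)
    return sorted(bins.values())


# Toy data: four contigs, coverage measured in four samples.
toy_coverage = {
    "contig_1": [10, 40, 5, 20],
    "contig_2": [12, 38, 6, 22],
    "contig_3": [50, 5, 30, 8],
    "contig_4": [48, 6, 33, 7],
}
toy_bins = bin_by_coverage(toy_coverage)
```

The pairwise comparison is quadratic in the number of contigs, which gives a feel for why this kind of analysis becomes computationally expensive on deeply sequenced, complex metagenomes. Adding more samples lengthens each profile and makes chance correlations between contigs from different populations less likely.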





Related Concepts

Sequence Determinations, DNA
Contig Mapping
Genome, Microbial
Datasets as Topic
Biological Evolution
In Situ
