Computational pan-genomics: status, promises and challenges

Briefings in Bioinformatics
Computational Pan-Genomics Consortium

Abstract

Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. W...Continue Reading

References

Jul 28, 1995·Science·R D FleischmannJ M Merrick
Oct 25, 1996·Science·A GoffeauS G Oliver
Jun 26, 1999·Science·W F Doolittle
Feb 22, 2001·Science·J Craig VenterX Zhu
Mar 10, 2001·Nature·Eric S LanderInternational Human Genome Sequencing Consortium
Aug 10, 2002·Bioinformatics·Steffen HeberPavel A Pevzner
Jul 3, 2004·Genome Research·Aaron C E DarlingNicole T Perna
Sep 3, 2004·Genome Research·Pavel A PevznerGlenn Tesler
May 12, 2005·Nature Reviews. Microbiology·Robert A Edwards, Forest Rohwer
Sep 13, 2005·Annual Review of Microbiology·B SnelBas E Dutilh
Sep 21, 2005·Proceedings of the National Academy of Sciences of the United States of America·Hervé TettelinClaire M Fraser
Oct 29, 2005·Nature·International HapMap Consortium
Mar 4, 2006·Science·Francesca D CiccarelliPeer Bork
May 9, 2006·Current Opinion in Structural Biology·Robert C Edgar, Serafim Batzoglou
Sep 19, 2006·Nature Reviews. Microbiology·Thomas Lengauer, Tobias Sing
Jan 24, 2007·Bioinformatics·Bas E DutilhMartijn A Huynen
Feb 16, 2007·Current Opinion in Plant Biology·Michele MorganteSlobodanka Radovic
Sep 6, 2007·PLoS Computational Biology·Cédric Notredame
Jan 20, 2009·Bioinformatics·Andrew M WaterhouseGeoffrey J Barton
Feb 13, 2009·Nature·Gianni LitiEdward J Louis
Apr 11, 2009·Nature·Michael R StrattonP Andrew Futreal
Jun 13, 2009·Genome Biology·Detlef Weigel, Richard Mott
Sep 19, 2009·Genome Biology·Korbinian SchneebergerDetlef Weigel
Mar 18, 2010·Nature Methods·Cydney B NielsenTing Wang
Mar 10, 2010·Genomics·Jason R MillerGranger Sutton
Apr 10, 2010·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Veli MäkinenNiko Välimäki
May 13, 2010·Briefings in Bioinformatics·Heng Li, Nils Homer
Jun 3, 2010·Nature Reviews. Genetics·Jonathan L Marchini, Bryan Howie
Jun 11, 2010·Proceedings of the National Academy of Sciences of the United States of America·Brian TeagueDavid C Schwartz
Nov 18, 2010·Genome Biology and Evolution·Daniel H Huson, Celine Scornavacca
Feb 9, 2011·Nature Reviews. Genetics·Ryan TewheyNicholas J Schork
Mar 2, 2011·Nature Reviews. Genetics·Can AlkanEvan E Eichler
Mar 10, 2011·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Benedict PatenDavid Haussler
Mar 26, 2011·PloS One·Lisa Zeigler AllenShannon J Williamson
Apr 28, 2011·BMC Bioinformatics·Osvaldo ZagordiNiko Beerenwinkel
May 19, 2011·Nature Reviews. Genetics·Rasmus NielsenYun S Song
Jun 15, 2011·Genome Research·Benedict PatenDavid Haussler
Aug 9, 2011·Molecular Biotechnology·Delfina BarabaschiLuigi Cattivelli
Aug 13, 2011·BMC Bioinformatics·Páll Melsted, Jonathan K Pritchard
Sep 17, 2011·Nature Reviews. Genetics·Sharon R Browning, Brian L Browning
Oct 14, 2011·Genome Research·Chris D GreenmanPeter J Campbell
Jan 11, 2012·Nature Genetics·Zamin IqbalGil A McVean
Mar 24, 2012·Current Opinion in Virology·John L MokiliBas E Dutilh
Apr 12, 2012·Nature Biotechnology·Grégory F Schneider, Cees Dekker
Apr 20, 2012·Nature Reviews. Cancer·Andriy MarusykKornelia Polyak
Apr 21, 2012·Briefings in Bioinformatics·Helga ThorvaldsdóttirJill P Mesirov
Jun 5, 2012·Nature Genetics·Yinping JiaoJinsheng Lai
Jun 13, 2012·Bioinformatics·A HerbigKay Nieselt
Jun 16, 2012·Nature·Human Microbiome Project Consortium
Sep 11, 2012·Nucleic Acids Research·Christine M MalboeufJoshua Z Levin
Oct 12, 2012·Journal of Virology·Jonathan M CarlsonInternational HIV Adaptation Collaborative
Nov 14, 2012·Annual Review of Genetics·Matthew D Daugherty, Harmit S Malik
Jan 18, 2013·Bioinformatics·Guillaume RizkRayan Chikhi
Mar 22, 2013·BMC Bioinformatics·Giulia MenconiRoberto Marangoni
Apr 11, 2013·JAMA : the Journal of the American Medical Association·Nicholas J LomanMark J Pallen
Apr 30, 2013·Briefings in Functional Genomics·Bas E DutilhSacha A F T van Hijum
May 31, 2013·Genome Biology·Michael G RossDavid B Jaffe
Jun 14, 2013·BMC Bioinformatics·Manuel AllhoffTobias Marschall
Jul 3, 2013·Bioinformatics·Derek Aguiar, Sorin Istrail
Jul 3, 2013·Bioinformatics·Lin HuangSerafim Batzoglou
Nov 5, 2013·Nature Biotechnology·Joshua N BurtonJay Shendure
Dec 18, 2013·Nature·Tom A WilliamsT Martin Embley
Feb 25, 2014·Nature Biotechnology·Volodymyr KuleshovMichael Snyder
Mar 4, 2014·Genome Biology·Derrick E Wood, Steven L Salzberg
Mar 19, 2014·Proceedings of the National Academy of Sciences of the United States of America·Adina Chuang HoweC Titus Brown
Mar 29, 2014·PLoS Computational Biology·Emily BergerBonnie Berger
Mar 29, 2014·PLoS Computational Biology·Armin TöpferNiko Beerenwinkel
Apr 9, 2014·Nature Reviews. Genetics·Jeffrey Rogers, Richard A Gibbs
Apr 10, 2014·BMC Bioinformatics·Birte KehrKnut Reinert
May 23, 2014·Nature·Leonid L MorozAndrea B Kohn
Jul 6, 2014·Bioinformatics·Erwan DrezenDominique Lavenier
Jul 7, 2014·Nature Biotechnology·Junhua LiMetaHIT Consortium
Jul 22, 2014·The Plant Journal : for Cell and Molecular Biology·Saulo AflitosSander Peters
Aug 21, 2014·Bioinformatics·Ngan NguyenBenedict Paten
Aug 28, 2014·Bioinformatics·Volodymyr Kuleshov
Aug 28, 2014·Nature Genetics·Dina N PaltooNational Institutes of Health Genomic Data Sharing Governance Committees
Oct 8, 2014·PloS One·Agnieszka DanekSzymon Grabowski
Oct 20, 2014·Nature Genetics·Neil I WeisenfeldDavid B Jaffe
Nov 11, 2014·Nature·Mark J P ChaissonEvan E Eichler
Nov 16, 2014·Bioinformatics·Shoshana MarcusMichael C Schatz
Dec 5, 2014·Genome Medicine·Gustavo GlusmanJared C Roach
Dec 9, 2014·Current Opinion in Microbiology·George VernikosHervé Tettelin
Jan 8, 2015·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Ngan NguyenBenedict Paten
Jan 15, 2015·Cancer Cell·Nicholas McGranahan, Charles Swanton
Jan 23, 2015·Bioinformatics·Sebastian DeorowiczAgnieszka Debudaj-Grabysz
Feb 7, 2015·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Murray PattersonAlexander Schönhuth
Feb 28, 2015·Genomics, Proteomics & Bioinformatics·Jingfa XiaoJun Yu
Apr 11, 2015·Frontiers in Microbiology·Richard J HallBas E Dutilh
Apr 29, 2015·Nature Genetics·Alexander DiltheyGil A McVean
May 1, 2015·BMC Genomics·Mohammed-Amin MadouiJean-Marc Aury
May 8, 2015·Nature Reviews. Genetics·Matthew W SnyderJay Shendure
May 23, 2015·Science·Jennifer R BrumMatthew B Sullivan
Jun 16, 2015·Nature Methods·Nicholas J LomanJared T Simpson
Jun 24, 2015·Bioinformatics·Ryan R WickKathryn E Holt
Jul 3, 2015·Genome Biology and Evolution·Brigitte BoeckmannQuest for Orthologs Species Tree Working Group
Jul 11, 2015·Nature·Lincoln D SteinJan O Korbel
Jul 11, 2015·Frontiers in Bioengineering and Biotechnology·Lorenzo TattiniAlberto Magi
Aug 6, 2015·Evolutionary Bioinformatics Online·Bojian ZhongDavid Penny
Sep 1, 2015·Bioinformatics·Yuri PirolaPaola Bonizzoni
Sep 4, 2015·BMC Bioinformatics·André HennigKay Nieselt
Mar 1, 2014·IEEE/ACM Transactions on Computational Biology and Bioinformatics·Jouni SirénVeli Mäkinen
Sep 19, 2015·PloS One·Pınar KavakMahmut Şamil Sağıroğlu
Oct 4, 2015·Nature·1000 Genomes Project ConsortiumGonçalo R Abecasis
Nov 10, 2015·Nature Methods·Ryan M LayerAaron R Quinlan
Nov 19, 2015·Nucleic Acids Research·Paul Julian KerseyDaniel M Staines
Jan 12, 2016·Biomolecular Detection and Quantification·T LaverD J Studholme
Feb 2, 2016·Nature Biotechnology·Grace X Y ZhengHanlee P Ji
Jun 17, 2016·BMC Bioinformatics·Antoine LimassetPierre Peterlongo
Aug 19, 2016·Nature·Monkol LekExome Aggregation Consortium
Aug 26, 2016·Nature Protocols·Mihaela PerteaSteven L Salzberg
Sep 3, 2016·Bioinformatics·Siavash SheikhizadehSandra Smit

Citations

Jan 18, 2018·BMC Genomics·Christine JandrasitsBernhard Y Renard
Sep 26, 2017·Nature Genetics·Hannes P EggertssonBjarni V Halldorsson
Mar 20, 2020·PLoS Computational Biology·Guillaume GautreauDavid Vallenet
Jun 10, 2017·Information Retrieval·Travis GagieJouni Sirén
Jan 8, 2019·F1000Research·Evan BiederstedtAlexander Dilthey
Oct 17, 2019·International Journal of Molecular Sciences·Ennys GheyoucheStéphane Téletchéa
Apr 1, 2017·Genome Research·Benedict PatenErik P Garrison
May 27, 2020·Genome Biology·Cristian GrozaGuillaume Bourque
Dec 19, 2018·BMC Genomics·Nathan S Watson-HaighUte Baumann
Dec 7, 2018·Frontiers in Plant Science·Maria KyriakidouMartina V Strömvik
Nov 27, 2020·Genome Research·Sanjida H RangwalaValerie A Schneider

Related Concepts

Computer Programs and Programming
Genome, Human
Computational Molecular Biology
Genomics
Awareness
Breeding
Genome
Joints
Research
Science of Virology

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Synthetic Genetic Array Analysis

Synthetic genetic arrays allow the systematic examination of genetic interactions. Here is the latest research focusing on synthetic genetic arrays and their analyses.

Congenital Hyperinsulinism

Congenital hyperinsulinism is caused by genetic mutations resulting in excess insulin secretion from beta cells of the pancreas. Here is the latest research.

Neural Activity: Imaging

Imaging of neural activity in vivo has developed rapidly recently with the advancement of fluorescence microscopy, including new applications using miniaturized microscopes (miniscopes). This feed follows the progress in this growing field.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Epigenetic Memory

Epigenetic memory refers to the heritable genetic changes that are not explained by the DNA sequence. Find the latest research on epigenetic memory here.

Cell Atlas of the Human Eye

Constructing a cell atlas of the human eye will require transcriptomic and histologic analysis over the lifespan. This understanding will aid in the study of development and disease. Find the latest research pertaining to the Cell Atlas of the Human Eye here.

Femoral Neoplasms

Femoral Neoplasms are bone tumors that arise in the femur. Discover the latest research on femoral neoplasms here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.