Phylogenetic and functional assessment of orthologs inference projects and methods.

PLoS Computational Biology
Adrian M Altenhoff, Christophe Dessimoz

Abstract

Accurate genome-wide identification of orthologs is a central problem in comparative genomics, a fact reflected by the numerous orthology identification projects developed in recent years. However, only a few reports have compared their accuracy, and indeed, several recent efforts have not yet been systematically evaluated. Furthermore, orthology is typically only assessed in terms of function conservation, despite the phylogeny-based original definition of Fitch. We collected and mapped the results of nine leading orthology projects and methods (COG, KOG, Inparanoid, OrthoMCL, Ensembl Compara, Homologene, RoundUp, EggNOG, and OMA) and two standard methods (bidirectional best-hit and reciprocal smallest distance). We systematically compared their predictions with respect to both phylogeny and function, using six different tests. This required the mapping of millions of sequences, the handling of hundreds of millions of predicted pairs of orthologs, and the computation of tens of thousands of trees. In phylogenetic analysis or in functional analysis where high specificity is required, we find that OMA and Homologene perform best. At lower functional specificity but higher coverage level, OrthoMCL outperforms Ensembl Compara, and...
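
For readers unfamiliar with the bidirectional best-hit baseline named above, the idea can be sketched in a few lines of Python. This is only an illustrative sketch, not the procedure used in the study: the function names, the single symmetric score per gene pair, and the tie-breaking rule (first pair encountered wins) are all assumptions made for the example.

```python
from typing import Dict, Set, Tuple

# Hypothetical input: one similarity score per (gene in genome A, gene in genome B).
Scores = Dict[Tuple[str, str], float]


def best_hits(scores: Scores, query_side: int) -> Dict[str, str]:
    """Map each gene on the query side to its highest-scoring partner.

    query_side = 0 treats genome A genes as queries, 1 treats genome B genes
    as queries. Ties keep the first pair encountered (an arbitrary choice).
    """
    best: Dict[str, Tuple[str, float]] = {}
    for (a, b), score in scores.items():
        query, target = (a, b) if query_side == 0 else (b, a)
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return {query: target for query, (target, _) in best.items()}


def bidirectional_best_hits(scores: Scores) -> Set[Tuple[str, str]]:
    """Return pairs (a, b) where a's best hit is b and b's best hit is a."""
    forward = best_hits(scores, 0)  # genome A gene -> best genome B gene
    reverse = best_hits(scores, 1)  # genome B gene -> best genome A gene
    return {(a, b) for a, b in forward.items() if reverse.get(b) == a}
```

In practice the scores would come from an all-against-all sequence comparison, and the score of A against B need not equal that of B against A; the sketch ignores that asymmetry for brevity.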

References

Jun 1, 1970·Systematic Zoology·W M Fitch
Mar 25, 1981·Journal of Molecular Biology·T F Smith, M S Waterman
Oct 24, 1997·Science·R L Tatusov ... D J Lipman
Mar 17, 1999·Proceedings of the National Academy of Sciences of the United States of America·R Overbeek ... N Maltsev
Oct 26, 1999·Trends in Genetics : TIG·C Ouzounis
Dec 11, 1999·Nucleic Acids Research·A Bairoch
Apr 27, 2000·Trends in Genetics : TIG·W M Fitch
Jun 8, 2000·Bioinformatics·G H Gonnet ... L Bernardin
Sep 5, 2001·Genome Biology·R A Jensen
Dec 18, 2001·Journal of Molecular Biology·M Remm ... E L Sonnhammer
Feb 16, 2002·Science·Cristina Azevedo ... Paul Schulze-Lefert
Nov 20, 2002·Molecular Plant-microbe Interactions : MPMI·Candace Elliott ... Paul Schulze-Lefert
Sep 4, 2003·Genome Research·Li Li ... David S Roos
Sep 13, 2003·BMC Bioinformatics·Roman L Tatusov ... Darren A Natale
Dec 19, 2003·Nucleic Acids Research·David L Wheeler ... Lukas Wagner
Dec 19, 2003·Nucleic Acids Research·M A Harris ... Gene Ontology Consortium
Mar 23, 2004·Nucleic Acids Research·Robert C Edgar
Dec 14, 2004·Bioinformatics·D P Wall ... A E Hirsh
Oct 12, 2005·PLoS Computational Biology·Barbara E Engelhardt ... Steven E Brenner
Nov 1, 2005·Nucleic Acids Research·Richard A Notebaart ... Berend Snel
Nov 11, 2005·Molecular Biology and Evolution·Ben-Yang Liao, Jianzhi Zhang
Mar 3, 2006·Genome Research·Sourav Bandyopadhyay ... Trey Ideker
Apr 15, 2006·Genome Biology·Tim Hulsen ... Peter M A Groenen
Jun 17, 2006·Bioinformatics·Todd F Deluca ... Dennis P Wall
Jul 29, 2006·Bioinformatics·Andrey Alexeyenko ... Erik L L Sonnhammer
Aug 29, 2006·Nucleic Acids Research·Marshall Bern ... Eugenia Lyashenko
Dec 7, 2006·Nucleic Acids Research·T J P Hubbard ... E Birney
Dec 16, 2006·Nucleic Acids Research·David L Wheeler ... Eugene Yaschenko
Mar 10, 2007·BMC Bioinformatics·René T J M van der Heijden ... Martijn A Huynen
Oct 19, 2007·Nucleic Acids Research·Lars Juhl Jensen ... Peer Bork
Nov 15, 2007·Nucleic Acids Research·P Flicek ... S Searle
Nov 30, 2007·Nucleic Acids Research·David L Wheeler ... Eugene Yaschenko
Dec 7, 2007·Nucleic Acids Research·Ann-Charlotte Berglund ... Erik L L Sonnhammer
Dec 6, 2008·BMC Bioinformatics·Alexander C J Roth ... Christophe Dessimoz

Methods Mentioned

chips

Software Mentioned

HPC
Ensembl Compara
OMA
FASTA
OMA Pairwise
EggNOG
Human-Worm
RAxML
COG
Inparanoid
