Systematic identification of functional orthologs based on protein network comparison

Genome Research
Sourav Bandyopadhyay, Trey Ideker

Abstract

Annotating protein function across species is an important task that is often complicated by the presence of large paralogous gene families. Here, we report a novel strategy for identifying functionally related proteins that supplements sequence-based comparisons with information on conserved protein-protein interactions. First, the protein interaction networks of two species are aligned by assigning proteins to sequence homology clusters using the Inparanoid algorithm. Next, probabilistic inference is performed on the aligned networks to identify pairs of proteins, one from each species, that are likely to retain the same function based on conservation of their interacting partners. Applying this method to Drosophila melanogaster and Saccharomyces cerevisiae, we analyze 121 cases for which functional orthology assignment is ambiguous when sequence similarity is used alone. In 61 of these cases, the network supports a different protein pair than that favored by sequence comparisons. These results suggest that network analysis can be used to provide a key source of information for refining sequence-based homology searches.
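The idea of ranking candidate protein pairs by the conservation of their interacting partners can be illustrated with a small sketch. It is not the probabilistic inference procedure the abstract refers to: it substitutes a plain count of conserved interactions, and every protein name, cluster label, and function name below is hypothetical. The homology cluster assignments are assumed to come from a prior sequence-based step such as Inparanoid.

```python
from itertools import product


def conserved_partner_score(a, b, net_a, net_b, cluster_of):
    """Count neighbor pairs (one from each species) that share a homology cluster.

    A simplified stand-in for probabilistic inference: each such pair is one
    interaction that is conserved between the two networks.
    """
    score = 0
    for na, nb in product(net_a.get(a, ()), net_b.get(b, ())):
        ca, cb = cluster_of.get(na), cluster_of.get(nb)
        if ca is not None and ca == cb:
            score += 1
    return score


def rank_candidates(members_a, members_b, net_a, net_b, cluster_of):
    """Score every cross-species pair within one ambiguous homology cluster."""
    scored = [
        (conserved_partner_score(a, b, net_a, net_b, cluster_of), a, b)
        for a, b in product(members_a, members_b)
    ]
    # Highest score first; ties broken by name so the order is deterministic.
    return sorted(scored, key=lambda t: (-t[0], t[1], t[2]))


# Hypothetical toy data: two yeast paralogs (Y1, Y2) and two fly paralogs
# (F1, F2) fall in the same sequence homology cluster, so sequence alone
# leaves the pairing ambiguous.
yeast_net = {"Y1": {"YA", "YB"}, "Y2": {"YC"}}
fly_net = {"F1": {"FA", "FB"}, "F2": {"FD"}}
cluster_of = {
    "YA": "CA", "FA": "CA",  # YA and FA are sequence homologs
    "YB": "CB", "FB": "CB",  # YB and FB are sequence homologs
    "YC": "CC",
    "FD": "CD",
}

ranked = rank_candidates(["Y1", "Y2"], ["F1", "F2"], yeast_net, fly_net, cluster_of)
for score, y, f in ranked:
    print(y, f, score)
```

In this toy case the pair (Y1, F1) has two conserved interactions and every other pairing has none, so the network evidence favors Y1 and F1 even if sequence similarity slightly favored another pair. A raw count ignores noise in the interaction data and dependencies between neighboring assignments, which is the motivation for a probabilistic treatment.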
