Supervised learning is an accurate method for network-based gene classification

Bioinformatics
R. Liu, Arjun Krishnan

Abstract

Assigning every human gene to specific functions, diseases and traits is a grand challenge in modern genetics. Key to addressing this challenge are computational methods, such as supervised learning and label propagation, that can leverage molecular interaction networks to predict gene attributes. In spite of being a popular machine-learning technique across fields, supervised learning has been applied only in a few network-based studies for predicting pathway-, phenotype- or disease-associated genes. It is unknown how supervised learning broadly performs across different networks and diverse gene classification tasks, and how it compares to label propagation, the widely benchmarked canonical approach for this problem. In this study, we present a comprehensive benchmarking of supervised learning for network-based gene classification, evaluating this approach and a classic label propagation technique on hundreds of diverse prediction tasks and multiple networks using stringent evaluation schemes. We demonstrate that supervised learning on a gene's full network connectivity outperforms label propagation and achieves high prediction accuracy by efficiently capturing local network properties, rivaling label propagation's appeal for ...
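To make the two approaches named in the abstract concrete, the following is a minimal sketch on a hypothetical eight-gene toy network. It is not the authors' implementation. Several things here are assumptions for illustration only: label propagation is shown as a random walk with restart, supervised learning is shown as L2-regularized logistic regression using each gene's row of the adjacency matrix as its feature vector, and the network, labels, and all parameter values (`restart`, `lr`, `l2`, `epochs`) are made up.

```python
import math

# Hypothetical toy network: two 4-gene cliques (0-3 and 4-7) joined by edge 3-4.
N_GENES = 8
EDGES = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3),
         (4, 5), (4, 6), (4, 7), (5, 6), (5, 7), (6, 7),
         (3, 4)]


def adjacency(edges, n):
    """Build a symmetric 0/1 adjacency matrix as a list of rows."""
    A = [[0.0] * n for _ in range(n)]
    for i, j in edges:
        A[i][j] = A[j][i] = 1.0
    return A


def label_propagation(A, seeds, restart=0.5, iters=200):
    """Random walk with restart from the seed genes (restart value is arbitrary)."""
    n = len(A)
    degree = [sum(row) for row in A]
    p0 = [1.0 / len(seeds) if i in seeds else 0.0 for i in range(n)]
    p = p0[:]
    for _ in range(iters):
        # Each gene keeps part of its seed mass and receives the rest from neighbors.
        p = [restart * p0[i]
             + (1.0 - restart) * sum(A[i][j] * p[j] / degree[j] for j in range(n))
             for i in range(n)]
    return p


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(X, y, lr=0.5, l2=0.01, epochs=500):
    """L2-regularized logistic regression fitted by batch gradient descent."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        grad_w, grad_b = [l2 * wk for wk in w], 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wk * xk for wk, xk in zip(w, xi))) - yi
            grad_b += err / len(X)
            for k, xk in enumerate(xi):
                grad_w[k] += err * xk / len(X)
        w = [wk - lr * gk for wk, gk in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict(w, b, x):
    return sigmoid(b + sum(wk * xk for wk, xk in zip(w, x)))


A = adjacency(EDGES, N_GENES)
positives, negatives = [0, 1, 3], [4, 5, 7]   # genes 2 and 6 are held out

# Label propagation: diffuse scores outward from the known positive genes.
lp_scores = label_propagation(A, set(positives))

# Supervised learning: each gene's adjacency row is its feature vector.
train = positives + negatives
w, b = train_logistic([A[g] for g in train], [1.0] * 3 + [0.0] * 3)
sl_scores = {g: predict(w, b, A[g]) for g in (2, 6)}

print("LP scores for held-out genes 2 and 6:", lp_scores[2], lp_scores[6])
print("SL scores for held-out genes 2 and 6:", sl_scores[2], sl_scores[6])
```

In this toy setup both methods rank held-out gene 2, which sits in the same clique as the positives, above held-out gene 6. The sketch only shows how the two methods use the network differently (diffusing labels over edges versus learning weights on connectivity features). It says nothing about their relative accuracy, which is what the study benchmarks.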

Citations

Mar 25, 2021·Bioinformatics·Renming Liu, Arjun Krishnan

Software Mentioned

LP
BeFree
node2vec
TN
InBioMap
Zenodo
auROC
SL
GitHub
STRING
