Abstract
A complete repository of gene-gene interactions is key for understanding cellular processes, human disease and drug response. These gene-gene interactions include both protein-protein interactions and transcription factor interactions. The majority of known interactions are found in the biomedical literature. Interaction databases, such as BioGRID and ChEA, annotate these gene-gene interactions; however, curation becomes difficult as the literature grows exponentially. DeepDive is a trained system for extracting information from a variety of sources, including text. In this work, we used DeepDive to extract both protein-protein and transcription factor interactions from over 100,000 full-text PLOS articles. We built an extractor for gene-gene interactions that identified candidate gene-gene relations within an input sentence. For each candidate relation, DeepDive computed a probability that the relation was a correct interaction. We evaluated this system against the Database of Interacting Proteins and against randomly curated extractions. Our system achieved 76% precision and 49% recall in extracting direct and indirect interactions involving gene symbols co-occurring in a sentence. For randomly curated extractions, the system...Continue Reading
References
Dec 19, 2003·Nucleic Acids Research·Lukasz SalwinskiDavid Eisenberg
Jul 1, 2004·Nature Genetics·Robert Hoffmann, Alfonso Valencia
Nov 8, 2008·Nucleic Acids Research·T S Keshava PrasadAkhilesh Pandey
Feb 24, 2009·PloS One·Min HeWei Li
Jun 17, 2010·BMC Bioinformatics·Shao-Wu ZhangQuan Pan
Jul 10, 2010·PLoS Computational Biology·Domonkos TikkUlf Leser
Aug 12, 2010·Scientometrics·Peder Olesen Larsen, Markus von Ins
Aug 17, 2010·Bioinformatics·Alexander LachmannAvi Ma'ayan
Dec 27, 2011·Bioinformatics·Sun KimW John Wilbur
Mar 1, 2012·BMC Bioinformatics·Geoffrey KohDong-Yup Lee
Jul 25, 2012·BMC Bioinformatics·Jan CzarneckiAdrian J Shepherd
Dec 4, 2012·Nucleic Acids Research·Andrea FranceschiniLars J Jensen
Jan 18, 2013·Database : the Journal of Biological Databases and Curation·Kalpana RajaJeyakumar Natarajan
Mar 19, 2013·Cell·Tong Ihn Lee, Richard A Young
Jun 5, 2013·The International Journal of Medical Robotics + Computer Assisted Surgery : MRCAS·Akram I OmaraZhijian Song
Nov 12, 2013·Nucleic Acids Research·Philipp BlohmDmitrij Frishman
Jul 19, 2014·PloS One·Changqin QuanFuji Ren
Nov 28, 2014·Nucleic Acids Research·Andrew Chatr-AryamontriMike Tyers
Dec 2, 2014·PloS One·Shanan E PetersChristopher Ré
Dec 3, 2014·Methods : a Companion to Methods in Enzymology·Nikolas PapanikolaouIoannis Iliopoulos
Citations
Jan 24, 2019·Human Genetics·Jia XuBaiju Parikh
Sep 29, 2020·Frontiers in Cell and Developmental Biology·Nadeesha PereraFrank Emmert-Streib
Mar 24, 2017·International Journal of Genomics·Kalpana RajaLam C Tsoi
Sep 30, 2018·Proceedings of the National Academy of Sciences of the United States of America·Byung-Kwon ChoiOlivier Lichtarge
Jul 9, 2020·Computational and Structural Biotechnology Journal·David N Nicholson, Casey S Greene
Dec 21, 2019·Journal of Biomedical Informatics·Saman FarahmandKourosh Zarringhalam
Aug 16, 2019·Emerging Topics in Life Sciences·J Harry Caufield, Peipei Ping
Jul 3, 2021·Proteomes·Jagajjit Sahu