Ontology based text mining of gene-phenotype associations: application to candidate gene prediction

Database : the Journal of Biological Databases and Curation
Şenay Kafkas, Robert Hoehndorf

Abstract

Gene-phenotype associations play an important role in understanding the disease mechanisms which is a requirement for treatment development. A portion of gene-phenotype associations are observed mainly experimentally and made publicly available through several standard resources such as MGI. However, there is still a vast amount of gene-phenotype associations buried in the biomedical literature. Given the large amount of literature data, we need automated text mining tools to alleviate the burden in manual curation of gene-phenotype associations and to develop comprehensive resources. In this study, we present an ontology-based approach in combination with statistical methods to text mine gene-phenotype associations from the literature. Our method achieved AUC values of 0.90 and 0.75 in recovering known gene-phenotype associations from HPO and MGI respectively. We posit that candidate genes and their relevant diseases should be expressed with similar phenotypes in publications. Thus, we demonstrate the utility of our approach by predicting disease candidate genes based on the semantic similarities of phenotypes associated with genes and diseases. To the best of our knowledge, this is the first study using an ontology based appr...Continue Reading

References

Dec 29, 1999·Human Mutation·A HamoshV A McKusick
Oct 28, 2008·American Journal of Human Genetics·Peter N RobinsonStefan Mundlos
Aug 4, 2009·PLoS Computational Biology·Catia PesquitaFrancisco M Couto
Jan 7, 2010·Wiley Interdisciplinary Reviews. Systems Biology and Medicine·Cynthia L Smith, Janan T Eppig
Jul 9, 2011·Nucleic Acids Research·Robert HoehndorfGeorgios V Gkoutos
May 11, 2013·Database : the Journal of Biological Databases and Curation·Damian SmedleyChristopher Mungall
Oct 29, 2013·Genome Research·Peter N RobinsonDamian Smedley
Mar 24, 2016·Biochemical and Biophysical Research Communications·Yong Hoi LeeKuan Onn Tan
Aug 30, 2016·American Journal of Human Genetics·Damian SmedleyPeter N Robinson
Dec 3, 2016·Nucleic Acids Research·UNKNOWN The UniProt Consortium
Feb 15, 2017·Journal of Biomedical Semantics·Maxat Kulmanov, Robert Hoehndorf
Apr 8, 2017·Briefings in Bioinformatics·Georgios V GkoutosRobert Hoehndorf
Apr 18, 2017·PLoS Computational Biology·Imane BoudelliouaRobert Hoehndorf
Nov 2, 2017·Nucleic Acids Research·Cynthia L SmithUNKNOWN Mouse Genome Database Group
Nov 22, 2017·Nucleic Acids Research·Maria LevchenkoJohanna McEntyre
Dec 8, 2017·Journal of Biomedical Semantics·Maryam Khordad, Robert E Mercer
Feb 28, 2019·Database : the Journal of Biological Databases and Curation·Şenay Kafkas, Robert Hoehndorf
Oct 8, 2018·BioRxiv : the Preprint Server for Biology·Senay Kafkas, R. Hoehndorf

❮ Previous
Next ❯

Citations

Mar 16, 2019·Molecules : a Journal of Synthetic Chemistry and Natural Product Chemistry·Xiufang DongBeiwei Zhu
May 26, 2018·International Journal of Obesity : Journal of the International Association for the Study of Obesity·Yan ChenTibor V Varga
Feb 28, 2019·Database : the Journal of Biological Databases and Curation·Şenay Kafkas, Robert Hoehndorf
Aug 5, 2021·Scientific Reports·Guillermo Serrano NájeraDaniel J Crowther

❮ Previous
Next ❯

Methods Mentioned

BETA
phenotype-based prediction

Software Mentioned

WhatIzIt
PhenomeNET
ClinVar

Related Concepts

Related Feeds

Bioinformatics in Biomedicine

Bioinformatics in biomedicine incorporates computer science, biology, chemistry, medicine, mathematics and statistics. Discover the latest research on bioinformatics in biomedicine here.