EDeepSSP: Explainable deep neural networks for exact splice sites prediction.

Journal of Bioinformatics and Computational Biology
Santhosh Amilpur, Raju Bhukya

Abstract

Splice site prediction is crucial for understanding gene regulation and gene function, and for better genome annotation. Many computational methods exist for recognizing splice sites. Although most of these methods achieve a competent performance, their interpretability remains challenging. Moreover, all traditional machine learning methods rely on manually extracted features, which is a tedious job. To address these challenges, we propose a deep learning-based approach (EDeepSSP) that employs a convolutional neural network (CNN) architecture for automatic feature extraction and effectively predicts splice sites. Our model, EDeepSSP, opens up the opaque nature of CNNs by extracting significant motifs and explains why these motifs are vital for predicting splice sites. In this study, experiments were conducted on six benchmark acceptor and donor datasets from human, cress, and fly. The results show that EDeepSSP outperforms many state-of-the-art approaches. EDeepSSP achieves the highest area under the receiver operating characteristic curve (AUC_ROC) and area under the precision-recall curve (AUC_PR), 99.32% and 99.26% respectively, on human donor datasets. We also analyze various filter activities, feature activations, and ...
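To make the idea of convolutional filters acting as motif detectors concrete, here is a minimal sketch in plain Python. It is not the EDeepSSP architecture: the helper names (`one_hot`, `conv1d_scan`, `best_match`) and the hand-written `GT_FILTER` are hypothetical, chosen only for illustration. In a trained CNN the filter weights would be learned from data rather than written by hand. The sketch one-hot encodes a DNA sequence and slides a single filter along it, so that the position with the highest activation marks the best motif match.

```python
# Illustrative sketch only: a hand-written filter stands in for a learned CNN kernel.
BASES = "ACGT"

def one_hot(seq):
    """Encode a DNA string as a list of 4-element rows in A, C, G, T order."""
    return [[1.0 if base == b else 0.0 for b in BASES] for base in seq.upper()]

def conv1d_scan(encoded, kernel):
    """Slide a (width x 4) kernel along the sequence; return one score per window."""
    width = len(kernel)
    scores = []
    for start in range(len(encoded) - width + 1):
        score = 0.0
        for offset in range(width):
            for channel in range(4):
                score += encoded[start + offset][channel] * kernel[offset][channel]
        scores.append(score)
    return scores

# Hypothetical 2-wide filter that rewards the canonical donor dinucleotide "GT".
GT_FILTER = [
    [0.0, 0.0, 1.0, 0.0],  # window position 0: G
    [0.0, 0.0, 0.0, 1.0],  # window position 1: T
]

def best_match(seq, kernel):
    """Return (position, score) of the first strongest filter activation."""
    scores = conv1d_scan(one_hot(seq), kernel)
    position = max(range(len(scores)), key=scores.__getitem__)
    return position, scores[position]
```

Reading off the highest-scoring windows of each learned filter is one common way to turn CNN filters into candidate motifs. The abstract does not specify how EDeepSSP performs its motif extraction, so this should be read as a general illustration of the technique only.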
