Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities.

GigaScience
Zhen-Hao GuoZhan-Heng Chen

Abstract

The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems. We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on this, we propose Bioentity2vec, a new method for representing bioentities, which integrates information about the attributes and behaviors of a bioentity. Applying the random forest classifier, we achieved promising performance on 18 relationships, with an area under the curve of 0.9608 and an area under the precision-recall curve of 0.9572. Our study shows that constructing a network with rich topological and biological information is important for systematic understanding of the biological landscape at the molecular level. Our results show that Bioentity2vec can effectively represent biological entities and provides easily distinguishable information about classification tasks. Our method is also able to simultaneously predict relationships between single types and mult...Continue Reading

References

Jul 1, 1998·Annual Review of Biophysics and Biomolecular Structure·P B Moore
Aug 26, 2000·Current Opinion in Chemical Biology·R P Hertzberg, A J Pope
Dec 23, 2000·Science·J B TenenbaumJ C Langford
Dec 23, 2000·Science·S T Roweis, L K Saul
Dec 26, 2001·Nucleic Acids Research·Micheal HewettTeri E Klein
Jan 22, 2004·Nature Reviews. Genetics·Albert-László Barabási, Zoltán N Oltvai
Aug 3, 2004·Nature Reviews. Drug Discovery·Ted T Ashburn, Karl B Thor
Dec 2, 2004·Nature Reviews. Molecular Cell Biology·Bin TianMichael B Mathews
Aug 2, 2005·Trends in Biochemical Sciences·Juan MataJürg Bähler
Mar 16, 2007·Proceedings of the National Academy of Sciences of the United States of America·Juwen ShenHualiang Jiang
Apr 30, 2010·Journal of Chemical Information and Modeling·David Rogers, Mathew Hahn
Nov 24, 2012·Nucleic Acids Research·Geng ChenQinghua Cui
Nov 13, 2013·Nucleic Acids Research·Jiao YuanRunsheng Chen
Sep 23, 2014·RNA·Petar GlažarNikolaus Rajewsky
Feb 24, 2015·Artificial Intelligence in Medicine·Víctor MartínezArmando Blanco
Nov 19, 2015·Nucleic Acids Research·Anindya Bhattacharya, Yan Cui
Feb 18, 2016·Briefings in Bioinformatics·Wei MaQinghua Cui
Dec 3, 2016·Nucleic Acids Research·UNKNOWN NCBI Resource Coordinators
Jan 17, 2017·IEEE/ACM Transactions on Computational Biology and Bioinformatics·Jian-Qiang LiXing Chen
Feb 9, 2017·Oncotarget·Jian-Qiang LiZhu-Hong You
Apr 28, 2017·Bioinformatics·Mona AlshahraniRobert Hoehndorf
Nov 11, 2017·Nucleic Acids Research·David S WishartMichael Wilson
Nov 11, 2017·Nucleic Acids Research·Chih-Hung ChouHsien-Da Huang
Nov 16, 2017·Nucleic Acids Research·ShuangSang FangYi Zhao
May 10, 2018·Database : the Journal of Biological Databases and Curation·Chunyan FanFang-Xiang Wu
May 18, 2018·BioMed Research International·Hui CuiLu Xie
Sep 25, 2018·Nucleic Acids Research·Allan Peter DavisCarolyn J Mattingly
Oct 5, 2018·Nucleic Acids Research·Zhenyu BaoDong Dong
Oct 27, 2018·Nucleic Acids Research·Zhou HuangQinghua Cui
Oct 30, 2018·Nucleic Acids Research·Zhan TongYuan Zhou
Nov 1, 2018·Nucleic Acids Research·Liang ChengQinghua Jiang
Nov 14, 2018·Nucleic Acids Research·Ana KozomaraSam Griffiths-Jones
May 23, 2019·Bioinformatics·Xiangxiang ZengFeixiong Cheng

❮ Previous
Next ❯

Citations


❮ Previous
Next ❯

Key Resources (RRID) Mentioned

SCR_005223
SCR_006472
SCR_003152
SCR_007822
SCR_002700
SCR_014274

Software Mentioned

DeepWalk
mathop
Python
Bioentity2vec
Windows
MAN
Simplified
SkipGram
gensim
RDKit

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.