Path2Surv: Pathway/gene set-based survival analysis using multiple kernel learning

Bioinformatics
Onur DereliMehmet Gönen

Abstract

Survival analysis methods that integrate pathways/gene sets into their learning model could identify molecular mechanisms that determine survival characteristics of patients. Rather than first picking the predictive pathways/gene sets from a given collection and then training a predictive model on the subset of genomic features mapped to these selected pathways/gene sets, we developed a novel machine learning algorithm (Path2Surv) that conjointly performs these two steps using multiple kernel learning. We extensively tested our Path2Surv algorithm on 7655 patients from 20 cancer types using cancer-specific pathway/gene set collections and gene expression profiles of these patients. Path2Surv statistically significantly outperformed survival random forest (RF) on 12 out of 20 datasets and obtained comparable predictive performance against survival support vector machine (SVM) using significantly fewer gene expression features (i.e. less than 10% of what survival RF and survival SVM used). Our implementations of survival SVM and Path2Surv algorithms in R are available at https://github.com/mehmetgonen/path2surv together with the scripts that replicate the reported experiments. Supplementary data are available at Bioinformatics on...Continue Reading

Citations

Sep 8, 2004·Statistics in Medicine·Bart BakkerBert Kappen
Jan 24, 2007·Biometrical Journal. Biometrische Zeitschrift·Thomas A Gerds, Martin Schumacher
Jun 3, 2008·Bioinformatics·Ludger Evers, Claudia-Martina Messow
Oct 4, 2008·Nucleic Acids Research·Carl F SchaeferKenneth H Buetow
Nov 11, 2010·Bioinformatics·Vanya Van BelleJohan A K Suykens
Mar 4, 2011·European Journal of Human Genetics : EJHG·Herbert PangStéphane Minvielle
Aug 9, 2011·Artificial Intelligence in Medicine·Vanya Van BelleJohan A K Suykens
May 2, 2012·IEEE/ACM Transactions on Computational Biology and Bioinformatics·Herbert PangTiejun Tong
Mar 20, 2013·Statistics in Medicine·Ulla B Mogensen, Thomas A Gerds
Jun 2, 2014·Nature Biotechnology·James C CostelloGustavo Stolovitzky
Apr 25, 2015·IEEE Transactions on Neural Networks and Learning Systems·Farkhondeh KiaeeSamaneh Eftekhari Mahabadi
Jan 16, 2016·Cell Systems·Arthur LiberzonPablo Tamayo
Mar 19, 2016·Statistical Methods in Medical Research·Andrea LamontM Lee Van Horn
Apr 18, 2018·Statistics in Medicine·Jennifer A Sinnott, Tianxi Cai

Related Concepts

Genome
Genes
Survival Analysis
Rhizotomy Procedure
Gene Expression
Learning
Positive Regulation of Transcription, DNA-dependent
Script
Research Study

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Sexual Dimorphism in Neurodegeneration

There exist sex differences in neurodevelopmental and neurodegenerative disorders. For instance, multiple sclerosis is more common in women, whereas Parkinson’s disease is more common in men. Here is the latest research on sexual dimorphism in neurodegeneration

HLA Genetic Variation

HLA genetic variation has been found to confer risk for a wide variety of diseases. Identifying these associations and understanding their molecular mechanisms is ongoing and holds promise for the development of therapeutics. Find the latest research on HLA genetic variation here.

Super-resolution Microscopy

Super-resolution microscopy is the term commonly given to fluorescence microscopy techniques with resolutions that are not limited by the diffraction of light. Here are the latest discoveries pertaining to super-resolution microscopy.

Genetic Screens in iPSC-derived Brain Cells

Genetic screening is a critical tool that can be employed to define and understand gene function and interaction. This feed focuses on genetic screens conducted using induced pluripotent stem cell (iPSC)-derived brain cells.

Brain Lower Grade Glioma

Low grade gliomas in the brain form from oligodendrocytes and astrocytes and are the slowest-growing glioma in adults. Discover the latest research on these brain tumors here.

CD4/CD8 Signaling

Cluster of differentiation 4 and 8 (CD8 and CD8) are glycoproteins founds on the surface of immune cells. Here is the latest research on their role in cell signaling pathways.

Alignment-free Sequence Analysis Tools

Alignment-free sequence analyses have been applied to problems ranging from whole-genome phylogeny to the classification of protein families, identification of horizontally transferred genes, and detection of recombined sequences. Here is the latest research.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.