LBoost: A boosting algorithm with application for epistasis discovery

PLoS One
Bethany J Wolf, Emily Kistner-Griffin

Abstract

Many human diseases are attributable to complex interactions among genetic and environmental factors. Statistical tools capable of modeling such complex interactions are necessary to improve identification of genetic factors that increase a patient's risk of disease. Logic Forest (LF), a bagging ensemble algorithm based on logic regression (LR), is able to discover interactions among binary variables predictive of response such as the biologic interactions that predispose individuals to disease. However, LF's ability to recover interactions degrades for more infrequently occurring interactions. A rare genetic interaction may occur if, for example, the interaction increases disease risk in a patient subpopulation that represents only a small proportion of the overall patient population. We present an alternative ensemble adaptation of LR based on boosting rather than bagging called LBoost. We compare the ability of LBoost and LF to identify variable interactions in simulation studies. Results indicate that LBoost is superior to LF for identifying genetic interactions associated with disease that are infrequent in the population. We apply LBoost to a subset of single nucleotide polymorphisms on the PRDX genes from the Cancer Gene...
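To make the bagging-versus-boosting distinction concrete, the following is a minimal sketch of how boosting can surface a second interaction among binary variables by reweighting the cases the first one misses. It is not the authors' LBoost: it uses plain discrete AdaBoost, and it swaps logic regression trees for a much simpler, hypothetical weak learner (the best single literal or two-literal conjunction). The function names (`term_predict`, `candidate_terms`, `boost_conjunctions`) and the synthetic data are assumptions made only for illustration.

```python
import math
from itertools import combinations, product


def term_predict(term, row):
    """Return +1 if every (index, value) literal in the term holds for the row, else -1."""
    return 1 if all(row[j] == v for j, v in term) else -1


def candidate_terms(n_vars):
    """Hypothetical weak-learner pool: single literals and two-literal conjunctions."""
    singles = [((j, v),) for j in range(n_vars) for v in (1, 0)]
    pairs = [((j, vj), (k, vk))
             for j, k in combinations(range(n_vars), 2)
             for vj, vk in product((1, 0), repeat=2)]
    return singles + pairs


def boost_conjunctions(X, y, n_rounds):
    """Discrete AdaBoost over conjunction terms; returns a list of (term, alpha)."""
    n = len(X)
    signs = [1 if label == 1 else -1 for label in y]
    weights = [1.0 / n] * n
    terms = candidate_terms(len(X[0]))
    ensemble = []
    for _ in range(n_rounds):
        # Pick the term with the smallest weighted misclassification error.
        best_term, best_err = None, float("inf")
        for term in terms:
            err = sum(w for w, row, s in zip(weights, X, signs)
                      if term_predict(term, row) != s)
            if err < best_err:
                best_term, best_err = term, err
        # Clamp the error so the log below stays finite for a perfect term.
        eps = min(max(best_err, 1e-12), 1 - 1e-12)
        alpha = 0.5 * math.log((1 - eps) / eps)
        ensemble.append((best_term, alpha))
        # Up-weight misclassified rows, down-weight correct ones, renormalize.
        weights = [w * math.exp(-alpha * s * term_predict(best_term, row))
                   for w, row, s in zip(weights, X, signs)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble


# Synthetic toy data (an assumption, not from the paper): all 16 combinations
# of four binary variables, with response driven by two separate interactions.
X = [list(bits) for bits in product((0, 1), repeat=4)]
y = [1 if (r[0] and r[1]) or (r[2] and r[3]) else 0 for r in X]

ensemble = boost_conjunctions(X, y, n_rounds=2)
selected = {term for term, _ in ensemble}
for term, alpha in ensemble:
    print(term, round(alpha, 3))
```

In this toy setting, the first round selects one of the two true conjunctions, and the reweighting step concentrates weight on the cases explained only by the other, so the second round selects it. A bagging ensemble has no such reweighting step; each member is fit to an independent bootstrap sample.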
