Jun 16, 2015

A method to predict the impact of regulatory variants from DNA sequence

Nature Genetics
Dongwon LeeMichael A Beer

Abstract

Most variants implicated in common human disease by genome-wide association studies (GWAS) lie in noncoding sequence intervals. Despite the suggestion that regulatory element disruption represents a common theme, identifying causal risk variants within implicated genomic regions remains a major challenge. Here we present a new sequence-based computational method to predict the effect of regulatory variation, using a classifier (gkm-SVM) that encodes cell type-specific regulatory sequence vocabularies. The induced change in the gkm-SVM score, deltaSVM, quantifies the effect of variants. We show that deltaSVM accurately predicts the impact of SNPs on DNase I sensitivity in their native genomic contexts and accurately predicts the results of dense mutagenesis of several enhancers in reporter assays. Previously validated GWAS SNPs yield large deltaSVM scores, and we predict new risk-conferring SNPs for several autoimmune diseases. Thus, deltaSVM provides a powerful computational approach to systematically identify functional regulatory variants.

  • References44
  • Citations79
  • References44
  • Citations79

Citations

Mentioned in this Paper

Genome-Wide Association Study
Support Vector Machines
Quantitative Trait Loci
Deoxyribonuclease I
Classification
Genome
Genomics
Autoimmune Diseases
Deoxyribonuclease I Activity
Cell Type

Related Feeds

Autoimmune Diseases

Autoimmune diseases occur as a result of an attack by the immune system on the body’s own tissues resulting in damage and dysfunction. There are different types of autoimmune diseases, in which there is a complex and unknown interaction between genetics and the environment. Discover the latest research on autoimmune diseases here.

© 2020 Meta ULC. All rights reserved