A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data

Scientific Reports
Qiongshi Lu, ..., Hongyu Zhao


Identifying functional regions in the human genome is a major goal in human genetics. Considerable effort has gone into functionally annotating the human genome, either through computational predictions, such as genomic conservation, or through high-throughput experiments, such as the ENCODE project. These efforts have produced a rich collection of diverse functional annotation data that need to be analyzed jointly for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning on 22 computational and experimental annotations, thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. Its ability to predict functional regions, together with its generalizable statistical framework, makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu.
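The abstract does not spell out the statistical model, so the following is only an illustrative sketch of what unsupervised learning over per-position annotations could look like, not GenoCanyon's actual implementation. It assumes binary annotations that are conditionally independent given a hidden functional/non-functional state, and fits a two-component mixture by expectation-maximization (EM). The function name `fit_two_class_mixture`, the starting values, and the convention that the signal-rich component is the "functional" one are all assumptions made for illustration.

```python
import math


def _clip(p, lo=1e-6, hi=1 - 1e-6):
    """Keep a probability strictly inside (0, 1) so logs stay finite."""
    return min(max(p, lo), hi)


def fit_two_class_mixture(X, n_iter=50):
    """EM for a two-component mixture of independent Bernoulli annotations.

    X is a list of rows, one per genomic position; each row holds 0/1 values,
    one per annotation. Returns (prior, theta, post), where post[i] is the
    estimated probability that position i belongs to component 1.
    Component 1 is initialized with higher annotation rates, so it plays the
    role of the hypothetical "functional" class (an assumption of this sketch).
    """
    n, d = len(X), len(X[0])
    prior = 0.5                      # P(component 1)
    theta = [[0.3] * d, [0.7] * d]   # theta[k][j] = P(annotation j = 1 | k)
    post = [0.5] * n
    for _ in range(n_iter):
        # E-step: posterior probability of component 1 for every position.
        for i, row in enumerate(X):
            log_p = [math.log(1 - prior), math.log(prior)]
            for k in (0, 1):
                for j, x in enumerate(row):
                    p = theta[k][j]
                    log_p[k] += math.log(p if x else 1 - p)
            m = max(log_p)           # subtract the max for numerical stability
            w0 = math.exp(log_p[0] - m)
            w1 = math.exp(log_p[1] - m)
            post[i] = w1 / (w0 + w1)
        # M-step: re-estimate the prior and per-annotation rates.
        s1 = sum(post)
        s0 = n - s1
        prior = _clip(s1 / n)
        for j in range(d):
            on1 = sum(post[i] for i in range(n) if X[i][j])
            on0 = sum(1 - post[i] for i in range(n) if X[i][j])
            theta[1][j] = _clip(on1 / max(s1, 1e-12))
            theta[0][j] = _clip(on0 / max(s0, 1e-12))
    return prior, theta, post
```

Under these assumptions, calling `fit_two_class_mixture(X)` on a matrix with one row per position and 22 columns returns a per-position posterior that could be read as a functional-potential score between 0 and 1. Real annotations are often continuous and correlated, which this sketch ignores.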





