Detection of high variability in gene expression from single-cell RNA-seq profiling

BMC Genomics
Hung-I Harry ChenYidong Chen

Abstract

The advancement of the next-generation sequencing technology enables mapping gene expression at the single-cell level, capable of tracking cell heterogeneity and determination of cell subpopulations using single-cell RNA sequencing (scRNA-seq). Unlike the objectives of conventional RNA-seq where differential expression analysis is the integral component, the most important goal of scRNA-seq is to identify highly variable genes across a population of cells, to account for the discrete nature of single-cell gene expression and uniqueness of sequencing library preparation protocol for single-cell sequencing. However, there is lack of generic expression variation model for different scRNA-seq data sets. Hence, the objective of this study is to develop a gene expression variation model (GEVM), utilizing the relationship between coefficient of variation (CV) and average expression level to address the over-dispersion of single-cell data, and its corresponding statistical significance to quantify the variably expressed genes (VEGs). We have built a simulation framework that generated scRNA-seq data with different number of cells, model parameters, and variation levels. We implemented our GEVM and demonstrated the robustness by using a...Continue Reading

References

Sep 23, 2008·Nature Methods·Arjun RajSanjay Tyagi
Apr 8, 2009·Nature Methods·Fuchou TangM Azim Surani
Jun 16, 2009·Nature Methods·Kiyomi TaniguchiHideki Kambara
Oct 29, 2010·Genome Biology·Simon Anders, Wolfgang Huber
Apr 7, 2012·Nature Protocols·Veronica Sanchez-FreireJoseph C Wu
Jul 24, 2012·Nature Biotechnology·Daniel RamsköldRickard Sandberg
Sep 4, 2012·Cell Reports·Tamar HashimshonyItai Yanai
Oct 20, 2012·Methods : a Companion to Methods in Enzymology·Kenneth J LivakChris C Holmes
Jan 31, 2013·Genome Biology·Jong Kyoung Kim, John C Marioni
Dec 24, 2013·Nature Methods·Saiful IslamSten Linnarsson
May 20, 2014·Nature Methods·Peter V KharchenkoDavid T Scadden
Jul 24, 2014·Nucleic Acids Research·Antoine-Emmanuel SalibaJörg Vogel
Oct 10, 2014·Nature Biotechnology·Aaron M Streets, Yanyi Huang
Jan 30, 2015·Nature Reviews. Genetics·Oliver StegleJohn C Marioni

Citations

Mar 14, 2019·Bioinformatics·Ghislain DurifFranck Picard
Feb 27, 2018·Briefings in Bioinformatics·Shun H YipJunwen Wang
Dec 22, 2019·Cells·Daniel OsorioJames J Cai
Oct 12, 2019·Cytometry. Part a : the Journal of the International Society for Analytical Cytology·Diana Ordoñez-RuedaMalte Paulsen
Apr 24, 2020·Bioinformatics·Oscar Franzén, Johan L M Björkegren
Dec 24, 2018·BMC Systems Biology·Hung-I Harry ChenYidong Chen
Sep 1, 2019·Cells·Sne Morag, Mali Salmon-Divon
Oct 28, 2018·BMC Genomics·Marmar Moussa, Ion I Măndoiu

Related Concepts

RNA
Computer Programs and Programming
MRNA Differential Display
Single-Cell Analysis
High-Throughput Nucleotide Sequencing
Clinical Protocols
Gene Expression
Genes
Plant Roots
Simulation

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Synthetic Genetic Array Analysis

Synthetic genetic arrays allow the systematic examination of genetic interactions. Here is the latest research focusing on synthetic genetic arrays and their analyses.

Neural Activity: Imaging

Imaging of neural activity in vivo has developed rapidly recently with the advancement of fluorescence microscopy, including new applications using miniaturized microscopes (miniscopes). This feed follows the progress in this growing field.

Computational Methods for Protein Structures

Computational methods employing machine learning algorithms are powerful tools that can be used to predict the effect of mutations on protein structure. This is important in neurodegenerative disorders, where some mutations can cause the formation of toxic protein aggregations. This feed follows the latests insights into the relationships between mutation and protein structure leading to better understanding of disease.

Congenital Hyperinsulinism

Congenital hyperinsulinism is caused by genetic mutations resulting in excess insulin secretion from beta cells of the pancreas. Here is the latest research.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Epigenetic Memory

Epigenetic memory refers to the heritable genetic changes that are not explained by the DNA sequence. Find the latest research on epigenetic memory here.

Cell Atlas of the Human Eye

Constructing a cell atlas of the human eye will require transcriptomic and histologic analysis over the lifespan. This understanding will aid in the study of development and disease. Find the latest research pertaining to the Cell Atlas of the Human Eye here.

Femoral Neoplasms

Femoral Neoplasms are bone tumors that arise in the femur. Discover the latest research on femoral neoplasms here.