The bench scientist's guide to statistical analysis of RNA-Seq data

BMC Research Notes
Craig R YendrekJyothi Thimmapuram

Abstract

RNA sequencing (RNA-Seq) is emerging as a highly accurate method to quantify transcript abundance. However, analyses of the large data sets obtained by sequencing the entire transcriptome of organisms have generally been performed by bioinformatics specialists. Here we provide a step-by-step guide and outline a strategy using currently available statistical tools that results in a conservative list of differentially expressed genes. We also discuss potential sources of error in RNA-Seq analysis that could alter interpretation of global changes in gene expression. When comparing statistical tools, the negative binomial distribution-based methods, edgeR and DESeq, respectively identified 11,995 and 11,317 differentially expressed genes from an RNA-seq dataset generated from soybean leaf tissue grown in elevated O3. However, the number of genes in common between these two methods was only 10,535, resulting in 2,242 genes determined to be differentially expressed by only one method. Upon analysis of the non-significant genes, several limitations of these analytic tools were revealed, including evidence for overly stringent parameters for determining statistical significance of differentially expressed genes as well as increased typ...Continue Reading

References

Mar 5, 2004·The Plant Journal : for Cell and Molecular Biology·Oliver ThimmMark Stitt
May 2, 2006·Statistical Applications in Genetics and Molecular Biology·Gordon K Smyth
Aug 31, 2007·Biostatistics·Mark D Robinson, Gordon K Smyth
Sep 21, 2007·Bioinformatics·Mark D Robinson, Gordon K Smyth
Jun 3, 2008·Nature Methods·Ali MortazaviBarbara Wold
Jun 3, 2008·Nature Methods·Nicole CloonanSean M Grimmond
Nov 19, 2008·Nature Reviews. Genetics·Zhong WangMichael Snyder
Dec 18, 2008·Proceedings of the National Academy of Sciences of the United States of America·Hairi LiGene W Yeo
Feb 11, 2009·Proceedings of the National Academy of Sciences of the United States of America·Andrew D B LeakeyDonald R Ort
Feb 25, 2009·Nucleic Acids Research·J M RuijterA F M Moorman
Jan 16, 2010·Nature·Jeremy SchmutzScott A Jackson
May 5, 2010·Genetics·Paul L Auer, R W Doerge
May 13, 2010·Briefings in Bioinformatics·Heng Li, Nils Homer
May 19, 2010·Animal Genetics·M Pérez-Enciso, L Ferretti
Oct 5, 2010·Current Opinion in Microbiology·Nicholas J Croucher, Nicholas R Thomson
Oct 29, 2010·Genome Biology·Simon Anders, Wolfgang Huber
Dec 24, 2010·Journal of Neurochemistry·Greg T SutherlandJillian J Kril
May 31, 2011·Nature Methods·Manuel GarberCole Trapnell
Aug 23, 2011·Bioinformatics·Matthew RuffaloMehmet Koyutürk
Sep 8, 2011·Bioinformatics·Wenming ZhaoSongnian Hu
Oct 7, 2011·Bioinformatics·Eleanor A HoweJohn Quackenbush
Oct 15, 2011·PloS One·Matthias Zytnicki, Hadi Quesneville
Mar 13, 2012·Annual Review of Plant Biology·Elizabeth A AinsworthLisa D Emberson

Citations

Jun 25, 2013·Methods : a Companion to Methods in Enzymology·Nirav M AminFrank L Conlon
Mar 5, 2013·International Journal of Molecular Sciences·Vicent ArbonaAurelio Gómez-Cadenas
Oct 4, 2017·Human Molecular Genetics·Stephanie E SiegmundEric A Schon
Aug 19, 2017·Scientific Reports·John A HayesChristopher A Del Negro
May 8, 2018·BMC Genomics·Tatsuhiro NomaguchiTsuyoshi Tanaka

Related Concepts

Survey Personnel
RNA
Transcription, Genetic
Sequence Determinations, RNA
Reverse Transcriptase Polymerase Chain Reaction
Gene Expression
Genes
Ozone
RNA
Statistical Analysis

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Systemic Juvenile Idiopathic Arthritis

Systemic juvenile idiopathic arthritis is a rare rheumatic disease that affects children. Symptoms include joint pain, but also fevers and skin rashes. Here is the latest on this disease.

Chromatin Regulation and Circadian Clocks

The circadian clock plays an important role in regulating transcriptional dynamics through changes in chromatin folding and remodelling. Discover the latest research on Chromatin Regulation and Circadian Clocks here.

Central Pontine Myelinolysis

Central Pontine Myelinolysis is a neurologic disorder caused most frequently by rapid correction of hyponatremia and is characterized by demyelination that affects the central portion of the base of the pons. Here is the latest research on this disease.

Myocardial Stunning

Myocardial stunning is a mechanical dysfunction that persists after reperfusion of previously ischemic tissue in the absence of irreversible damage including myocardial necrosis. Here is the latest research.

Pontocerebellar Hypoplasia

Pontocerebellar hypoplasias are a group of neurodegenerative autosomal recessive disorders with prenatal onset, atrophy or hypoplasia of the cerebellum, hypoplasia of the ventral pons, microcephaly, variable neocortical atrophy and severe mental and motor impairments. Here is the latest research on pontocerebellar hypoplasia.

Cell Atlas Along the Gut-Brain Axis

Profiling cells along the gut-brain axis at the single cell level will provide unique information for each cell type, a three-dimensional map of how cell types work together to form tissues, and insights into how changes in the map underlie health and disease of the GI system and its crosstalk with the brain. Disocver the latest research on single cell analysis of the gut-brain axis here.

Chronic Traumatic Encephalopathy

Chronic Traumatic Encephalopathy (CTE) is a progressive degenerative disease that occurs in individuals that suffer repetitive brain trauma. Discover the latest research on traumatic encephalopathy here.