Bias, robustness and scalability in single-cell differential expression analysis

Nature Methods
Charlotte Soneson, Mark D Robinson

Abstract

Many methods have been used to determine differential gene expression from single-cell RNA (scRNA)-seq data. We evaluated 36 approaches using experimental and synthetic data and found considerable differences in the number and characteristics of the genes that are called differentially expressed. Prefiltering of lowly expressed genes has important effects, particularly for some of the methods developed for bulk RNA-seq data analysis. However, we found that bulk RNA-seq analysis methods do not generally perform worse than those developed specifically for scRNA-seq. We also present conquer, a repository of consistently processed, analysis-ready public scRNA-seq data sets that is aimed at simplifying method evaluation and reanalysis of published results. Each data set provides abundance estimates for both genes and transcripts, as well as quality control and exploratory analysis reports.
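As a rough illustration of what prefiltering lowly expressed genes can look like in practice, here is a minimal Python sketch. The function name `prefilter_genes`, both threshold values, and the toy data are assumptions made for illustration; they are not taken from the abstract and should not be read as the authors' procedure.

```python
from typing import Dict, List


def prefilter_genes(
    expr: Dict[str, List[float]],
    min_expr: float = 1.0,
    min_cell_fraction: float = 0.25,
) -> Dict[str, List[float]]:
    """Keep genes whose expression exceeds `min_expr` in more than
    `min_cell_fraction` of the cells.

    `expr` maps a gene name to its per-cell expression values.
    Both thresholds are illustrative assumptions, not recommended defaults.
    """
    kept = {}
    for gene, values in expr.items():
        if not values:
            continue  # no cells measured for this gene: nothing to keep
        n_expressing = sum(1 for v in values if v > min_expr)
        if n_expressing / len(values) > min_cell_fraction:
            kept[gene] = values
    return kept


# Toy data (hypothetical): three genes measured in four cells.
toy = {
    "geneA": [5.0, 0.0, 3.2, 7.1],  # above threshold in 3 of 4 cells
    "geneB": [0.0, 0.0, 0.0, 1.5],  # above threshold in 1 of 4 cells
    "geneC": [0.0, 0.0, 0.0, 0.0],  # never expressed
}
filtered = prefilter_genes(toy)
print(sorted(filtered))
```

Since the abstract reports that prefiltering has important effects on some methods, any such thresholds are best treated as analysis choices to be reported alongside the differential expression results.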


