Population genomics from pool sequencing

Molecular Ecology
Luca FerrettiMiguel Pérez-Enciso

Abstract

Next generation sequencing of pooled samples is an effective approach for studies of variability and differentiation in populations. In this paper we provide a comprehensive set of estimators of the most common statistics in population genetics based on the frequency spectrum, namely the Watterson estimator θW, nucleotide pairwise diversity Π, Tajima's D, Fu and Li's D and F, Fay and Wu's H, McDonald-Kreitman and HKA tests and FST, corrected for sequencing errors and ascertainment bias. In a simulation study, we show that pool and individual θ estimates are highly correlated and discuss how the performance of the statistics vary with read depth and sample size in different evolutionary scenarios. As an application, we reanalyse sequences from Drosophila mauritiana and from an evolution experiment in Drosophila melanogaster. These methods are useful for population genetic projects with limited budget, study of communities of individuals that are hard to isolate, or autopolyploid species.

References

Apr 1, 1975·Theoretical Population Biology·G A Watterson
Jun 20, 1991·Nature·J H McDonald, M Kreitman
Jul 1, 1983·Annals of Human Genetics·M Nei, R K Chesser
Oct 1, 1995·Theoretical Population Biology·Y X Fu
Nov 5, 2002·Nature Reviews. Genetics·Pak ShamMichael Owen
Feb 5, 2004·Genome Biology·Stefan KurtzSteven L Salzberg
Jul 14, 2004·Proceedings of the National Academy of Sciences of the United States of America·Hideki Innan, Yuseob Kim
Feb 18, 2005·Genetics·Joachim Hermisson, Pleuni S Pennings
Jan 7, 2006·Evolution; International Journal of Organic Evolution·Molly PrzeworskiJeffrey D Wall
Sep 9, 2006·Journal of Mathematical Biology·Carsten Wiuf
Jun 20, 2008·Genetics·Guillaume Achaz
Oct 15, 2008·PloS One·Nathan A BairdEric A Johnson
Nov 6, 2008·Genetics·Rong JiangPaul Marjoram
Nov 26, 2008·Genome Research·Gary K ChenJeffrey D Wall
Mar 3, 2009·Nature Methods·Todd E DruleyRobi D Mitra
Jul 23, 2009·Journal of the American Statistical Association·Feng GuoKent E Holsinger
Aug 14, 2009·BMC Genomics·Andreia J AmaralMartien A M Groenen
Jan 14, 2010·Genome Research·John E PoolRasmus Nielsen
May 19, 2010·Animal Genetics·Miguel Pérez-Enciso, Luca Ferretti
Jun 30, 2010·PLoS Genetics·Talia KarasovDmitri A Petrov
Aug 28, 2010·Proceedings of the National Academy of Sciences of the United States of America·Kevin J EmersonChristina M Holzapfel
Sep 15, 2010·Nature Reviews. Genetics·Jonathan K Pritchard, Anna Di Rienzo
Sep 17, 2010·Nature·Molly K BurkeAnthony D Long
Sep 22, 2010·Genetics·David J Cutler, Jeffrey D Jensen
Jan 8, 2011·Genetics·Zachariah Gompert, C Alex Buerkle
Jun 21, 2011·Bioinformatics·Andre AltmannBertram Müller-Myhsok
Sep 20, 2011·Nature Genetics·Ilan GronauAdam Siepel
Mar 3, 2012·Wiley Interdisciplinary Reviews. RNA·Christopher Iain JonesSarah Faith Newbury
Mar 14, 2012·Molecular Biology and Evolution·Simon BoitardAndreas Futschik
Jun 5, 2012·Genetics·Luca FerrettiSebastian E Ramos-Onsins
Jul 5, 2012·Journal of Visualized Experiments : JoVE·Francesco VallaniaTodd E Druley
Jul 27, 2012·Proceedings. Biological Sciences·Molly K Burke
Sep 21, 2012·BMC Bioinformatics·Emanuele RaineriMiguel Pérez-Enciso
Mar 19, 2013·BMC Genomics·Anna Esteve-CodinaMiguel Pérez-Enciso

Citations

May 3, 2014·Genome Biology and Evolution·Michael LynchScott Harrison
Sep 24, 2014·Nature Reviews. Genetics·Christian SchlöttererViola Nolte
Jan 6, 2016·Nature Reviews. Genetics·Kimberly R AndrewsPaul A Hohenlohe
Oct 17, 2014·Molecular Ecology·Kimberly R AndrewsGordon Luikart
Dec 7, 2013·Molecular Ecology·Eric C AndersonDaniel J Barshis
Dec 30, 2014·Ecology and Evolution·Alexander I Putman, Ignazio Carbone
Jan 18, 2015·Theoretical Population Biology·Luca Ferretti, Sebastian E Ramos-Onsins
Apr 18, 2016·Molecular Ecology·Laura Marilyn BenestanGordon Luikart
Nov 9, 2016·Scientific Data·Samir M KadriAmro Zayed
Oct 16, 2015·PloS One·Marco FracassettiYvonne Willi
Mar 4, 2017·Molecular Ecology·Benjamin M Van DorenMiriam Liedvogel
Jan 19, 2018·Molecular Biology and Evolution·Yvonne WilliJosh Van Buskirk
Dec 22, 2017·Methods in Ecology and Evolution·R Axel W WibergMichael G Ritchie
Nov 10, 2018·Molecular Ecology·Alessia GuggisbergAlex Widmer
Sep 11, 2016·G3 : Genes - Genomes - Genetics·Robert KoflerChristian Schlötterer
Jun 13, 2019·PLoS Genetics·Michael ØrstedTorsten Nygaard Kristensen
Apr 2, 2015·Briefings in Functional Genomics·Martin HasselmannAmro Zayed
Apr 12, 2020·G3 : Genes - Genomes - Genetics·Thomas M LilleySteve Paterson
Jun 17, 2020·Integrative and Comparative Biology·April D GarrettMelissa H Pespeni
Jun 17, 2020·Genome Biology and Evolution·James G Baldwin-Brown, Anthony D Long
Nov 8, 2017·Molecular Ecology·Xiaoheng ChengMichael DeGiorgio
Nov 9, 2018·Frontiers in Genetics·Luca FerrettiSebastian E Ramos-Onsins
Aug 1, 2018·Genetics·Valentin HivertRenaud Vitalis
Oct 20, 2020·Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences·Martin D GarlovskyRhonda R Snook

Related Concepts

In Silico
Drosophila
Genetics, Population
Two-Parameter Models
Sample Size
Genomics
High-Throughput Nucleotide Sequencing
Cell Differentiation Process
Drosophila melanogaster
Fluorouracil

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Systemic Juvenile Idiopathic Arthritis

Systemic juvenile idiopathic arthritis is a rare rheumatic disease that affects children. Symptoms include joint pain, but also fevers and skin rashes. Here is the latest on this disease.

Chromatin Regulation and Circadian Clocks

The circadian clock plays an important role in regulating transcriptional dynamics through changes in chromatin folding and remodelling. Discover the latest research on Chromatin Regulation and Circadian Clocks here.

Central Pontine Myelinolysis

Central Pontine Myelinolysis is a neurologic disorder caused most frequently by rapid correction of hyponatremia and is characterized by demyelination that affects the central portion of the base of the pons. Here is the latest research on this disease.

Myocardial Stunning

Myocardial stunning is a mechanical dysfunction that persists after reperfusion of previously ischemic tissue in the absence of irreversible damage including myocardial necrosis. Here is the latest research.

Pontocerebellar Hypoplasia

Pontocerebellar hypoplasias are a group of neurodegenerative autosomal recessive disorders with prenatal onset, atrophy or hypoplasia of the cerebellum, hypoplasia of the ventral pons, microcephaly, variable neocortical atrophy and severe mental and motor impairments. Here is the latest research on pontocerebellar hypoplasia.

Cell Atlas Along the Gut-Brain Axis

Profiling cells along the gut-brain axis at the single cell level will provide unique information for each cell type, a three-dimensional map of how cell types work together to form tissues, and insights into how changes in the map underlie health and disease of the GI system and its crosstalk with the brain. Disocver the latest research on single cell analysis of the gut-brain axis here.

Chronic Traumatic Encephalopathy

Chronic Traumatic Encephalopathy (CTE) is a progressive degenerative disease that occurs in individuals that suffer repetitive brain trauma. Discover the latest research on traumatic encephalopathy here.