Multiway admixture deconvolution using phased or unphased ancestral panels

Genetic Epidemiology
Claire Churchhouse, Jonathan Marchini

Abstract

We describe a novel method for inferring the local ancestry of admixed individuals from dense genome-wide single nucleotide polymorphism data. The method, called MULTIMIX, allows multiple source populations, models population linkage disequilibrium between markers, and is applicable to datasets in which the sample and source populations are either phased or unphased. The model is based upon a hidden Markov model of switches in ancestry between consecutive windows of loci. We model the observed haplotypes within each window using a multivariate normal distribution with parameters estimated from the ancestral panels. We present three methods to fit the model: Markov chain Monte Carlo sampling, the Expectation Maximization algorithm, and a Classification Expectation Maximization algorithm. On individuals simulated to be admixed with European and West African ancestry, the method performs comparably to HAPMIX, with the ancestry calls of the two methods agreeing at 99.26% of loci across the three parameter groups. The method is faster than HAPMIX, and it also performs well over a range of extents of admixture in a simulation involving three ancestral populations. In an analysis of real data, we estim...
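The windowed hidden Markov model with multivariate normal emissions described in the abstract can be illustrated with a minimal sketch. This is not the MULTIMIX implementation: it assumes phased haplotypes coded 0/1, a single fixed switch probability shared by all populations, a ridge term to keep each window's covariance invertible, and parameters held fixed rather than fitted by MCMC, EM, or CEM. All function names, `switch_prob`, and `ridge` are hypothetical choices made for illustration.

```python
import numpy as np

def mvn_logpdf(x, mean, cov):
    """Log density of a multivariate normal evaluated at vector x."""
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    quad = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (x.shape[0] * np.log(2 * np.pi) + logdet + quad)

def fit_window_params(panel, ridge=0.1):
    """Mean and ridge-regularised covariance of one window of a panel.

    panel has shape (n_haplotypes, window_size); ridge is an assumed
    regulariser, not a parameter taken from the paper.
    """
    mean = panel.mean(axis=0)
    cov = np.cov(panel, rowvar=False) + ridge * np.eye(panel.shape[1])
    return mean, cov

def posterior_ancestry(hap, panels, window_size, switch_prob=0.05):
    """Forward-backward posteriors of ancestry per window (needs >= 2 panels).

    Returns an array of shape (n_windows, n_populations).
    """
    n_pops = len(panels)
    n_windows = hap.shape[0] // window_size

    # Emission log-likelihood of each window under each source population.
    log_emit = np.empty((n_windows, n_pops))
    for w in range(n_windows):
        sl = slice(w * window_size, (w + 1) * window_size)
        for k, panel in enumerate(panels):
            mean, cov = fit_window_params(panel[:, sl])
            log_emit[w, k] = mvn_logpdf(hap[sl], mean, cov)

    # Assumed transition matrix: stay with prob 1 - switch_prob,
    # otherwise switch uniformly to one of the other populations.
    trans = np.full((n_pops, n_pops), switch_prob / (n_pops - 1))
    np.fill_diagonal(trans, 1.0 - switch_prob)

    # Scaled forward-backward recursions (rescaling avoids underflow).
    emit = np.exp(log_emit - log_emit.max(axis=1, keepdims=True))
    fwd = np.empty_like(emit)
    bwd = np.ones_like(emit)
    fwd[0] = emit[0] / n_pops
    fwd[0] /= fwd[0].sum()
    for w in range(1, n_windows):
        fwd[w] = emit[w] * (fwd[w - 1] @ trans)
        fwd[w] /= fwd[w].sum()
    for w in range(n_windows - 2, -1, -1):
        bwd[w] = trans @ (emit[w + 1] * bwd[w + 1])
        bwd[w] /= bwd[w].sum()

    post = fwd * bwd
    return post / post.sum(axis=1, keepdims=True)

# Toy data (entirely synthetic): panel 0 carries mostly allele 1,
# panel 1 mostly allele 0; the test haplotype switches ancestry midway.
panel0 = np.ones((10, 20))
panel0[0, :] = 0.0
panel1 = 1.0 - panel0
hap = np.concatenate([np.ones(10), np.zeros(10)])

post = posterior_ancestry(hap, [panel0, panel1], window_size=5)
calls = post.argmax(axis=1)  # most probable ancestry per window
```

Working in windows lets the covariance matrix capture linkage disequilibrium among nearby markers, while the Markov chain carries ancestry information between windows. In this sketch the posteriors come from a single forward-backward pass with fixed parameters; the fitting procedures named in the abstract would instead update the model parameters iteratively.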

Related Concepts

Genetics, Population
Native Americans
Markov Chains
Monte Carlo Method
Genealogical Tree
Linkage Disequilibrium
Genome, Human
Mexican Americans
Single Nucleotide Polymorphism
African Continental Ancestry Group
