New Statistical Criteria Detect Phylogenetic Bias Caused by Compositional Heterogeneity

Molecular Biology and Evolution
David A DuchêneSimon Y W Ho

Abstract

In statistical phylogenetic analyses of DNA sequences, models of evolutionary change commonly assume that base composition is stationary through time and across lineages. This assumption is violated by many data sets, but it is unclear whether the magnitude of these violations is sufficient to mislead phylogenetic inference. We investigated the impacts of compositional heterogeneity on phylogenetic estimates using a method for assessing model adequacy. Based on a detailed simulation study, we found that common frequentist criteria are highly conservative, such that the model is often rejected when the phylogenetic estimates do not show clear signs of bias. We propose new criteria and provide guidelines for their usage. We apply these criteria to genome-scale data from 40 birds and find that loci with severely non-homogeneous base composition are uncommon. Our results show the importance of using well-informed diagnostic statistics when testing model adequacy for phylogenomic analyses.

References

Nov 21, 1995·Proceedings of the National Academy of Sciences of the United States of America·N Galtier, M Gouy
Feb 1, 1993·Journal of Molecular Evolution·N Goldman
Jan 1, 1995·Molecular Biology and Evolution·G M LentoD Penny
Aug 10, 2000·Trends in Ecology & Evolution·A Ø Mooers, E C Holmes
Jun 26, 2002·Molecular Biology and Evolution·Jonathan P Bollback
Sep 17, 2004·Systematic Biology·Simon Y Ho, Lars Jermiin
Oct 27, 2004·Systematic Biology·Peter G Foster
Aug 26, 2006·Molecular Biology and Evolution·Samuel Blanquart, Nicolas Lartillot
Oct 25, 2006·Systematic Biology·Bastien Boussau, Manolo Gouy
Mar 10, 2007·Molecular Biology and Evolution·Vivek Gowri-Shankar, Magnus Rattray
Sep 24, 2008·BMC Evolutionary Biology·Julien Dutheil, Bastien Boussau
Jul 3, 2009·Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences·Peter G FosterT Martin Embley
Apr 13, 2010·Molecular Biology and Evolution·Maximilian P NesnidalBernhard Hausdorf
Jun 9, 2010·Systematic Biology·Nathan C SheffieldMichael F Whiting
Mar 12, 2011·Molecular Biology and Evolution·Benoit NabholzHans Ellegren
Jul 15, 2015·Molecular Biology and Evolution·David A DuchêneSimon Y W Ho
Sep 30, 2015·Molecular Biology and Evolution·Sebastián DuchêneEdward C Holmes

❮ Previous
Next ❯

Citations

Feb 27, 2018·Bioinformatics·David A DuchêneSimon Y W Ho
Nov 15, 2017·Molecular Biology and Evolution·Sebastian HöhnaJeremy M Brown
Oct 14, 2017·Systematic Biology·David A DuchêneMark D B Eldridge
May 23, 2018·Genome Biology and Evolution·David A DuchêneSimon Y W Ho
Sep 20, 2019·Genome Biology and Evolution·Suha Naser-KhdourRobert Lanfear
Oct 18, 2018·Systematic Biology·Daisy A Shepherd, Steffen Klaere
Feb 3, 2021·Molecular Biology and Evolution·Robert Literman, Rachel Schwartz
Sep 12, 2021·Systematic Biology·David A DuchêneSimon Y W Ho

❮ Previous
Next ❯

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.