Empirical profile mixture models for phylogenetic reconstruction

Bioinformatics
Le Si QuangNicolas Lartillot

Abstract

Previous studies have shown that accounting for site-specific amino acid replacement patterns using mixtures of stationary probability profiles offers a promising approach for improving the robustness of phylogenetic reconstructions in the presence of saturation. However, such profile mixture models were introduced only in a Bayesian context, and are not yet available in a maximum likelihood (ML) framework. In addition, these mixture models only perform well on large alignments, from which they can reliably learn the shapes of profiles, and their associated weights. In this work, we introduce an expectation-maximization algorithm for estimating amino acid profile mixtures from alignment databases. We apply it, learning on the HSSP database, and observe that a set of 20 profiles is enough to provide a better statistical fit than currently available empirical matrices (WAG, JTT), in particular on saturated data.

References

Jun 1, 1992·Computer Applications in the Biosciences : CABIOS·D T JonesJ M Thornton
Jan 1, 1981·Journal of Molecular Evolution·J Felsenstein
Jan 1, 1996·Molecular Biology and Evolution·J Felsenstein, G A Churchill
Apr 1, 1996·Journal of Molecular Evolution·J Adachi, M Hasegawa
May 1, 1996·Molecular Biology and Evolution·J L ThorneD T Jones
Dec 1, 1996·Molecular Biology and Evolution·W J Bruno
Jul 10, 1998·Molecular Biology and Evolution·A L Halpern, W J Bruno
May 4, 2001·Trends in Genetics : TIG·S WhelanN Goldman
Apr 17, 2002·Journal of Molecular Biology·I Holmes, G M Rubin
Jul 27, 2002·Annual Review of Microbiology·Patrick J Keeling, Naomi M Fast
Oct 25, 2002·Systematic Biology·Rasmus Nielsen
Oct 8, 2003·Systematic Biology·Stéphane Guindon, Olivier Gascuel
Mar 12, 2004·Molecular Biology and Evolution·Nicolas Lartillot, Hervé Philippe
Nov 9, 2004·Bioinformatics·Gavin E Crooks, Steven E Brenner
Feb 11, 2005·Molecular Biology and Evolution·Hervé PhilippeHenner Brinkmann
Jun 1, 2005·Trends in Genetics : TIG·Mike Steel
Jun 1, 2005·Trends in Genetics : TIG·Joseph W Thornton, Bryan Kolaczkowski
Oct 26, 2005·Systematic Biology·Henner BrinkmannHervé Philippe
Mar 9, 2006·Systematic Biology·Nicolas Lartillot, Hervé Philippe
Apr 12, 2006·Proceedings of the National Academy of Sciences of the United States of America·John P HuelsenbeckSergei L Kosakovsky Pond
Dec 8, 2006·Genetics·Jan Engelstädter, Gregory D D Hurst
Feb 10, 2007·BMC Evolutionary Biology·Nicolas LartillotHervé Philippe
Dec 14, 2007·Systematic Biology·John P Huelsenbeck, Marc A Suchard
Mar 28, 2008·Molecular Biology and Evolution·Si Quang Le, Olivier Gascuel

❮ Previous
Next ❯

Citations

Aug 11, 2010·Journal of Molecular Evolution·Georgia TsagkogeorgaFrédéric Delsuc
Feb 24, 2010·Proceedings of the National Academy of Sciences of the United States of America·Nicolas RodrigueNicolas Lartillot
Jun 19, 2009·Bioinformatics·Nicolas LartillotSamuel Blanquart
Apr 19, 2011·Molecular Biology and Evolution·Mathieu Groussin, Manolo Gouy
Jun 10, 2011·Molecular Biology and Evolution·Alexis Criscuolo, Simonetta Gribaldo
Apr 12, 2012·Molecular Biology and Evolution·Si Quang LeOlivier Gascuel
Dec 13, 2012·Molecular Biology and Evolution·Chieh-Hsi WuAlexei J Drummond
Feb 20, 2013·Molecular Biology and Evolution·Ben MurrellKonrad Scheffler
May 24, 2013·Molecular Biology and Evolution·Laurent GuéguenJulien Y Dutheil
Jul 3, 2013·Molecular Biology and Evolution·Claire C MorganMary J O'Connell
Oct 26, 2012·Proceedings. Biological Sciences·Tom A WilliamsT Martin Embley
Aug 30, 2013·Proceedings. Biological Sciences·Matthew W BrownAndrew J Roger
Oct 15, 2008·Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences·Si Quang LeOlivier Gascuel
Jan 18, 2011·BMC Evolutionary Biology·Béatrice Roure, Hervé Philippe
May 16, 2009·BMC Evolutionary Biology·Paweł Mackiewicz, Elzbieta Wyroba
Feb 18, 2014·Genome Biology and Evolution·Tom A Williams, T Martin Embley
Jan 21, 2014·Molecular Biology and Evolution·Huai-Chun WangAndrew J Roger
Jan 9, 2014·Molecular Biology and Evolution·Nicolas C RochetteManolo Gouy
Jul 27, 2014·Genome Biology and Evolution·Roberto FeudaDavide Pisani
Oct 2, 2012·Molecular Phylogenetics and Evolution·Matthias BerntPeter F Stadler
Oct 20, 2015·Mitochondrial DNA. Part A. DNA Mapping, Sequencing, and Analysis·Nan SongFan Song
Mar 12, 2016·Genome Génome / Conseil National De Recherches Canada·Kasandra Labonté, Stéphane Aris-Brosou
Mar 7, 2013·Evolution; International Journal of Organic Evolution·Kalle T RytkönenMikko Nikinmaa
Apr 9, 2016·Biology Direct·Leonardo de Oliveira Martins, David Posada
Sep 2, 2015·Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences·Richard GouyHervé Philippe
Sep 2, 2015·Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences·Nicolas Lartillot
Mar 5, 2015·Molecular Biology and Evolution·Richard A GoldsteinDavid D Pollock
May 20, 2016·BMC Evolutionary Biology·Ugo CenciJohn M Archibald
May 9, 2015·PloS One·Roberto Feuda, Andrew B Smith
Mar 31, 2016·Protein Science : a Publication of the Protein Society·Richard A Goldstein, David D Pollock

❮ Previous
Next ❯

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.