Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega

Molecular Systems Biology
Fabian Sievers ... Desmond G Higgins

Abstract

Multiple sequence alignments are fundamental to many sequence analysis methods. Most alignments are computed using the progressive alignment heuristic. These methods are starting to become a bottleneck in some analysis pipelines when faced with data sets of many thousands of sequences. Some methods can handle larger data sets but sacrifice quality, while others produce high-quality alignments but scale badly with the number of sequences. In this paper, we describe a new program called Clustal Omega, which can align virtually any number of protein sequences quickly and delivers accurate alignments. The accuracy of the package on smaller test cases is similar to that of the high-quality aligners. On larger data sets, Clustal Omega outperforms other packages in terms of both execution time and quality. Clustal Omega also has powerful features for adding sequences to existing alignments and for exploiting the information they contain, making use of the vast amount of precomputed information in public databases like Pfam.
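As a rough illustration of the features mentioned above (adding sequences to an existing alignment and using precomputed profile information such as a Pfam HMM), the following minimal sketch assembles a `clustalo` command line from Python. The helper names and all file names are hypothetical and not part of Clustal Omega. The options shown (`-i`, `-o`, `--threads`, `--profile1`, `--hmm-in`) are those documented for the `clustalo` command-line program, but check `clustalo --help` for the installed version before relying on them.

```python
import shutil
import subprocess
from typing import List, Optional


def build_clustalo_command(
    sequences: str,
    output: str,
    profile: Optional[str] = None,
    hmm: Optional[str] = None,
    threads: int = 1,
) -> List[str]:
    """Assemble a clustalo argument list (hypothetical helper, not part of Clustal Omega)."""
    cmd = ["clustalo", "-i", sequences, "-o", output, f"--threads={threads}"]
    if profile is not None:
        # Align the input sequences against an existing alignment (profile).
        cmd.append(f"--profile1={profile}")
    if hmm is not None:
        # Supply an external profile HMM, for example one obtained from Pfam.
        cmd.append(f"--hmm-in={hmm}")
    return cmd


def run_if_available(cmd: List[str]) -> Optional[int]:
    """Run the command only if the binary is on PATH; return its exit code, else None."""
    if shutil.which(cmd[0]) is None:
        return None
    return subprocess.run(cmd, check=False).returncode
```

For example, `build_clustalo_command("new.fa", "out.fa", profile="existing.sto", threads=4)` returns an argument list that can be passed to `run_if_available`. Building the list separately from running it keeps the sketch usable on machines where Clustal Omega is not installed.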



Software Mentioned

MUSCLE
OpenMP
FreeBSD
Probalign
Clustal Omega
HomFam
ProbCons
T-Coffee
HOMSTRAD
MAFFT L-INS-i
