PPalign: optimal alignment of Potts models representing proteins with direct coupling information.

BMC Bioinformatics
Hugo Talibart, F. Coste

Abstract

To assign structural and functional annotations to the ever increasing amount of sequenced proteins, the main approach relies on sequence-based homology search methods, e.g. BLAST or the current state-of-the-art methods based on profile Hidden Markov Models, which rely on significant alignments of query sequences to annotated proteins or protein families. While powerful, these approaches do not take coevolution between residues into account. Taking advantage of recent advances in the field of contact prediction, we propose here to represent proteins by Potts models, which model direct couplings between positions in addition to positional composition, and to compare proteins by aligning these models. Due to non-local dependencies, the problem of aligning Potts models is hard and remains the main computational bottleneck for their use. We introduce here an Integer Linear Programming formulation of the problem and PPalign, a program based on this formulation, to compute the optimal pairwise alignment of Potts models representing proteins in tractable time. The approach is assessed with respect to a non-redundant set of reference pairwise sequence alignments from SISYPHUS benchmark which have lowest sequence identity (between [Form...Continue Reading

References

Oct 5, 1990·Journal of Molecular Biology·S F AltschulD J Lipman
Sep 5, 1993·Journal of Molecular Biology·L Holm, C Sander
Jan 27, 1999·Bioinformatics·S R Eddy
May 21, 2004·Protein Science : a Publication of the Protein Society·Guoli Wang, Roland L Dunbrack
Sep 24, 2005·Nature·Michael SocolichRama Ranganathan
Oct 28, 2006·Nucleic Acids Research·Antonina AndreevaAlexey G Murzin
Aug 20, 2008·Bioinformatics·Vinay PulimJadwiga Bienkowska
Jan 1, 2009·Proceedings of the National Academy of Sciences of the United States of America·Martin WeigtTerence Hwa
Jun 10, 2009·Bioinformatics·Salvador Capella-GutiérrezToni Gabaldón
Feb 12, 2010·Proceedings of the National Academy of Sciences of the United States of America·Matt MenkeLenore Cowen
Jul 9, 2010·Bioinformatics·Pietro Di LenaRita Casadio
Jan 8, 2011·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Rumen AndonovNicola Yanev
Nov 23, 2011·Proceedings of the National Academy of Sciences of the United States of America·Faruck MorcosMartin Weigt
Feb 16, 2013·Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics·Magnus EkebergErik Aurell
May 25, 2013·IEEE/ACM Transactions on Computational Biology and Bioinformatics·Inken WohlersGunnar W Klau
Mar 29, 2014·PLoS Computational Biology·Jianzhu MaJinbo Xu
Sep 12, 2015·IEEE/ACM Transactions on Computational Biology and Bioinformatics·Noah M DanielsLenore J Cowen
Oct 17, 2015·Proteins·Bohdan MonastyrskyyAndriy Kryshtafovych
Dec 3, 2016·Nucleic Acids Research·Milot MirditaMartin Steinegger
Jan 20, 2018·Molecular Biology and Evolution·Matteo FigliuzziMartin Weigt
Dec 1, 2020·PLoS Computational Biology·Grey W Wilburn, Sean R Eddy
Jan 21, 2021·Physical Review. E·Anna Paola MuntoniFrancesco Zamponi
Feb 25, 2019·BioRxiv : the Preprint Server for Biology·M. SteineggerJohannes Soeding

❮ Previous
Next ❯

Software Mentioned

HMMER
CCMpred
MRFalign
CCMpredPy
Hyperopt
HHmake
HHalign
PPalign
BLASTp
SMURF

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.

Related Papers

BioRxiv : the Preprint Server for Biology
Hugo Talibart, F. Coste
California State Journal of Medicine
E S Merritt
Journal of Neurosurgery
M Turgut
South African Medical Journal = Suid-Afrikaanse Tydskrif Vir Geneeskunde
C J KAPLAN
© 2021 Meta ULC. All rights reserved