TAPO: A combined method for the identification of tandem repeats in protein structures

FEBS Letters
Phuong Do Viet, Andrey V Kajava

Abstract

In recent years, there has been an emergence of new 3D structures of proteins containing tandem repeats (TRs), as a result of improved expression and crystallization strategies. Databases focused on structure classifications (PDB, SCOP, CATH) do not provide an easy solution for selection of these structures from PDB. Several approaches have been developed, but no single best approach exists for identifying the whole range of 3D TRs. Here we describe the TAndem PrOtein detector (TAPO) that uses periodicities of atomic coordinates and other types of structural representation, including strings generated by conformational alphabets, residue contact maps, and arrangements of vectors of secondary structure elements. The benchmarking shows the superior performance of TAPO over the existing programs. According to our analysis of PDB using TAPO, 19% of proteins contain 3D TRs. This analysis allowed us to identify new families of 3D TRs, suggesting that TAPO can be used to regularly update the collection and classification of existing repetitive structures.
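
To make the idea of periodicity in a conformational-alphabet string concrete, here is a minimal sketch in Python. It is not TAPO's algorithm, which the abstract does not specify. It only illustrates one simple way to look for a repeat length: slide the string against itself and measure how often the letters agree at each lag. The function names `self_match_fraction` and `best_period`, the default lag bounds, and the toy input string are all assumptions made for illustration.

```python
from typing import Optional, Tuple


def self_match_fraction(s: str, lag: int) -> float:
    """Fraction of positions i where s[i] equals s[i + lag]."""
    n = len(s) - lag
    if n <= 0:
        return 0.0
    matches = sum(1 for i in range(n) if s[i] == s[i + lag])
    return matches / n


def best_period(
    s: str, min_lag: int = 2, max_lag: Optional[int] = None
) -> Optional[Tuple[int, float]]:
    """Return (lag, score) for the lag with the highest self-match fraction.

    Hypothetical helper: ties are broken in favour of the shorter lag, so a
    perfect repeat of length p is reported as p rather than 2p.
    """
    if max_lag is None:
        max_lag = len(s) // 2
    scores = {
        lag: self_match_fraction(s, lag) for lag in range(min_lag, max_lag + 1)
    }
    if not scores:
        return None  # string too short to test any lag
    lag = max(scores, key=lambda k: (scores[k], -k))
    return lag, scores[lag]


# Toy string standing in for a conformational-alphabet encoding
# (made up for illustration, not derived from a real structure).
toy = "abcde" * 4
print(best_period(toy))
```

Real repeats are imperfect, so a practical detector would need a score threshold, tolerance for insertions and deletions, and evidence from the other representations the abstract lists (atomic coordinates, contact maps, secondary-structure vectors). This sketch covers none of that.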


