Evolutionary information for specifying a protein fold

Nature
Michael Socolich ... Rama Ranganathan

Abstract

Classical studies show that for many proteins, the information required for specifying the tertiary structure is contained in the amino acid sequence. Here, we attempt to define the sequence rules for specifying a protein fold by computationally creating artificial protein sequences using only statistical information encoded in a multiple sequence alignment and no tertiary structure information. Experimental testing of libraries of artificial WW domain sequences shows that a simple statistical energy function capturing coevolution between amino acid residues is necessary and sufficient to specify sequences that fold into native structures. The artificial proteins show thermodynamic stabilities similar to natural WW domains, and structure determination of one artificial protein shows excellent agreement with the WW fold at atomic resolution. The relative simplicity of the information used for creating sequences suggests a marked reduction to the potential complexity of the protein-folding problem.
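The abstract does not spell out the statistical energy function, so the following is only a minimal sketch of the general idea of generating artificial sequences from alignment statistics alone. It assumes a toy alignment (`MSA`, hypothetical data, not WW domain sequences) and a simple squared difference in residue pair counts as a stand-in energy, which is not the authors' actual function. Residues are swapped only within a column, so per-site amino acid frequencies are kept fixed while a Metropolis annealing loop pulls the pairwise (coevolution) statistics back toward those of the input alignment.

```python
import math
import random
from collections import Counter
from itertools import combinations

# Toy alignment: hypothetical data for illustration only.
MSA = ["ACDA", "ACDA", "GCEA", "GCEA", "ATDG", "GTEG"]


def pair_counts(msa):
    """Count co-occurring residue pairs for every pair of columns."""
    counts = Counter()
    for seq in msa:
        for i, j in combinations(range(len(seq)), 2):
            counts[(i, j, seq[i], seq[j])] += 1
    return counts


def energy(msa, target):
    """Stand-in energy: squared difference between pair counts and the target."""
    current = pair_counts(msa)
    keys = set(current) | set(target)
    return sum((current[k] - target[k]) ** 2 for k in keys)


def shuffle_columns(msa, rng):
    """Shuffle each column independently.

    Site frequencies are preserved; couplings between columns are scrambled.
    """
    cols = [list(c) for c in zip(*msa)]
    for c in cols:
        rng.shuffle(c)
    return [list(row) for row in zip(*cols)]


def anneal(msa, steps=5000, t_start=2.0, t_end=0.05, seed=0):
    """Metropolis annealing over within-column swaps (assumed schedule)."""
    rng = random.Random(seed)
    target = pair_counts(msa)
    design = shuffle_columns(msa, rng)
    e = energy(design, target)
    n_seq, n_col = len(design), len(design[0])
    for step in range(steps):
        # Geometric cooling from t_start towards t_end.
        t = t_start * (t_end / t_start) ** (step / steps)
        col = rng.randrange(n_col)
        a, b = rng.sample(range(n_seq), 2)
        design[a][col], design[b][col] = design[b][col], design[a][col]
        e_new = energy(design, target)
        if e_new <= e or rng.random() < math.exp((e - e_new) / t):
            e = e_new  # accept the swap
        else:
            # Reject: undo the swap.
            design[a][col], design[b][col] = design[b][col], design[a][col]
    return ["".join(s) for s in design], e


if __name__ == "__main__":
    designed, final_energy = anneal(MSA)
    print(designed, final_energy)
```

Restricting moves to within-column swaps is a design choice in this sketch: it keeps the conservation pattern at each site fixed by construction, so the annealing loop only has to account for the pairwise statistics.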

Related Concepts

In Vivo NMR Spectroscopy
Protein Denaturation
Thermodynamics
Determination, Sequence Homology
Tertiary Protein Structure
Protein Folding, Globular
Evolution, Molecular
Computational Molecular Biology
