Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes

Genome Research
Alexandre LomsadzeM Borodovsky

Abstract

In a conventional view of the prokaryotic genome organization, promoters precede operons and ribosome binding sites (RBSs) with Shine-Dalgarno consensus precede genes. However, recent experimental research suggesting a more diverse view motivated us to develop an algorithm with improved gene-finding accuracy. We describe GeneMarkS-2, an ab initio algorithm that uses a model derived by self-training for finding species-specific (native) genes, along with an array of precomputed "heuristic" models designed to identify harder-to-detect genes (likely horizontally transferred). Importantly, we designed GeneMarkS-2 to identify several types of distinct sequence patterns (signals) involved in gene expression control, among them the patterns characteristic for leaderless transcription as well as noncanonical RBS patterns. To assess the accuracy of GeneMarkS-2, we used genes validated by COG (Clusters of Orthologous Groups) annotation, proteomics experiments, and N-terminal protein sequencing. We observed that GeneMarkS-2 performed better on average in all accuracy measures when compared with the current state-of-the-art gene prediction tools. Furthermore, the screening of ∼5000 representative prokaryotic genomes made by GeneMarkS-2 pre...Continue Reading

References

Aug 7, 1992·Cell·C S Shean, M E Gottesman
Dec 25, 1992·Nucleic Acids Research·J W Fickett, C S Tung
Apr 1, 1974·Proceedings of the National Academy of Sciences of the United States of America·J Shine, L Dalgarno
Sep 11, 1995·Nucleic Acids Research·M BorodovskyA Danchin
Apr 11, 1994·Nucleic Acids Research·D BarrickG D Stormo
Oct 24, 1997·Science·R L TatusovD J Lipman
Dec 11, 1999·Nucleic Acids Research·K E Rudd
May 24, 2001·Journal of Molecular Biology·M M SlupskaJ H Miller
Jun 26, 2003·Nucleic Acids Research·William ThompsonCharles E Lawrence
Sep 13, 2003·BMC Bioinformatics·Roman L TatusovDarren A Natale
Feb 4, 2006·Molecular & Cellular Proteomics : MCP·Syuji YamazakiKatsumi Isono
Jan 24, 2007·Bioinformatics·Arthur L DelcherSteven L Salzberg
Jun 19, 2009·Molecular Systems Biology·Tie KoideNitin S Baliga
Nov 4, 2009·Genome Research·Omri WurtzelRotem Sorek
Dec 10, 2009·Proceedings of the National Academy of Sciences of the United States of America·Dominik JägerRuth A Schmitz
Mar 10, 2010·BMC Bioinformatics·Doug HyattLoren J Hauser
Apr 21, 2010·Nucleic Acids Research·Wenhan ZhuMark Borodovsky
Oct 29, 2010·Tuberculosis·Jocelyne M LewStewart T Cole
Jan 20, 2011·Proceedings of the National Academy of Sciences of the United States of America·Jan MitschkeWolfgang R Hess
Mar 27, 2012·Molecular Biology and Evolution·Kyungtaek LimIchizo Kobayashi
Dec 1, 2012·Nucleic Acids Research·Jindan Zhou, Kenneth E Rudd
Jan 22, 2013·Applied and Environmental Microbiology·Udo WegmannSimon R Carding
Jul 26, 2013·RNA Biology·Claire Toffano-NiocheDaniel Gautheret
Dec 10, 2013·Nucleic Acids Research·Tatiana TatusovaIgor Tolstoy
Dec 18, 2013·Cell Host & Microbe·Carsten KrögerJay C D Hinton
Jul 16, 2014·Current Opinion in Microbiology·Cynthia M Sharma, Jörg Vogel
Nov 28, 2014·Nucleic Acids Research·Michael Y GalperinEugene V Koonin
Dec 9, 2014·Current Opinion in Microbiology·James P Creecy, Tyrrell Conway
Aug 12, 2015·Cellular and Molecular Life Sciences : CMLS·Claudio O Gualerzi, Cynthia L Pon
Nov 4, 2015·Nucleic Acids Research·Socorro Gama-CastroJulio Collado-Vides

❮ Previous
Next ❯

Citations

Dec 19, 2019·Briefings in Bioinformatics·Richa Bharti, Dominik G Grimm
May 10, 2020·FEMS Microbiology Reviews·Daria FijalkowskaPetra Van Damme
Aug 4, 2020·Environmental Microbiology·Carlos M DuarteXabier Irigoien
Aug 28, 2020·Bioinformatics and Biology Insights·Sávio Souza CostaRafael Azevedo Baraúna
Aug 8, 2020·Frontiers in Microbiology·Jessica C A FriedersdorffChristopher J Creevey
Jul 13, 2019·International Journal of Molecular Sciences·Alicia Salisbury, Philippos K Tsourkas
Aug 21, 2020·Microbiology Resource Announcements·Eleanor I LamontBatbileg Bor
Jul 25, 2020·Microbial Genomics·Ana Elena Pérez-CobasCarmen Buchrieser
Oct 17, 2018·Genes·Mike Dyall-SmithFriedhelm Pfeiffer
Oct 30, 2020·Nucleic Acids Research·I-Min A ChenNikos C Kyrpides
Dec 4, 2020·Nucleic Acids Research·Wenjun LiFrançoise Thibaud-Nissen
Feb 7, 2021·Current Biology : CB·Cintia IhaHeroen Verbruggen
Feb 27, 2021·PLoS Computational Biology·Markus J Sommer, Steven L Salzberg
Feb 26, 2021·MSphere·Jiahui ZhuLiang Xiao
Mar 6, 2021·Frontiers in Microbiology·Colin TittesTessa E F Quax
Apr 15, 2021·TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik·Jacob I MarshDavid Edwards
Feb 26, 2021·Journal of Clinical Microbiology·Michael R WeigandM Lucia Tondella
Apr 22, 2021·Archives of Microbiology·Malek MarianMasafumi Shimizu
May 15, 2021·Microbiology Resource Announcements·Mike Dyall-SmithSen-Lin Tang
Jun 1, 2021·NAR Genomics and Bioinformatics·Alexandre LomsadzeMark Borodovsky
May 20, 2021·MSystems·Alicia ClumNatalia N Ivanova
Dec 10, 2021·Microbiology Resource Announcements·Hiroshi TakagiHideo Dohra
Sep 9, 2019··Mark Borodovsky, Mark Borodovsky

❮ Previous
Next ❯

Related Concepts

Related Feeds

Archaeogenetics

Recent advances in genomic sequencing has led to the discovery of new strains of Archaea and shed light on their evolutionary history. Discover the latest research on Archaeogenetics here.