Novel gene and gene model detection using a whole genome open reading frame analysis in proteomics.

Genome Biology
Damian FerminDavid States

Abstract

Defining the location of genes and the precise nature of gene products remains a fundamental challenge in genome annotation. Interrogating tandem mass spectrometry data using genomic sequence provides an unbiased method to identify novel translation products. A six-frame translation of the entire human genome was used as the query database to search for novel blood proteins in the data from the Human Proteome Organization Plasma Proteome Project. Because this target database is orders of magnitude larger than the databases traditionally employed in tandem mass spectra analysis, careful attention to significance testing is required. Confidence of identification is assessed using our previously described Poisson statistic, which estimates the significance of multi-peptide identifications incorporating the length of the matching sequence, number of spectra searched and size of the target sequence database. Applying a false discovery rate threshold of 0.05, we identified 282 significant open reading frames, each containing two or more peptide matches. There were 627 novel peptides associated with these open reading frames that mapped to a unique genomic coordinate placed within the start/stop points of previously annotated genes. T...Continue Reading

References

Oct 5, 1990·Journal of Molecular Biology·S F AltschulD J Lipman
Nov 9, 2000·Proceedings of the National Academy of Sciences of the United States of America·S J de SouzaA J Simpson
Oct 27, 2001·Proteomics·J S ChoudharyJ S Cottrell
Sep 17, 2002·Briefings in Bioinformatics·Christian J A SigristPhilipp Bucher
Jan 29, 2003·Proceedings of the National Academy of Sciences of the United States of America·Roderic GuigoMichael R Brent
Feb 21, 2004·Bioinformatics·Robertson Craig, Ronald C Beavis
Apr 13, 2004·Molecular & Cellular Proteomics : MCP·Steven CarrUNKNOWN Working Group on Publication Guidelines for Peptide and Protein Identification Data
Jun 15, 2004·Current Opinion in Structural Biology·Michael R Brent, Roderic Guigó
Aug 3, 2004·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Adam Siepel, David Haussler
Oct 12, 2004·Journal of Proteome Research·Benjamin J CargileJames L Stephenson
Dec 21, 2004·Nucleic Acids Research·Amos BairochLai-Su L Yeh
Dec 21, 2004·Nucleic Acids Research·T HubbardE Birney
Sep 10, 2005·Proteomics·Yufeng ShenRichard D Smith

❮ Previous
Next ❯

Citations

Nov 23, 2011·Journal of Proteome Research·Xiaojing WangBing Zhang
Feb 24, 2009·Journal of Proteome Research·Ari M Frank
Nov 19, 2013·Nature Methods·Rui M M BrancaJanne Lehtiö
Oct 15, 2010·Nature Reviews. Molecular Cell Biology·Christian H AhrensRuedi Aebersold
Feb 17, 2009·Nature Reviews. Microbiology·Nathan C VerBerkmoesJillian F Banfield
Dec 23, 2008·Proceedings of the National Academy of Sciences of the United States of America·Natalie E CastellanaSteven P Briggs
Aug 16, 2008·Molecular & Cellular Proteomics : MCP·Sangtae KimPavel A Pevzner
Feb 19, 2010·Molecular & Cellular Proteomics : MCP·Natalie E CastellanaVineet Bafna
Aug 7, 2012·Molecular & Cellular Proteomics : MCP·Manfred Claassen
Dec 26, 2006·Genome Research·Stephen TannerVineet Bafna
Apr 14, 2011·BMC Plant Biology·Mohamed HelmyYasushi Ishihama
Jun 5, 2008·Genome Biology·Qing ZhangSamir M Hanash
Jun 18, 2014·Omics : a Journal of Integrative Biology·Sutopa B DwivediMobolaji Okulate
Dec 19, 2014·BMC Genomics·Julian UszkoreitMartin Eisenacher
Oct 31, 2014·Nature Methods·Alexey I Nesvizhskii
Oct 31, 2014·Nature Methods·Javier A AlfaroPaul C Boutros
Dec 15, 2015·Expert Review of Proteomics·Marie Locard-PauletJean Armengaud
Dec 3, 2015·Journal of Proteome Research·Mikhail KolmogorovPavel A Pevzner
Apr 7, 2016·Annual Review of Analytical Chemistry·Gloria M SheynkmanLloyd M Smith
May 1, 2017·Molecular & Cellular Proteomics : MCP·Kelly V RugglesD R Mani
Jan 7, 2015·Journal of the American Society for Mass Spectrometry·Yelena YefremovaMichael O Glocker
Dec 1, 2017·European Journal of Mass Spectrometry·Yelena YefremovaMichael O Glocker
Jan 20, 2011·Proteomics·Santosh RenuseAkhilesh Pandey
Dec 17, 2015·Mass Spectrometry Reviews·Gerben Menschaert, David Fenyö
Mar 18, 2015·Proteomics·Thilo MuthLennart Martens
Oct 8, 2018·Molecular & Cellular Proteomics : MCP·Zhe RenAndrew R Jones
Aug 1, 2007·Proteomics. Clinical Applications·Gilbert S Omenn

❮ Previous
Next ❯

Software Mentioned

Tandem
Perl
BLASTN
BioPerl
Excel
MEGABLAST
PROSITE
Mascot Generic Format
ENSEMBL
UNIPROT

Related Concepts

Related Feeds

ApoE Phenotypes

Apolipoprotein E (APOE) is a protein involved in fat metabolism and associated with the pathogenesis of Alzheimer's disease and cardiovascular disease. Here is the latest research on APOE phenotypes.