PMID: 9548972 · Jun 13, 1998 · Paper

Analysis of EST-driven gene annotation in human genomic sequence

Genome Research
L C Bailey, G C Overton

Abstract

We have performed a systematic analysis of gene identification in genomic sequence by similarity search against expressed sequence tags (ESTs) to assess the suitability of this method for automated annotation of the human genome. A BLAST-based strategy was constructed to examine the potential of this approach, and was applied to test sets containing all human genomic sequences longer than 5 kb in public databases, plus 300 kb of exhaustively characterized benchmark sequence. At high stringency, 70%-90% of all annotated genes are detected by near-identity to EST sequence; >95% of ESTs aligning with well-annotated sequences overlap a gene. These ESTs provide immediate access to the corresponding cDNA clones for follow-up laboratory verification and subsequent biologic analysis. At lower stringency, up to 97% of annotated genes were identified by similarity to ESTs. The apparent false-positive rate rose to 55% of ESTs among all sequences and 20% among benchmark sequences at the lowest stringency, indicating that many genes in public database entries are unannotated. Approximately half of the alignments span multiple exons, and thus aid in the construction of gene predictions and elucidation of alternative splicing. In addition, ES...
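The two quantities the abstract reports at each stringency (the fraction of annotated genes detected, and the apparent false-positive rate among aligning ESTs) can be illustrated with a small interval-overlap calculation. The following is a minimal sketch, not the authors' pipeline: the `Interval` and `Hit` types, the `evaluate` function, the use of percent identity as the only stringency knob, and the toy coordinates are all assumptions made for illustration.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    """An annotated gene on the genomic sequence (hypothetical type)."""
    name: str
    start: int  # 0-based, inclusive
    end: int    # exclusive


class Hit(NamedTuple):
    """One EST alignment against the genomic sequence (hypothetical type)."""
    est: str
    start: int
    end: int
    identity: float  # percent identity of the alignment


def overlaps(a_start: int, a_end: int, b_start: int, b_end: int) -> bool:
    """True if two half-open intervals share at least one base."""
    return a_start < b_end and b_start < a_end


def evaluate(genes, hits, min_identity):
    """Return (fraction of genes detected, apparent false-positive rate).

    Assumption: stringency is modeled only as a percent-identity cutoff.
    An EST counts as an apparent false positive when none of its retained
    alignments overlaps an annotated gene.
    """
    kept = [h for h in hits if h.identity >= min_identity]
    detected = {
        g.name
        for g in genes
        for h in kept
        if overlaps(g.start, g.end, h.start, h.end)
    }
    ests = {h.est for h in kept}
    ests_on_gene = {
        h.est
        for h in kept
        if any(overlaps(g.start, g.end, h.start, h.end) for g in genes)
    }
    gene_fraction = len(detected) / len(genes) if genes else 0.0
    apparent_fp = 1 - len(ests_on_gene) / len(ests) if ests else 0.0
    return gene_fraction, apparent_fp


# Toy data, invented for illustration only.
genes = [Interval("geneA", 100, 500), Interval("geneB", 1000, 1500)]
hits = [
    Hit("est1", 120, 300, 99.0),
    Hit("est2", 1100, 1300, 88.0),
    Hit("est3", 2000, 2200, 85.0),  # aligns outside any annotated gene
    Hit("est4", 400, 450, 97.5),
]

for cutoff in (95.0, 80.0):
    print(cutoff, evaluate(genes, hits, cutoff))
```

In this toy case, lowering the cutoff detects more genes but also admits an EST that overlaps no annotation. As the abstract notes, such an "apparent" false positive may instead mark a gene that is simply unannotated in the database entry.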
