PMID: 11342225May 9, 2001Paper

Identification of the human crooked neck gene by comparative gene identification

Biochimica Et Biophysica Acta
C H LaiW Lin

Abstract

Drosophila melanogaster serves as a useful model organism for functional genomic studies, and its genome project was recently completed. We previously described a comparative-gene-identification approach to assist human ortholog gene identification that involves applying an entire proteome as an alignment template. Analysis of the available 14100 Drosophila protein sequences revealed that 37% of them (5228 sequences) might lead to discoveries of novel human genes. Upon further database interrogations, we found several putative full-length human gene transcripts, including the human crooked neck (crn) gene. Based on sequence gap-closure experiments using reverse transcriptase-polymerase chain reaction as well as bioinformatic analysis, we found that the assembled human cDNA contig of crooked neck gene was at least 3903 base pairs in length with alternative splicing variations which encoded mainly for a 687-amino-acid residue protein. The human crooked neck gene was located on chromosome 20 with at least 15 exons. The unique features of the 16 copies of the tetratrico peptide repeat (TPR) motif were conserved in the yeast, fly and human crooked neck orthologous proteins, which were important for spliceosome assembly in cells.

References

Sep 1, 1989·Molecular and Cellular Biology·C M Nicolet, E A Craig
Apr 25, 1997·Journal of Molecular Biology·C Burge, S Karlin
Jan 7, 1998·Molecular and Cellular Biology·M R McLean, B C Rymond
Mar 25, 2000·Science·M D AdamsJ C Venter
Mar 24, 2000·Science·G M RubinS Lewis

❮ Previous
Next ❯

Citations

Feb 14, 2006·Biochemical and Biophysical Research Communications·Chun-Hung LaiWen-chang Lin
Apr 2, 2017·Artificial Intelligence in Medicine·Zhen LiYixue Li
Jun 27, 2002·Biochimica Et Biophysica Acta·Seyung ChungBrian C Rymond

❮ Previous
Next ❯

Related Concepts

Related Feeds

Alternative splicing

Alternative splicing a regulated gene expression process that allows a single genetic sequence to code for multiple proteins. Here is that latest research.