Abstract
We have analysed the genomes of representatives of three kingdoms of life, namely, archaea, eubacteria and eukaryota using data mining tools based on compositional analyses of the protein sequences. The representatives chosen in this analysis were Methanococcus jannaschii, Haemophilus influenzae and Saccharomyces cerevisiae. We have identified the common and different features between the three genomes in the protein evolution patterns. M. jannaschii has been seen to have a greater number of proteins with more charged amino acids whereas S. cerevisiae has been observed to have a greater number of hydrophilic proteins. Despite the differences in intrinsic compositional characteristics between the proteins from the different genomes we have also identified certain common characteristics. We have carried out exploratory Principal Component Analysis of the multivariate data on the proteins of each organism in an effort to classify the proteins into clusters. Interestingly, we found that most of the proteins in each organism cluster closely together, but there are a few 'outliers'. We focus on the outliers for the functional investigations, which may aid in revealing any unique features of the biology of the respective organisms
References
Jun 1, 1992·FEBS Letters·H Nakashima, K Nishikawa
Aug 20, 1991·Journal of Molecular Biology·M van Heel
Jan 1, 1986·Journal of Biochemistry·H NakashimaT Ooi
Oct 20, 1995·Science·C M FraserJ C Venter
Feb 1, 1995·Nature Structural Biology·G CasariA Valencia
Sep 1, 1994·Computers & Chemistry·J C Wootton
Apr 22, 1994·Journal of Molecular Biology·H Nakashima, K Nishikawa
Jun 1, 1993·Journal of Molecular Evolution·G Schneider, P Wrede
Sep 17, 1996·Proceedings of the National Academy of Sciences of the United States of America·A R Mushegian, E V Koonin
Oct 24, 1997·Science·R L TatusovD J Lipman
Jul 17, 1998·Current Opinion in Structural Biology·E V KooninM Y Galperin
Mar 9, 1999·Bioinformatics·M ForsterM Afzal
May 18, 1999·Journal of Molecular Biology·R NateshM A Viswamitra
Oct 19, 1999·Gene·G Schneider
Nov 7, 1999·Journal of Molecular Evolution·M A AndradeA Valencia
Dec 11, 1999·Science·C A HutchisonJ C Venter
Dec 11, 1999·Nucleic Acids Research·R L TatusovE V Koonin
Jan 19, 2000·Nucleic Acids Research·M S GelfandA A Mironov
Mar 18, 2000·Gene·S RaghavanS K Brahmachari