Comparative genomics using data mining tools

Journal of Biosciences
Tannistha NandiSrinivasan Ramachandran

Abstract

We have analysed the genomes of representatives of three kingdoms of life, namely, archaea, eubacteria and eukaryota using data mining tools based on compositional analyses of the protein sequences. The representatives chosen in this analysis were Methanococcus jannaschii, Haemophilus influenzae and Saccharomyces cerevisiae. We have identified the common and different features between the three genomes in the protein evolution patterns. M. jannaschii has been seen to have a greater number of proteins with more charged amino acids whereas S. cerevisiae has been observed to have a greater number of hydrophilic proteins. Despite the differences in intrinsic compositional characteristics between the proteins from the different genomes we have also identified certain common characteristics. We have carried out exploratory Principal Component Analysis of the multivariate data on the proteins of each organism in an effort to classify the proteins into clusters. Interestingly, we found that most of the proteins in each organism cluster closely together, but there are a few 'outliers'. We focus on the outliers for the functional investigations, which may aid in revealing any unique features of the biology of the respective organisms

References

Aug 20, 1991·Journal of Molecular Biology·M van Heel
Jan 1, 1986·Journal of Biochemistry·H NakashimaT Ooi
Oct 20, 1995·Science·C M FraserJ C Venter
Feb 1, 1995·Nature Structural Biology·G CasariA Valencia
Jun 1, 1993·Journal of Molecular Evolution·G Schneider, P Wrede
Sep 17, 1996·Proceedings of the National Academy of Sciences of the United States of America·A R Mushegian, E V Koonin
Oct 24, 1997·Science·R L TatusovD J Lipman
Jul 17, 1998·Current Opinion in Structural Biology·E V KooninM Y Galperin
Nov 7, 1999·Journal of Molecular Evolution·M A AndradeA Valencia
Dec 11, 1999·Science·C A HutchisonJ C Venter
Dec 11, 1999·Nucleic Acids Research·R L TatusovE V Koonin
Jan 19, 2000·Nucleic Acids Research·M S GelfandA A Mironov

❮ Previous
Next ❯

Citations

Feb 16, 2005·BMC Bioinformatics·Alena Shmygelska, Holger H Hoos
May 15, 2008·Bioinformation·Jayavel Sridhar, Ziauddin Ahamed Rafi

❮ Previous
Next ❯

Related Concepts

Related Feeds

Archaeogenetics

Recent advances in genomic sequencing has led to the discovery of new strains of Archaea and shed light on their evolutionary history. Discover the latest research on Archaeogenetics here.

Related Papers

Archives des maladies du coeur et des vaisseaux
L Foucan, J Vaillant
The Journal of Clinical Investigation
Edward M Rubin, G S Barsh
Omics : a Journal of Integrative Biology
Jayavel Sridhar, Ziauddin Ahamed Rafi
Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics
Hsiao-Ping HsuPeter Grassberger
© 2022 Meta ULC. All rights reserved