A visual framework for sequence analysis using n-grams and spectral rearrangement

Bioinformatics
Stefan R MaetschkeMark A Ragan

Abstract

Protein sequences are often composed of regions that have distinct evolutionary histories as a consequence of domain shuffling, recombination or gene conversion. New approaches are required to discover, visualize and analyze these sequence regions and thus enable a better understanding of protein evolution. Here, we have developed an alignment-free and visual approach to analyze sequence relationships. We use the number of shared n-grams between sequences as a measure of sequence similarity and rearrange the resulting affinity matrix applying a spectral technique. Heat maps of the affinity matrix are employed to identify and visualize clusters of related sequences or outliers, while n-gram-based dot plots and conservation profiles allow detailed analysis of similarities among selected sequences. Using this approach, we have identified signatures of domain shuffling in an otherwise poorly characterized family, and homology clusters in another. We conclude that this approach may be generally useful as a framework to analyze related, but highly divergent protein sequences. It is particularly useful as a fast method to study sequence relationships prior to much more time-consuming multiple sequence alignment and phylogenetic analys...Continue Reading

References

Jan 1, 1993·Annual Review of Biochemistry·G DreyfussC G Burd
Feb 21, 1993·Journal of Theoretical Biology·J L OliverR Román-Roldán
Jul 15, 1999·Computers & Chemistry·M Crochemore, R Vérin
Jan 11, 2000·Journal of Cellular Biochemistry·G K WhitfieldM R Haussler
Oct 12, 2001·Magnetic Resonance Imaging·B A ArdekaniJ A Helpern
Nov 8, 2002·Genome Research·Henrik KaessmannWen-Hsiung Li
Jan 18, 2003·Mammalian Genome : Official Journal of the International Mammalian Genome Society·Tarmo AnniloMichael Dean
Mar 4, 2003·Bioinformatics·Susana Vinga, Jonas Almeida
Jun 13, 2003·Trends in Genetics : TIG·Sandra L Baldauf
Dec 9, 2003·Molecular Biology and Evolution·David Bryant, Vincent Moulton
Apr 20, 2004·Current Opinion in Structural Biology·Christine VogelSarah A Teichmann
Oct 1, 2005·Molecular Endocrinology·Xiao Hu, John W Funder
Mar 21, 2006·Nucleic Acids Research·Alberto PaccanaroMansoor A S Saqi
Feb 14, 2007·BioEssays : News and Reviews in Molecular, Cellular and Developmental Biology·Edward E Schmidt, Christopher J Davies
Apr 25, 2007·Systematic Biology·Michael Höhl, Mark A Ragan
Aug 31, 2007·BMC Evolutionary Biology·Egbert K O KruithofRichard J Fish
Oct 18, 2007·BMC Bioinformatics·Susana Vinga, Jonas S Almeida
May 15, 2008·Bioinformatics·Gabriel CardonaGabriel Valiente
May 27, 2008·The Journal of Heredity·Andrey A PerelyginMargo A Brinton
Jul 1, 2008·Bioinformatics·Simon Wong, Mark A Ragan
Feb 21, 2009·PloS One·Cheong Xin ChanMark A Ragan
May 16, 2009·Genome Research·Takeshi KawashimaHiroshi Wada

❮ Previous
Next ❯

Citations

Jun 2, 2011·Journal of Mathematical Biology·Xingpeng JiangJonathan Dushoff
Jan 12, 2011·BMC Bioinformatics·Hatice Ulku Osmanbeyoglu, Madhavi K Ganapathiraju
Mar 19, 2013·BMC Bioinformatics·Satish M SrinivasanChittibabu Guda
Aug 9, 2012·BMC Evolutionary Biology·Jasmyn A CridlandKatryn J Stacey
Sep 24, 2013·Briefings in Bioinformatics·Susana Vinga
Jan 1, 2012·Computational and Structural Biotechnology Journal·Troy Wymore, Charles L Brooks
Jul 5, 2017·Briefings in Bioinformatics·Guillaume BernardMark A Ragan
Oct 5, 2017·Genome Biology·Andrzej ZielezinskiWojciech M Karlowski

❮ Previous
Next ❯

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.