An efficient algorithm for large-scale detection of protein families

Nucleic Acids Research
A J EnrightC A Ouzounis

Abstract

Detection of protein families in large databases is one of the principal research objectives in structural and functional genomics. Protein family classification can significantly contribute to the delineation of functional diversity of homologous proteins, the prediction of function based on domain architecture or the presence of sequence motifs as well as comparative genomics, providing valuable evolutionary insights. We present a novel approach called TRIBE-MCL for rapid and accurate clustering of protein sequences into families. The method relies on the Markov cluster (MCL) algorithm for the assignment of proteins into families based on precomputed sequence similarity information. This novel approach does not suffer from the problems that normally hinder other protein sequence clustering algorithms, such as the presence of multi-domain proteins, promiscuous domains and fragmented proteins. The method has been rigorously tested and validated on a number of very large databases, including SwissProt, InterPro, SCOP and the draft human genome. Our results indicate that the method is ideally suited to the rapid and accurate detection of protein families on a large scale. The method has been used to detect and categorise protein ...Continue Reading

References

Jan 1, 1973·Annual Review of Genetics·W M Fitch
Mar 25, 1981·Journal of Molecular Biology·T F Smith, M S Waterman
Jan 1, 1995·Annual Review of Biochemistry·R F Doolittle
Jun 1, 1994·Research in Microbiology·C Chang, E M Meyerowitz
Oct 1, 1993·Scientific American·R F Doolittle, P Bork
Jul 22, 1996·FEBS Letters·C Ouzounis, N Kyrpides
Jun 1, 1996·Current Opinion in Structural Biology·S R Eddy
Aug 1, 1996·Trends in Biotechnology·C OuzounisA Valencia
Sep 1, 1997·Nucleic Acids Research·S F AltschulD J Lipman
Nov 14, 1997·Nature Biotechnology·T F Smith, X Zhang
Feb 21, 1998·Nucleic Acids Research·F CorpetD Kahn
Mar 31, 1998·Current Opinion in Structural Biology·S Tan, T J Richmond
Oct 29, 1998·Journal of Molecular Biology·P BorkY Yuan
Dec 10, 1998·Nucleic Acids Research·K HofmannA Bairoch
Dec 10, 1998·Nucleic Acids Research·T K AttwoodW Wright
Jan 27, 1999·Bioinformatics·S R Eddy
Jan 27, 1999·Bioinformatics·X Guan, L Du
Dec 11, 1999·Nucleic Acids Research·A Bairoch, R Apweiler
Dec 11, 1999·Nucleic Acids Research·L Lo ConteC Chothia
Dec 11, 1999·Nucleic Acids Research·A BatemanE L Sonnhammer
Mar 24, 2000·Science·G M RubinS Lewis
Apr 26, 2000·Genome Research·C A Ouzounis, P D Karp
Jun 24, 2000·Nature·D EisenbergT O Yeates
Jun 28, 2000·Bioinformatics·A J Enright, C A Ouzounis
Aug 31, 2000·Annual Review of Biochemistry·A M StockP N Goudreau
Sep 1, 2000·FEBS Letters·S Tsoka, C A Ouzounis
Nov 7, 2000·Progress in Biophysics and Molecular Biology·A Heger, L Holm
Jan 11, 2000·Nucleic Acids Research·A BernalN Kyrpides
Feb 24, 2001·Genome Biology·I IliopoulosC A Ouzounis
Feb 27, 2001·Bioinformatics·R M CoulsonC A Ouzounis
Mar 10, 2001·Nature·E BirneyT J Hubbard
Mar 10, 2001·Nature·E S LanderUNKNOWN International Human Genome Sequencing Consortium
Jun 29, 2001·Journal of Molecular Biology·G ApicS A Teichmann
Jul 27, 2001·Bioinformatics·G ApicS A Teichmann
Oct 9, 2001·Bioinformatics·A J Enright, C A Ouzounis
Nov 3, 2001·Nucleic Acids Research·P J JanssenC A Ouzounis

❮ Previous
Next ❯

Citations

Nov 8, 2005·TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik·Chenwei LinSteven D Tanksley
May 11, 2005·Journal of Molecular Evolution·Pieter MonsieursKathleen Marchal
Sep 22, 2006·Journal of Mathematical Biology·Ryszard RudnickiDamian Wójtowicz
Apr 23, 2008·Amino Acids·Xing-Ming ZhaoKazuyuki Aihara
Dec 31, 2010·Chromosome Research : an International Journal on the Molecular, Supramolecular and Evolutionary Aspects of Chromosome Biology·Davide Baù, Marc A Marti-Renom
Jan 17, 2013·Bulletin of Mathematical Biology·Ashish Saini, Jingyu Hou
Sep 18, 2007·Cell Biochemistry and Biophysics·J C Nacher, T Akutsu
Dec 21, 2007·Molecular Biotechnology·Lucy SkrabanekAnton J Enright
Aug 13, 2013·Computers in Biology and Medicine·Chien-Hung HuangKa-Lok Ng
Apr 14, 2005·International Journal for Parasitology·Martin AslettAdrian Tivey
May 22, 2013·International Journal for Parasitology·Sven B GouldWilliam F Martin
Apr 23, 2003·Journal of Molecular Biology·Andreas Heger, Liisa Holm
Oct 29, 2000·Cell·M M WöstenE A Groisman
Jul 2, 2003·Current Opinion in Structural Biology·David LeeChristine Orengo
Jan 28, 2003·Current Opinion in Chemical Biology·Jinfeng Liu, Burkhard Rost
Nov 23, 2011·Journal of Proteome Research·Enrico CappelliniJesper V Olsen
Sep 18, 2012·Journal of Proteome Research·Chi Nam Ignatius PangMarc R Wilkins
May 1, 2013·Journal of Proteome Research·Harald MarxBernhard Kuster
Jul 2, 2004·Nature·Bernard DujonJean-Luc Souciet
May 6, 2005·Nature·L EichingerA Kuspa
Mar 24, 2006·Nature·Nevan J KroganJack F Greenblatt
Jul 17, 2009·Nature·UNKNOWN Schistosoma japonicum Genome Sequencing and Functional Analysis Consortium
Dec 25, 2009·Nature·Dongying WuJonathan A Eisen
May 26, 2009·Nature Biotechnology·Kristof De SchutterNico Callewaert
Nov 8, 2011·Nature Biotechnology·Rajeev K VarshneyScott A Jackson
May 15, 2012·Nature Biotechnology·Jeffrey L BennetzenKatrien M Devos
Jan 31, 2013·Nature Communications·Aiping ZhengPing Li
Aug 21, 2013·Nature Communications·Philip RuelensKerstin Kaufmann

❮ Previous
Next ❯

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.