SUPER-FOCUS: a tool for agile functional analysis of shotgun metagenomic data

Genivaldo Gueiros Z Silva, Robert A Edwards


Analyzing the functional profile of a microbial community from unannotated shotgun sequencing reads is an important goal in metagenomics. Functional profiling has valuable applications in biological research because it identifies the abundances of the functional genes of the organisms present in the original sample, answering the question of what they can do. Currently available tools do not scale well with increasing data volumes, which matters because both the number and the lengths of the reads produced by sequencing platforms keep increasing. Here, we introduce SUPER-FOCUS (SUbsystems Profile by databasE Reduction using FOCUS), an agile homology-based approach that uses a reduced reference database to report the subsystems present in metagenomic datasets and profile their abundances. SUPER-FOCUS was tested with over 70 real metagenomes; the results show that it accurately predicts the subsystems present in the profiled microbial communities and is up to 1000 times faster than other tools. SUPER-FOCUS was implemented in Python, and its source code and the tool website are freely available. Supplementary data are available at Bioinformatics online.
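The idea of profiling against a reduced reference database can be illustrated with a minimal Python sketch. This is not the SUPER-FOCUS implementation: the function names (`reduce_database`, `subsystem_profile`), the record layout, and the toy data are all hypothetical, and the alignment step itself is assumed to have already produced read-to-protein hits. The sketch only shows how restricting the reference to genera detected in a sample shrinks the search space, and how hits can then be tallied into relative subsystem abundances.

```python
from collections import Counter


def reduce_database(reference, detected_genera):
    """Keep only reference proteins from genera detected in the sample."""
    return {
        protein_id: record
        for protein_id, record in reference.items()
        if record["genus"] in detected_genera
    }


def subsystem_profile(hits, database):
    """Convert (read, protein) hits into relative subsystem abundances in percent."""
    counts = Counter(
        database[protein_id]["subsystem"]
        for _, protein_id in hits
        if protein_id in database  # ignore hits outside the reduced database
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {subsystem: 100.0 * n / total for subsystem, n in counts.items()}


# Hypothetical toy reference: protein id -> genus and subsystem annotation.
reference = {
    "p1": {"genus": "Escherichia", "subsystem": "Carbohydrates"},
    "p2": {"genus": "Bacillus", "subsystem": "Stress Response"},
    "p3": {"genus": "Escherichia", "subsystem": "Stress Response"},
}

# Assume a taxonomic profiler reported only Escherichia in this sample.
reduced = reduce_database(reference, {"Escherichia"})

# Hypothetical alignment output: (read id, best-hit protein id).
hits = [("read1", "p1"), ("read2", "p3"), ("read3", "p1"), ("read4", "p2")]

profile = subsystem_profile(hits, reduced)
print(profile)
```

In a real pipeline the reads would be aligned directly against the reduced database, so a hit such as `read4` to `p2` would not arise; it is included here only to show the effect of the reduction step.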





Related Concepts

Homology Modeling
Profile (Lab Procedure)
Computer Programs and Programming
Coral Reefs
