May 24, 2016

Centrifuge: rapid and sensitive classification of metagenomic sequences

BioRxiv : the Preprint Server for Biology
Daehwan Kim, Steven L Salzberg

Abstract

Centrifuge is a novel microbial classification engine that enables rapid, accurate and sensitive labeling of reads and quantification of species on desktop computers. The system uses an indexing scheme based on the Burrows-Wheeler transform (BWT) and the Ferragina-Manzini (FM) index, optimized specifically for the metagenomic classification problem. Centrifuge requires a relatively small index (4.2 GB for 4,078 bacterial and 200 archaeal genomes) and classifies sequences at very high speed, allowing it to process the millions of reads from a typical high-throughput DNA sequencing run within a few minutes. Together these advances enable timely and accurate analysis of large metagenomics data sets on conventional desktop computers. Because of its space-optimized indexing schemes, Centrifuge also makes it possible to index the entire NCBI non-redundant nucleotide sequence database (a total of 109 billion bases) with an index size of 69 GB, in contrast to k-mer-based indexing schemes, which require far more space. Centrifuge is available as free, open-source software from http://www.ccb.jhu.edu/software/centrifuge.
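To make the indexing idea in the abstract concrete, the following is a minimal Python sketch of a Burrows-Wheeler transform and an FM-index backward search over a toy DNA string. It is not Centrifuge's implementation: the function names (`bwt`, `build_fm_index`, `count_matches`) are hypothetical, the suffix array is built by naive sorting, and the rank table is stored uncompressed. A production index would use sampled, compressed structures to reach the sizes quoted above.

```python
from collections import Counter


def bwt(text):
    """Return the suffix array and BWT of text (a '$' sentinel is appended)."""
    text += "$"  # '$' sorts before A, C, G, T in ASCII
    # Naive suffix sorting: fine for a toy example, far too slow for genomes.
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    # The BWT character is the one preceding each sorted suffix.
    return sa, "".join(text[i - 1] for i in sa)


def build_fm_index(bwt_str):
    """Build the C table (count of smaller characters) and Occ rank table."""
    counts = Counter(bwt_str)
    c_table, total = {}, 0
    for ch in sorted(counts):
        c_table[ch] = total
        total += counts[ch]
    # occ[ch][i] = number of occurrences of ch in bwt_str[:i]
    occ = {ch: [0] * (len(bwt_str) + 1) for ch in counts}
    for i, ch in enumerate(bwt_str):
        for key in occ:
            occ[key][i + 1] = occ[key][i] + (key == ch)
    return c_table, occ


def count_matches(pattern, c_table, occ, n):
    """Backward search: count exact occurrences of pattern in the text."""
    lo, hi = 0, n  # half-open range of suffix-array rows
    for ch in reversed(pattern):
        if ch not in c_table:
            return 0
        lo = c_table[ch] + occ[ch][lo]
        hi = c_table[ch] + occ[ch][hi]
        if lo >= hi:
            return 0
    return hi - lo


# Toy "genome" (an assumption for illustration only).
genome = "ACGTACGTTACG"
sa, transformed = bwt(genome)
c_table, occ = build_fm_index(transformed)
print(count_matches("ACG", c_table, occ, len(transformed)))
```

The search walks the pattern from its last character to its first, narrowing a range of suffix-array rows at each step, so the query cost depends on the pattern length rather than on the size of the indexed text. That property is the general reason FM-index approaches can stay compact compared with storing every k-mer explicitly.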
