Assessing the Impact of Assemblers on Virus Detection in a De Novo Metagenomic Analysis Pipeline

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology
Daniel J White, Richard J Hall

Abstract

Applying high-throughput sequencing to pathogen discovery is a relatively new field, the objective of which is to find disease-causing agents when little or no background information on the disease is available. Key steps in the process are the generation of millions of sequence reads from an infected tissue sample, the assembly of these reads into longer, contiguous stretches of nucleotide sequence (contigs), and the identification of the contigs by matching them against known sequence databases, such as those stored at GenBank or Ensembl. This technique, de novo metagenomics, is particularly useful when the pathogen is viral and strong discriminatory power can be achieved. However, we recently found that results can differ strikingly depending on which assembler is used. In this study, we formally test the impact of five popular assemblers (MIRA, VELVET, METAVELVET, SPADES, and OMEGA) on the detection of a novel virus and the assembly of its whole genome, using a data set in which we have confirmed the presence of the virus by empirical laboratory techniques, and we compare overall performance between the assemblers. Our results show that if results from only one assembler are considered, biologically important reads can easil...
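To make the comparison between assemblers concrete, the following is a minimal Python sketch, not the authors' pipeline. It assumes that each assembler's contigs have already been searched against a database and reduced to a set of taxon labels. The function name `compare_assemblers` and the shape of its input are hypothetical and chosen only for illustration.

```python
from typing import Dict, Set


def compare_assemblers(hits: Dict[str, Set[str]]) -> Dict[str, object]:
    """Summarise which taxa each assembler's contigs matched.

    `hits` maps an assembler name to the set of taxon labels assigned to its
    contigs by a database search (the search itself is not shown here).
    This input format is an assumption made for the sketch.
    """
    if not hits:
        raise ValueError("need results from at least one assembler")

    all_sets = list(hits.values())
    union = set().union(*all_sets)            # taxa seen by any assembler
    shared = set.intersection(*all_sets)      # taxa seen by every assembler

    # Taxa reported by exactly one assembler: detections that would be
    # missed if that assembler had been left out of the analysis.
    only_in = {
        name: taxa - set().union(*(t for other, t in hits.items() if other != name))
        for name, taxa in hits.items()
    }
    return {"union": union, "shared": shared, "only_in": only_in}
```

Calling the function with one entry per assembler gives the taxa every assembler agrees on (`shared`) and those that depend on a single assembler (`only_in`). The second group is where relying on one assembler alone could change what is detected.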

