One is not enough: on the effects of reference genome for the mapping and subsequent analyses of short-reads

BioRxiv : the Preprint Server for Biology
C. Valiente-MullorF. Gonzalez-Candelas


Mapping of high-throughput sequencing (HTS) reads to a single arbitrary reference genome is a frequently used approach in microbial genomics. However, the choice of a reference may represent a source of errors that may affect subsequent analyses such as the detection of single nucleotide polymorphisms (SNPs) and phylogenetic inference. In this work, we evaluated the effect of reference choice on short-read sequence data from five clinically and epidemiologically relevant bacteria (Klebsiella pneumoniae, Legionella pneumophila, Neisseria gonorrhoeae, Pseudomonas aeruginosa and Serratia marcescens). Publicly available whole-genome assemblies encompassing the genomic diversity of these species were selected as reference sequences, and read alignment statistics, SNP calling, recombination rates, d N /d S ratios, and phylogenetic trees were evaluated depending on the mapping reference. The choice of different reference genomes proved to have an impact on almost all the parameters considered in the five species. In addition, these biases had potential epidemiological implications such as including/excluding isolates of particular clades and the estimation of genetic distances. These findings suggest that the single reference approach...Continue Reading

Related Concepts

Cholinergic Effect
Regulation of Biological Process
Ion Homeostasis
Structure of Cortex of Kidney

Related Feeds

Basal Forebrain- Circuits

Basal forebrain is a region in the brain important for production of acetylcholine and is the major cholinergic output of the CNS. Discover the latest research on circuits in the basal forebrain here.

BioRxiv & MedRxiv Preprints

BioRxiv and MedRxiv are the preprint servers for biology and health sciences respectively, operated by Cold Spring Harbor Laboratory. Here are the latest preprint articles (which are not peer-reviewed) from BioRxiv and MedRxiv.