Assembly by Reduced Complexity (ARC): a hybrid approach for targeted assembly of homologous sequences

BioRxiv : the Preprint Server for Biology
Samual S HunterMatthew L Settles


Analysis of High-throughput sequencing (HTS) data is a difficult problem, especially in the context of non-model organisms where comparison of homologous sequences may be hindered by the lack of a close reference genome. Current mapping-based methods rely on the availability of a highly similar reference sequence, whereas de novo assemblies produce anonymous (unannotated) contigs that are not easily compared across samples. Here, we present Assembly by Reduced Complexity (ARC) a hybrid mapping and assembly approach for targeted assembly of homologous sequences. ARC is an open-source project (<>) implemented in the Python language and consists of the following stages: 1) align sequence reads to reference targets, 2) use alignment results to distribute reads into target specific bins, 3) perform assemblies for each bin (target) to produce contigs, and 4) replace previous reference targets with assembled contigs and iterate. We show that ARC is able to assemble high quality, unbiased mitochondrial genomes seeded from 11 progressively divergent references, and is able to assemble full mitochondrial genomes starting from short, poor quality ancient DNA reads. We also show ARC compares favorably to de novo ...Continue Reading

Related Concepts

Genome Mapping
Homologous Sequences, Amino Acid
Molecular Assembly/Self Assembly
Biniou protein, Drosophila
Complex (molecular entity)
Genome, Mitochondrial

Related Feeds

Ancient DNA

Ancient DNA sequences are able to offer valuable insights into molecular evolutionary processes, but are notoriously difficult to analyze due to molecular damage and exogenous dna contamination. Discover the latest research on Ancient DNA here.

BioRxiv & MedRxiv Preprints

BioRxiv and MedRxiv are the preprint servers for biology and health sciences respectively, operated by Cold Spring Harbor Laboratory. Here are the latest preprint articles (which are not peer-reviewed) from BioRxiv and MedRxiv.