Jan 31, 2015

Assembly by Reduced Complexity (ARC): a hybrid approach for targeted assembly of homologous sequences

BioRxiv : the Preprint Server for Biology
Samuel S Hunter, Matthew L Settles

Abstract

Analysis of high-throughput sequencing (HTS) data is a difficult problem, especially in the context of non-model organisms, where comparison of homologous sequences may be hindered by the lack of a close reference genome. Current mapping-based methods rely on the availability of a highly similar reference sequence, whereas de novo assemblies produce anonymous (unannotated) contigs that are not easily compared across samples. Here, we present Assembly by Reduced Complexity (ARC), a hybrid mapping and assembly approach for targeted assembly of homologous sequences. ARC is an open-source project (<http://ibest.github.io/ARC/>) implemented in the Python language and consists of the following stages: 1) align sequence reads to reference targets, 2) use alignment results to distribute reads into target-specific bins, 3) perform assemblies for each bin (target) to produce contigs, and 4) replace previous reference targets with assembled contigs and iterate. We show that ARC is able to assemble high-quality, unbiased mitochondrial genomes seeded from 11 progressively divergent references, and is able to assemble full mitochondrial genomes starting from short, poor-quality ancient DNA reads. We also show ARC compares favorably to de novo ...
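The four stages listed in the abstract can be illustrated with a small Python sketch. This is not ARC's code: exact k-mer sharing stands in for a real read aligner, a greedy overlap merger stands in for a real assembler, and every name here (`K`, `kmers`, `bin_reads`, `merge`, `assemble`, `arc_sketch`, the toy sequence) is a hypothetical chosen for illustration. It assumes error-free reads and one contig per target.

```python
from collections import defaultdict

K = 8  # hypothetical k-mer size for the toy "aligner"; also the minimum overlap


def kmers(seq, k=K):
    """Return the set of all k-length substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def bin_reads(reads, targets):
    """Stages 1-2: assign each read to the target it shares the most k-mers with."""
    index = {name: kmers(seq) for name, seq in targets.items()}
    bins = defaultdict(list)
    for read in reads:
        read_kmers = kmers(read)
        best = max(index, key=lambda name: len(read_kmers & index[name]))
        if read_kmers & index[best]:  # reads sharing no k-mer stay unbinned
            bins[best].append(read)
    return bins


def merge(a, b, min_overlap=K):
    """Merge two sequences by containment or exact end overlap; None if neither applies."""
    if b in a:
        return a
    if a in b:
        return b
    for n in range(min(len(a), len(b)) - 1, min_overlap - 1, -1):
        if a.endswith(b[:n]):
            return a + b[n:]
        if b.endswith(a[:n]):
            return b + a[n:]
    return None


def assemble(reads):
    """Stage 3: greedily merge the reads of one bin into a single contig."""
    pending = sorted(set(reads), key=lambda r: (-len(r), r))
    contig = pending.pop(0)
    progress = True
    while progress and pending:
        progress = False
        for read in list(pending):
            merged = merge(contig, read)
            if merged is not None:
                contig = merged
                pending.remove(read)
                progress = True
    return contig


def arc_sketch(reads, targets, max_iterations=10):
    """Stage 4: replace targets with their contigs and repeat until nothing changes."""
    for _ in range(max_iterations):
        bins = bin_reads(reads, targets)
        updated = {name: assemble(binned) for name, binned in bins.items()}
        new_targets = {**targets, **updated}  # targets without reads keep their seed
        if new_targets == targets:
            break
        targets = new_targets
    return targets


# Toy data: 20-base reads tiled every 5 bases across a made-up 60-base sequence,
# seeded with a 14-base fragment from its middle.
GENOME = "ATGCGTACGTTAGCCTAGGATCCGATTACAGGCTTAACGTGCATCGGATATCCGTAAGCT"
READS = [GENOME[i:i + 20] for i in range(0, len(GENOME) - 19, 5)]
RESULT = arc_sketch(READS, {"mito": GENOME[24:38]})
```

In this toy run the seed recruits only the reads that overlap it, the resulting contig becomes the new target, and each later iteration recruits reads further out until the contig stops growing. That is the reason for iterating: a short or partial seed only needs to recruit a few reads at first.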
