Apr 6, 2016

Selecting Reads for Haplotype Assembly

BioRxiv : the Preprint Server for Biology
Sarah O. Fischer, Tobias Marschall


Haplotype assembly or read-based phasing is the problem of reconstructing both haplotypes of a diploid genome from next-generation sequencing data. This problem is formalized as the Minimum Error Correction (MEC) problem and can be solved using algorithms such as WhatsHap. The runtime of WhatsHap is exponential in the maximum coverage, which is hence controlled in a pre-processing step that selects reads to be used for phasing. Here, we report on a heuristic algorithm designed to choose beneficial reads for phasing, in particular to increase the connectivity of the phased blocks and the number of correctly phased variants compared to the random selection previously employed in by WhatsHap. The algorithm we describe has been integrated into the WhatsHap software, which is available under MIT licence from https://bitbucket.org/whatshap/whatshap.

  • References
  • Citations


  • We're still populating references for this paper, please check back later.
  • References
  • Citations


  • This paper may not have been cited yet.

Mentioned in this Paper

Computer Software
Mucoepidermoid Carcinoma
Massively-Parallel Sequencing
Replication Error
Molecular Assembly/Self Assembly

About this Paper

Related Feeds

BioRxiv & MedRxiv Preprints

BioRxiv and MedRxiv are the preprint servers for biology and health sciences respectively, operated by Cold Spring Harbor Laboratory. Here are the latest preprint articles (which are not peer-reviewed) from BioRxiv and MedRxiv.

Related Papers

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology
Murray PattersonAlexander Schönhuth
Shilpa GargTobias Marschall
BioRxiv : the Preprint Server for Biology
Marcel MartinTobias Marschall
© 2020 Meta ULC. All rights reserved