Broom: application for non-redundant storage of high throughput sequencing data

Bioinformatics
Levent AlbayrakYuriy Fofanov

Abstract

The data generation capabilities of high throughput sequencing (HTS) instruments have exponentially increased over the last few years, while the cost of sequencing has dramatically decreased allowing this technology to become widely used in biomedical studies. For small labs and individual researchers, however, storage and transfer of large amounts of HTS data present a significant challenge. The recent trends in increased sequencing quality and genome coverage can be used to reconsider HTS data storage strategies. We present Broom, a stand-alone application designed to select and store only high-quality sequencing reads at extremely high compression rates. Written in C++, the application accepts single and paired-end reads in FASTQ and FASTA formats and decompresses data in FASTA format. C++ code available at https://scsb.utmb.edu/labgroups/fofanov/broom.asp. Supplementary data are available at Bioinformatics online.

References

Nov 11, 2010·Nucleic Acids Research·Rasko LeinonenUNKNOWN International Nucleotide Sequence Database Collaboration
Aug 21, 2012·Nucleic Acids Research·Daniel C JonesMichael G Katze

❮ Previous
Next ❯

Citations


❮ Previous
Next ❯

Related Concepts

Related Feeds

Bioinformatics in Biomedicine

Bioinformatics in biomedicine incorporates computer science, biology, chemistry, medicine, mathematics and statistics. Discover the latest research on bioinformatics in biomedicine here.

Related Papers

Health Management Technology
J Evans
Bioinformatics
Sebastian Deorowicz, Szymon Grabowski
© 2022 Meta ULC. All rights reserved