SOAP3: ultra-fast GPU-based parallel alignment tool for short reads

Bioinformatics
Chi-Man LiuTak-Wah Lam

Abstract

SOAP3 is the first short read alignment tool that leverages the multi-processors in a graphic processing unit (GPU) to achieve a drastic improvement in speed. We adapted the compressed full-text index (BWT) used by SOAP2 in view of the advantages and disadvantages of GPU. When tested with millions of Illumina Hiseq 2000 length-100 bp reads, SOAP3 takes < 30 s to align a million read pairs onto the human reference genome and is at least 7.5 and 20 times faster than BWA and Bowtie, respectively. For aligning reads with up to four mismatches, SOAP3 aligns slightly more reads than BWA and Bowtie; this is because SOAP3, unlike BWA and Bowtie, is not heuristic-based and always reports all answers.

References

Jan 30, 2008·Bioinformatics·Ruiqiang LiJun Wang
Nov 7, 2008·Nature·Jun WangJian Wang
Mar 6, 2009·Genome Biology·Ben LangmeadSteven L Salzberg
May 20, 2009·Bioinformatics·Heng Li, Richard Durbin
Jun 6, 2009·Bioinformatics·Ruiqiang LiJun Wang
May 13, 2010·Briefings in Bioinformatics·Heng Li, Nils Homer
Oct 29, 2010·Genome Research·Gerton Lunter, Martin Goodson

Citations

Feb 1, 2013·TheScientificWorldJournal·Marisa P Dolled-FilhartJimmy Cheng-Ho Lin
Jun 14, 2013·BMC Bioinformatics·Ayat HatemÜmit V Çatalyürek
Feb 28, 2013·BMC Bioinformatics·Yupeng ChenDouglas L Maskell
Nov 21, 2013·Algorithms for Molecular Biology : AMB·Sebastian Deorowicz, Szymon Grabowski
Jun 7, 2013·PloS One·Ruibang LuoTak-Wah Lam
Oct 15, 2013·PloS One·Ben JiaChaochun Wei
May 21, 2014·PloS One·Andrea ManconiLuciano Milanesi
Apr 11, 2014·BMC Bioinformatics·Peter KerpedjievAnders Krogh
Oct 8, 2013·Briefings in Functional Genomics·Cornelia DornSilke R Sperling
Apr 30, 2014·BioMed Research International·Jing ShangBairong Shen
Jan 28, 2014·Bioinformatics·Kaiyong Zhao, Xiaowen Chu
Apr 23, 2014·Philosophical Transactions. Series A, Mathematical, Physical, and Engineering Sciences·H FerradaS J Puglisi
Sep 2, 2014·BMC Research Notes·Miriam LeeserJames Brock
Aug 26, 2014·BioMed Research International·Claudia MisaleMarco Aldinucci
Feb 12, 2013·Briefings in Bioinformatics·Quan ZouKe Chen
Sep 26, 2014·BMC Bioinformatics·Jintao MengPavan Balaji
Oct 8, 2014·PeerJ·Johannes Köster, Sven Rahmann
Dec 26, 2013·Journal of the American Medical Informatics Association : JAMIA·Pinghao LiLucila Ohno-Machado
Aug 22, 2014·Bioinformatics·Joaquín TárragaIgnacio Medina
Feb 16, 2016·Genomics, Proteomics & Bioinformatics·Pingjian Yu, Wei Lin
Feb 24, 2016·Frontiers in Genetics·Daniel LangenkämperTim W Nattkemper
Jun 10, 2015·Journal of Applied Genetics·M Mielczarek, J Szyda
Oct 1, 2015·Journal of Bioinformatics and Computational Biology·Jianfeng YangHong Xue
Aug 29, 2012·Methods : a Companion to Methods in Enzymology·Markus HafnerDoron Betel
Nov 24, 2012·Experimental Dermatology·Manfred KunzJanet Kelso
Oct 10, 2015·IEEE/ACM Transactions on Computational Biology and Bioinformatics·Edward B FernandezWalid A Najjar
Aug 14, 2012·Current Opinion in Structural Biology·Nicholas FurnhamJanet M Thornton
Feb 25, 2015·BMC Systems Biology·Takehiro ShimodaYutaka Akiyama
Mar 26, 2015·Frontiers in Bioengineering and Biotechnology·Andrea ManconiLuciano Milanesi
Jul 23, 2013·Journal of Biomedical Informatics·Aisling O'DriscollRoy D Sleator
Jan 30, 2015·BMC Bioinformatics·José SalavertIgnacio Blanquer
Jun 14, 2016·IEEE/ACM Transactions on Computational Biology and Bioinformatics·Yongchao LiuBertil Schmidt
Jul 13, 2016·Briefings in Bioinformatics·Marco S NobileDaniela Besozzi
Nov 4, 2015·IEEE/ACM Transactions on Computational Biology and Bioinformatics·David NogueiraNuno Roma
Sep 27, 2016·Bioinformatics·Bo LiuYadong Wang
Apr 17, 2013·Journal of Clinical Oncology : Official Journal of the American Society of Clinical Oncology·Eliezer M Van AllenMia A Levy
Apr 1, 2016·Scientific Reports·Babu ValliyodanHenry T Nguyen
Jan 4, 2018·BMC Bioinformatics·Quang TranVinhthuy Phan
Jul 1, 2016·IEEE/ACM Transactions on Computational Biology and Bioinformatics·Ahmad Al KawamAniruddha Datta
Nov 9, 2018·Bioinformatics·Sebastian DeorowiczSzymon Grabowski
Feb 26, 2014·BMC Bioinformatics·Andrea ManconiLuciano Milanesi
Sep 13, 2019·Bioinformatics·Guillaume MarçaisCarl Kingsford
Jan 1, 2013·Current Protocols in Bioinformatics·Shengchang GuXun Xu
Dec 11, 2019·Genes·Lizhen Shi, Zhong Wang
May 8, 2019·Scientific Reports·Xiaolei XieXiaoyan Guo
Mar 11, 2016·Interdisciplinary Sciences, Computational Life Sciences·Chiranjib ChakrabortyGovindasamy Agoramoorthy
Apr 11, 2018·Nature Communications·Alexander LachmannAvi Ma'ayan
Aug 9, 2017·BioData Mining·W B Langdon, Brian Yee Hong Lam
Feb 2, 2017·Kyle RupnowGowthami Jayashri Manikandan

Related Concepts

Computer Programs and Programming
Genome, Human
Determination, Sequence Homology
Sequence Determinations, DNA
High-Throughput Nucleotide Sequencing
Genome
BWA 4C
Parallel Study

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Neural Activity: Imaging

Imaging of neural activity in vivo has developed rapidly recently with the advancement of fluorescence microscopy, including new applications using miniaturized microscopes (miniscopes). This feed follows the progress in this growing field.

The Tendon Seed Network

Tendons are rich in the extracellular matrix and are abundant throughout the body providing essential roles including structure and mobility. The transcriptome of tendons is being compiled to understand the micro-anatomical functioning of tendons. Discover the latest research pertaining to the Tendon Seed Network here.

Myocardial Stunning

Myocardial stunning is a mechanical dysfunction that persists after reperfusion of previously ischemic tissue in the absence of irreversible damage including myocardial necrosis. Here is the latest research.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Incretins

Incretins are metabolic hormones that stimulate a decrease in glucose levels in the blood and they have been implicated in glycemic regulation in the remission phase of type 1 diabetes. Here is the latest research.

Chromatin Regulation and Circadian Clocks

The circadian clock plays an important role in regulating transcriptional dynamics through changes in chromatin folding and remodelling. Discover the latest research on Chromatin Regulation and Circadian Clocks here.

Long COVID-19

“Long Covid-19” describes illness in patients who are reporting long-lasting effects of the SARS-CoV-19 infection, often long after they have recovered from acute Covid-19. Ongoing health issues often reported include low exercise tolerance and breathing difficulties, chronic tiredness, and mental health problems such as post-traumatic stress disorder and depression. This feed follows the latest research into Long Covid.

Spatio-Temporal Regulation of DNA Repair

DNA repair is a complex process regulated by several different classes of enzymes, including ligases, endonucleases, and polymerases. This feed focuses on the spatial and temporal regulation that accompanies DNA damage signaling and repair enzymes and processes.