Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly.

Bioinformatics
Heng Li

Abstract

Eugene Myers in his string graph paper suggested that in a string graph or equivalently a unitig graph, any path spells a valid assembly. As a string/unitig graph also encodes every valid assembly of reads, such a graph, provided that it can be constructed correctly, is in fact a lossless representation of reads. In principle, every analysis based on whole-genome shotgun sequencing (WGS) data, such as SNP and insertion/deletion (INDEL) calling, can also be achieved with unitigs. To explore the feasibility of using de novo assembly in the context of resequencing, we developed a de novo assembler, fermi, that assembles Illumina short reads into unitigs while preserving most of the information in the input reads. SNPs and INDELs can be called by mapping the unitigs against a reference genome. By applying the method to 35-fold human resequencing data, we showed that, in comparison to the standard pipeline, our approach yields similar accuracy for SNP calling and better results for INDEL calling. It has higher sensitivity than other de novo assembly-based methods for variant calling. Our work suggests that variant calling with de novo assembly can be a beneficial complement to the standard variant calling pipeline for whole-genome resequ...
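
To make the idea of calling variants from mapped unitigs concrete, the following is a minimal sketch in Python. It is not fermi's implementation. It assumes that a unitig has already been aligned to a reference segment by some external aligner and that the result is available as two gapped strings of equal length, with "-" marking a gap and no column gapped in both strings. The function name `call_variants`, the tuple layout, and the 0-based coordinates are hypothetical choices made for illustration.

```python
from typing import List, Tuple

# (0-based reference position, variant type, reference allele, unitig allele)
Variant = Tuple[int, str, str, str]


def call_variants(ref_aln: str, unitig_aln: str, ref_start: int = 0) -> List[Variant]:
    """Scan a gapped pairwise alignment and report SNPs and INDELs.

    Assumption: both strings come from the same alignment, have equal
    length, and use '-' for gaps. No column is a gap in both strings.
    """
    if len(ref_aln) != len(unitig_aln):
        raise ValueError("aligned strings must have equal length")

    variants: List[Variant] = []
    pos = ref_start  # reference coordinate of the next reference base
    i, n = 0, len(ref_aln)
    while i < n:
        r, u = ref_aln[i], unitig_aln[i]
        if r == "-":
            # Insertion in the unitig: collect the whole run of gap columns.
            j = i
            while j < n and ref_aln[j] == "-":
                j += 1
            # Reported at the reference base that follows the insertion.
            variants.append((pos, "INS", "", unitig_aln[i:j]))
            i = j
        elif u == "-":
            # Deletion from the unitig: collect the deleted reference bases.
            j = i
            while j < n and unitig_aln[j] == "-":
                j += 1
            variants.append((pos, "DEL", ref_aln[i:j], ""))
            pos += j - i
            i = j
        else:
            if r != u:
                variants.append((pos, "SNP", r, u))
            pos += 1
            i += 1
    return variants


if __name__ == "__main__":
    for v in call_variants("ACGT-ACGTAC", "ACCTGAC--AC"):
        print(v)
```

A real pipeline would additionally need to handle alignment quality, heterozygous sites represented by alternative unitigs, and filtering, none of which this sketch attempts.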


