Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Genome Research
Heng Li, Richard Durbin

Abstract

New sequencing technologies promise a new era in the use of DNA sequence. However, some of these technologies produce very short reads, typically of a few tens of base pairs, and to use these reads effectively requires new algorithms and software. In particular, there is a major issue in efficiently aligning short reads to a reference genome and handling ambiguity or lack of accuracy in this alignment. Here we introduce the concept of mapping quality, a measure of the confidence that a read actually comes from the position it is aligned to by the mapping algorithm. We describe the software MAQ that can build assemblies by mapping shotgun short reads to a reference genome, using quality scores to derive genotype calls of the consensus sequence of a diploid genome, e.g., from a human sample. MAQ makes full use of mate-pair information and estimates the error probability of each read alignment. Error probabilities are also derived for the final genotype calls, using a Bayesian statistical model that incorporates the mapping qualities, error probabilities from the raw sequence quality scores, sampling of the two haplotypes, and an empirical model for correlated errors at a site. Both read mapping and genotype calling are evaluated ...
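The idea of a mapping quality can be illustrated with a small Python sketch. It is a simplified illustration under stated assumptions, not MAQ's actual implementation: it assumes every candidate position for a read is known, that all positions are equally likely a priori, and that the likelihood of a read at a position depends only on the summed base qualities of its mismatches there. The function names `phred` and `mapping_quality` and the cap of 99 are hypothetical choices made for this example.

```python
import math


def phred(p_error, cap=99):
    """Convert an error probability to a Phred-scaled quality (-10 * log10 p).

    The cap is an arbitrary choice for this sketch; it avoids an infinite
    score when the error probability is zero.
    """
    if p_error <= 0:
        return cap
    return min(cap, round(-10 * math.log10(p_error)))


def mapping_quality(mismatch_quality_sums, cap=99):
    """Return (index of best candidate, Phred-scaled mapping quality).

    mismatch_quality_sums[i] is the sum of base qualities at the mismatched
    bases when the read is placed at candidate position i. Assumption: the
    likelihood of the read at position i is 10 ** (-sum / 10), and the prior
    over candidate positions is uniform.
    """
    likelihoods = [10 ** (-q / 10) for q in mismatch_quality_sums]
    total = sum(likelihoods)
    best = max(range(len(likelihoods)), key=likelihoods.__getitem__)
    # Posterior probability that the read does NOT come from the best position.
    others = sum(l for i, l in enumerate(likelihoods) if i != best)
    return best, phred(others / total, cap)


if __name__ == "__main__":
    # Two equally good placements: the read is ambiguous, so quality is low.
    print(mapping_quality([20, 20]))  # (0, 3)
    # The best placement is perfect; the alternative costs 30 quality units.
    print(mapping_quality([0, 30]))   # (0, 30)
```

A read with two equally good placements receives a quality near 3 (a 50% chance of being misplaced), while a read whose only alternative placement requires high-quality mismatches receives a much higher score.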
