May 16, 1998

Base-calling of automated sequencer traces using phred. II. Error probabilities

Genome Research
B Ewing, P Green

Abstract

Elimination of the data processing bottleneck in high-throughput sequencing will require both improved accuracy of data processing software and reliable measures of that accuracy. We have developed and implemented in our base-calling program phred the ability to estimate a probability of error for each base-call, as a function of certain parameters computed from the trace data. These error probabilities are shown here to be valid (correspond to actual error rates) and to have high power to discriminate correct base-calls from incorrect ones, for read data collected under several different chemistries and electrophoretic conditions. They play a critical role in our assembly program phrap and our finishing program consed.

Mentioned in this Paper

Shuttle Vectors
Chimera Organism
Computer Programs and Programming
Sequencing
Data Interpretation, Statistical
Sequence Determinations, DNA
Probability
Discriminant Analysis
Xeroderma Pigmentosum, Variant Type
Human Genome Diversity Project

About this Paper

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Synthetic Genetic Array Analysis

Synthetic genetic arrays allow the systematic examination of genetic interactions. Here is the latest research focusing on synthetic genetic arrays and their analyses.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

Head And Neck Squamous Cell Carcinoma

Squamous cell carcinomas account for >90% of all tumors in the head and neck region. Head and neck squamous cell carcinoma incidence has increased dramatically recently with little improvement in patient outcomes. Here is the latest research on this aggressive malignancy.

Signaling in Adult Neurogenesis

Neural stem cells play a critical role in the production of neuronal cells in neurogenesis is of great importance. Of interest is the role signalling mechanisms in adult neurogenesis. Discover the latest research on signalling in adult neurogenesis.

Psychiatric Chronotherapy

Psychiatric Chronotherapy considers the circadian rhythm as a major factor for optimizing therapeutic efficacy of psychiatric interventions. Discover the latest research on Psychiatric Chronotherapy here.

Epigenetic Memory

Epigenetic memory refers to the heritable genetic changes that are not explained by the DNA sequence. Find the latest research on epigenetic memory here.

Bone Marrow Neoplasms

Bone Marrow Neoplasms are cancers that occur in the bone marrow. Discover the latest research on Bone Marrow Neoplasms here.