Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes.

Nature Biotechnology
Kishwar ShafinBenedict Paten

Abstract

De novo assembly of a human genome using nanopore long-read sequences has been reported, but it used more than 150,000 CPU hours and weeks of wall-clock time. To enable rapid human genome assembly, we present Shasta, a de novo long-read assembler, and polishing algorithms named MarginPolish and HELEN. Using a single PromethION nanopore sequencer and our toolkit, we assembled 11 highly contiguous human genomes de novo in 9 d. We achieved roughly 63× coverage, 42-kb read N50 values and 6.5× coverage in reads >100 kb using three flow cells per sample. Shasta produced a complete haploid human genome assembly in under 6 h on a single commercial compute node. MarginPolish and HELEN polished haploid assemblies to more than 99.9% identity (Phred quality score QV = 30) with nanopore reads alone. Addition of proximity-ligation sequencing enabled near chromosome-level scaffolds for all 11 genomes. We compare our assembly performance to existing methods for diploid, haploid and trio-binned human samples and report superior accuracy and speed.

References

Apr 6, 2002·Bioinformatics·Christopher LeeMark F Sharlow
Feb 5, 2004·Genome Biology·Stefan KurtzSteven L Salzberg
May 15, 2004·Nature Reviews. Genetics·Evan E EichlerXinwei She
Sep 7, 2007·PLoS Biology·Samuel LevyJ Craig Venter
Oct 28, 2008·Bioinformatics·Jason R MillerGranger Sutton
Nov 22, 2008·Science·John EidStephen Turner
Mar 2, 2011·Nature Reviews. Genetics·Can AlkanEvan E Eichler
Jun 15, 2011·Genome Research·Benedict PatenDavid Haussler
Jun 2, 2012·Methods : a Companion to Methods in Enzymology·Jon-Matthew BeltonJob Dekker
Sep 8, 2012·Genome Research·Jennifer HarrowTim J Hubbard
Nov 7, 2012·Nature·UNKNOWN 1000 Genomes Project ConsortiumGil A McVean
May 7, 2013·Nature Methods·Chen-Shan ChinJonas Korlach
May 15, 2013·Seminars in Cell & Developmental Biology·Ester Falconer, Peter M Lansdorp
Feb 7, 2015·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Murray PattersonAlexander Schönhuth
Feb 17, 2015·Nature Methods·Miten JainMark Akeson
May 26, 2015·Nature Biotechnology·Konstantin BerlinAdam M Phillippy
Jun 11, 2015·Bioinformatics·Felipe A SimãoEvgeny M Zdobnov
Jun 16, 2015·Nature Methods·Nicholas J LomanJared T Simpson
Oct 4, 2015·Nature·UNKNOWN 1000 Genomes Project ConsortiumGonçalo R Abecasis
Feb 6, 2016·Genome Research·Nicholas H PutnamRichard E Green
Nov 1, 2016·Nature Methods·Chen-Shan ChinMichael C Schatz
Jan 20, 2017·Genome Research·Robert VaserMile Šikić
Apr 7, 2017·Genome Research·Neil I WeisenfeldDavid B Jaffe
Feb 13, 2018·Nature Biotechnology·Miten JainMatthew Loose
Mar 20, 2018·Nature Biotechnology·Miten JainKaren H Miga
May 2, 2018·Nature Methods·Fritz J SedlazeckMichael C Schatz
May 12, 2018·Bioinformatics·Heng Li
Jun 9, 2018·Science·Zev N KronenbergEvan E Eichler
Jun 29, 2018·Bioinformatics·Alla MikheenkoAlexey Gurevich
Jun 29, 2018·Bioinformatics·Shilpa GargTobias Marschall
Sep 25, 2018·Nature Biotechnology·Ryan PoplinMark A DePristo
Oct 23, 2018·Nature Biotechnology·Sergey KorenAdam M Phillippy
Jan 4, 2019·Nature Communications·Maksim KunitskiReinhard Dörner
Jan 22, 2019·Cell·Peter A AudanoEvan E Eichler
Apr 3, 2019·Nature Biotechnology·Mikhail KolmogorovPavel A Pevzner
Apr 3, 2019·Nature Biotechnology·Justin M ZookMarc Salit
Apr 18, 2019·Nature Communications·Mark J P ChaissonCharles Lee

❮ Previous
Next ❯

Citations

Sep 17, 2020·Scientific Reports·Anne-Laure BoutignyMathieu Rolland
Sep 24, 2020·Nature Communications·Chen-Shan ChinJustin M Zook
Jun 7, 2020·Nature Reviews. Genetics·Glennis A LogsdonEvan E Eichler
Nov 13, 2020·PLoS Computational Biology·Hyungtaek JungSeong-Il Eyun
Nov 25, 2020·Proceedings of the National Academy of Sciences of the United States of America·Huei-Mien KeIsheng Jason Tsai
Dec 2, 2020·Nature Biotechnology·Sam KovakaMichael C Schatz
Dec 18, 2020·Microbial Genomics·Stephen J Bush
Jan 10, 2021·Genome Biology·Guillaume HolleyBjarni V Halldorsson
Nov 16, 2020·The International Journal of Biochemistry & Cell Biology·Alexander T Dilthey
Feb 2, 2021·Pharmacogenomics·Sylvan Manuel CasparGabor Matyas
Feb 2, 2021·Frontiers in Genetics·Alexey A DmitrievNataliya V Melnikova
Dec 5, 2020·International Journal of Molecular Sciences·Zhao ChenJianghong Meng
Dec 16, 2020·Nature Biotechnology·Alex Di GenovaMarie-France Sagot
Jan 6, 2021·Nature Communications·Ying ChenChuan-Le Xiao
Mar 23, 2021·Human Immunology·Taishan HuAnh Dinh
Apr 6, 2021·Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences·Jin SunKevin M Kocot
Mar 12, 2021·Human Molecular Genetics·Satomi MitsuhashiHiroaki Mitsuhashi
Apr 4, 2021·Genome Research·T Rhyker Ranallo-BenavidezFritz J Sedlazeck
May 6, 2021·Journal of Personalized Medicine·Bram Peter PrinsHarold Snieder
May 30, 2021·Nature Reviews. Genetics·Wouter De CosterFritz J Sedlazeck
Jun 5, 2021·Nature Genetics·Robert W DaviesSimon Myers
May 18, 2021·The Journal of Biological Chemistry·Apple Cortez VollmersChristopher Vollmers
Jun 7, 2021·BMC Bioinformatics·Nadège GuiglielmoniJean-François Flot
Jul 25, 2021·Nature Communications·Riccardo VicedominiRayan Chikhi
May 1, 2021·Annual Review of Genomics and Human Genetics·Karen H Miga, Ting Wang

❮ Previous
Next ❯

Methods Mentioned

BETA
Illumina sequencing
HiC
electrophoresis
PCR
Whole Genome Sequencing

Software Mentioned

Pilon
Racon
Pomoxis
Guppy Flipflop
Amazon Web Services ( AWS
samtools
get
align
Shasta http
NGx

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.