Jan 6, 2021Paper

Emerging SARS-CoV-2 diversity revealed by rapid whole genome sequence typing

BioRxiv : the Preprint Server for Biology
Ahmed M Moustafa, Paul J Planet


Discrete classification of SARS-CoV-2 viral genotypes can identify emerging strains and detect geographic spread, viral diversity, and transmission events. We developed a tool (GNUVID) that integrates whole genome multilocus sequence typing and a supervised machine learning random forest-based classifier. We used GNUVID to assign sequence type (ST) profiles to each of 69,686 SARS-CoV-2 complete, high-quality genomes available from GISAID as of October 20 th 2020. STs were then clustered into clonal complexes (CCs), and then used to train a machine learning classifier. We used this tool to detect potential introduction and exportation events, and to estimate effective viral diversity across locations and over time in 16 US states. GNUVID is a scalable tool for viral genotype classification (available at https://github.com/ahmedmagds/GNUVID ) that can be used to quickly process tens of thousands of genomes. Our genotyping ST/CC analysis uncovered dynamic local changes in ST/CC prevalence and diversity with multiple replacement events in different states. We detected an average of 20.6 putative introductions and 7.5 exportations for each state. Effective viral diversity dropped in all states as shelter-in-place travel-restrictions...Continue Reading

Related Concepts

Related Feeds

BioRxiv & MedRxiv Preprints

BioRxiv and MedRxiv are the preprint servers for biology and health sciences respectively, operated by Cold Spring Harbor Laboratory. Here are the latest preprint articles (which are not peer-reviewed) from BioRxiv and MedRxiv.

Related Papers

BioRxiv : the Preprint Server for Biology
A. M. Moustafa, Paul Planet
BioRxiv : the Preprint Server for Biology
A. M. Moustafa, Paul Planet
MedRxiv : the Preprint Server for Health Sciences
A. Crits-ChristophKara L Nelson
BioRxiv : the Preprint Server for Biology
Nzungize LambertF. Zakham
© 2021 Meta ULC. All rights reserved