A Python package for parsing, validating, mapping and formatting sequence variants using HGVS nomenclature

Reece K HartVincent A Fusaro


Biological sequence variants are commonly represented in scientific literature, clinical reports and databases of variation using the mutation nomenclature guidelines endorsed by the Human Genome Variation Society (HGVS). Despite the widespread use of the standard, no freely available and comprehensive programming libraries are available. Here we report an open-source and easy-to-use Python library that facilitates the parsing, manipulation, formatting and validation of variants according to the HGVS specification. The current implementation focuses on the subset of the HGVS recommendations that precisely describe sequence-level variation relevant to the application of high-throughput sequencing to clinical diagnostics. The package is released under the Apache 2.0 open-source license. Source code, documentation and issue tracking are available at http://bitbucket.org/hgvs/hgvs/. Python packages are available at PyPI (https://pypi.python.org/pypi/hgvs). Supplementary data are available at Bioinformatics online.


Apr 5, 2002·Genome Research·W James Kent
Jun 22, 2010·Bioinformatics·William McLarenFiona Cunningham
Feb 11, 2011·Human Mutation·Peter E M Taschner, Johan T den Dunnen
Oct 26, 2011·BMC Bioinformatics·Jeroen F J LarosPeter E M Taschner
Dec 22, 2011·BMC Bioinformatics·Baoyong WangDavid Sankoff
Dec 23, 2011·BMC Bioinformatics·Ying WangHuaijun Zhou
Jun 22, 2013·Genetics in Medicine : Official Journal of the American College of Medical Genetics·Robert C GreenAmerican College of Medical Genetics and Genomics


Nov 11, 2015·Nature Reviews. Genetics·Anthony J Brookes, Peter N Robinson
Jun 21, 2015·Trends in Genetics : TIG·Mauno Vihinen
Feb 27, 2016·Human Mutation·Raymond Dalgleish
Aug 17, 2016·Nature Reviews. Genetics·Euan A Ashley
Dec 27, 2018·PLoS Genetics·Melissa S ClineAmanda B Spurdle
May 1, 2019·Der Pathologe·S Merkelbach-BruseF Klauschen
Jul 2, 2016·Current Protocols in Human Genetics·Johan T den Dunnen
Jan 24, 2018·Journal of Medical Genetics·Sally Ann LynchJillian Casey
Nov 11, 2019·Autism Research : Official Journal of the International Society for Autism Research·Manuel BelmadaniSanja Rogic
Mar 5, 2016·Human Mutation·Johan T den DunnenPeter E M Taschner
Jan 17, 2019·Thyroid : Official Journal of the American Thyroid Association·Marina N NikiforovaYuri E Nikiforov

Related Concepts

Trending Feeds


Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Spatio-Temporal Regulation of DNA Repair

DNA repair is a complex process regulated by several different classes of enzymes, including ligases, endonucleases, and polymerases. This feed focuses on the spatial and temporal regulation that accompanies DNA damage signaling and repair enzymes and processes.

Glut1 Deficiency

Glut1 deficiency, an autosomal dominant, genetic metabolic disorder associated with a deficiency of GLUT1, the protein that transports glucose across the blood brain barrier, is characterized by mental and motor developmental delays and infantile seizures. Follow the latest research on Glut1 deficiency with this feed.

Hereditary Sensory Autonomic Neuropathy

Hereditary Sensory Autonomic Neuropathies are a group of inherited neurodegenerative disorders characterized clinically by loss of sensation and autonomic dysfunction. Here is the latest research on these neuropathies.

Separation Anxiety

Separation anxiety is a type of anxiety disorder that involves excessive distress and anxiety with separation. This may include separation from places or people to which they have a strong emotional connection with. It often affects children more than adults. Here is the latest research on separation anxiety.

Neural Activity: Imaging

Imaging of neural activity in vivo has developed rapidly recently with the advancement of fluorescence microscopy, including new applications using miniaturized microscopes (miniscopes). This feed follows the progress in this growing field.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.