A Python-Based Pipeline for Preprocessing LC-MS Data for Untargeted Metabolomics Workflows

Gabriel RiquelmeMaría Eugenia Monge


Preprocessing data in a reproducible and robust way is one of the current challenges in untargeted metabolomics workflows. Data curation in liquid chromatography-mass spectrometry (LC-MS) involves the removal of biologically non-relevant features (retention time, m/z pairs) to retain only high-quality data for subsequent analysis and interpretation. The present work introduces TidyMS, a package for the Python programming language for preprocessing LC-MS data for quality control (QC) procedures in untargeted metabolomics workflows. It is a versatile strategy that can be customized or fit for purpose according to the specific metabolomics application. It allows performing quality control procedures to ensure accuracy and reliability in LC-MS measurements, and it allows preprocessing metabolomics data to obtain cleaned matrices for subsequent statistical analysis. The capabilities of the package are shown with pipelines for an LC-MS system suitability check, system conditioning, signal drift evaluation, and data curation. These applications were implemented to preprocess data corresponding to a new suite of candidate plasma reference materials developed by the National Institute of Standards and Technology (NIST; hypertriglyceride...Continue Reading


Dec 2, 2008·BMC Bioinformatics·Ralf TautenhahnSteffen Neumann
Oct 12, 2012·Nature Biotechnology·Matthew C ChambersParag Mallick
Sep 1, 2007·Metabolomics : Official Journal of the Metabolomic Society·Lloyd W SumnerMark R Viant
Dec 21, 2014·Bioinformatics·Franck GiacomoniChristophe Caron
May 6, 2015·Nature Methods·Hiroshi TsugawaMasanori Arita
Mar 16, 2016·Scientific Data·Mark D WilkinsonBarend Mons
Aug 31, 2016·Nature Methods·Hannes L RöstOliver Kohlbacher
Aug 22, 2017·Journal of Pharmaceutical and Biomedical Analysis·Danuta DudzikCoral Barbas
Mar 5, 2019·Metabolomics : Official Journal of the Metabolomic Society·Richard D BegerKrista A Zanetti
Mar 19, 2019·Annual Review of Analytical Chemistry·María Eugenia MongeFacundo M Fernández
Jul 12, 2019·Nature Communications·Mark R ViantRalf J M Weber
Oct 19, 2019·Analytical Chemistry·Carolina González-RianoCoral Barbas
Nov 7, 2019·Nucleic Acids Research·Kenneth HaugClaire O'Donovan
Feb 5, 2020·Nature Methods·Pauli VirtanenSciPy 1.0 Contributors
Apr 5, 2020·Metabolites·Anton KlåvusKati Hanhineva
Apr 11, 2020·Journal of the American Chemical Society·Miriam Sindelar, Gary J Patti
Apr 12, 2020·European Journal of Mass Spectrometry·Biswapriya B Misra
Apr 15, 2020·Analytical and Bioanalytical Chemistry·Paula Cuevas-DelgadoCoral Barbas
Oct 13, 2020·Metabolomics : Official Journal of the Metabolomic Society·Anne M EvansMetabolomics Quality Assurance, Quality Control Consortium (mQACC)

Related Concepts

United States National Institutes of Health
Technology Assessment
African American
Liquid Chromatography Mass Spectrometry
Electronic Health Records

Trending Feeds


Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Systemic Juvenile Idiopathic Arthritis

Systemic juvenile idiopathic arthritis is a rare rheumatic disease that affects children. Symptoms include joint pain, but also fevers and skin rashes. Here is the latest on this disease.

Chromatin Regulation and Circadian Clocks

The circadian clock plays an important role in regulating transcriptional dynamics through changes in chromatin folding and remodelling. Discover the latest research on Chromatin Regulation and Circadian Clocks here.

Central Pontine Myelinolysis

Central Pontine Myelinolysis is a neurologic disorder caused most frequently by rapid correction of hyponatremia and is characterized by demyelination that affects the central portion of the base of the pons. Here is the latest research on this disease.

Myocardial Stunning

Myocardial stunning is a mechanical dysfunction that persists after reperfusion of previously ischemic tissue in the absence of irreversible damage including myocardial necrosis. Here is the latest research.

Pontocerebellar Hypoplasia

Pontocerebellar hypoplasias are a group of neurodegenerative autosomal recessive disorders with prenatal onset, atrophy or hypoplasia of the cerebellum, hypoplasia of the ventral pons, microcephaly, variable neocortical atrophy and severe mental and motor impairments. Here is the latest research on pontocerebellar hypoplasia.

Cell Atlas Along the Gut-Brain Axis

Profiling cells along the gut-brain axis at the single cell level will provide unique information for each cell type, a three-dimensional map of how cell types work together to form tissues, and insights into how changes in the map underlie health and disease of the GI system and its crosstalk with the brain. Disocver the latest research on single cell analysis of the gut-brain axis here.

Chronic Traumatic Encephalopathy

Chronic Traumatic Encephalopathy (CTE) is a progressive degenerative disease that occurs in individuals that suffer repetitive brain trauma. Discover the latest research on traumatic encephalopathy here.

Related Papers

Methods in Molecular Biology
Samantha Riccadonna, Pietro Franceschi
Journal of the American Chemical Society
Miriam Sindelar, Gary J Patti
Journal of Inherited Metabolic Disease
Ilya Gertsman, Bruce A Barshop
© 2021 Meta ULC. All rights reserved