Standardization and denoising algorithms for mass spectra to classify whole-organism bacterial specimens

Bioinformatics
Glen A Satten, John R Barr

Abstract

Application of mass spectrometry in proteomics is a breakthrough in high-throughput analyses. Early applications have focused on protein expression profiles to differentiate among various types of tissue samples (e.g. normal versus tumor). Here our goal is to use mass spectra to differentiate bacterial species using whole-organism samples. The raw spectra are similar to spectra of tissue samples, raising some of the same statistical issues (e.g. non-uniform baselines and higher noise associated with higher baseline), but are substantially noisier. As a result, new preprocessing procedures are required before these spectra can be used for statistical classification. In this study, we introduce novel preprocessing steps that can be used with any mass spectra. These comprise a standardization step and a denoising step. The noise level for each spectrum is determined using only data from that spectrum. Only spectral features that exceed a threshold defined by the noise level are subsequently used for classification. Using this approach, we trained the Random Forest program to classify 240 mass spectra into four bacterial types. The method resulted in zero prediction errors in the training samples and in two test datasets having 240...
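The abstract does not give the formulas behind the standardization and denoising steps, so the following is only a minimal sketch of the general idea: estimate a noise level from a single spectrum and keep only the points that exceed a threshold tied to that level. The running-median baseline, the median-absolute-deviation noise estimate, the function names `standardize` and `denoise`, and the parameters `window` and `k` are all illustrative assumptions, not the authors' procedure.

```python
import numpy as np


def standardize(intensity, window=51):
    """Remove a crude baseline and rescale to noise units (illustrative only).

    Assumption: a running median stands in for the baseline, and the noise
    level is a robust spread estimate computed from this spectrum alone.
    """
    intensity = np.asarray(intensity, dtype=float)
    half = window // 2
    # Pad with edge values so every point has a full window around it.
    padded = np.pad(intensity, half, mode="edge")
    baseline = np.array(
        [np.median(padded[i:i + window]) for i in range(intensity.size)]
    )
    residual = intensity - baseline
    # Median absolute deviation, scaled to match a Gaussian standard deviation.
    noise = 1.4826 * np.median(np.abs(residual - np.median(residual)))
    return residual / noise if noise > 0 else residual


def denoise(standardized, k=3.0):
    """Keep only points that exceed k noise units; set the rest to zero."""
    standardized = np.asarray(standardized, dtype=float)
    return np.where(standardized > k, standardized, 0.0)
```

Calling `denoise(standardize(spectrum))` on each spectrum would give one thresholded feature vector per sample. Under the same assumptions, those vectors could then be stacked into a matrix and passed to a random forest classifier, for example scikit-learn's `RandomForestClassifier`, which may differ from the Random Forest program the authors used.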

