Using 16S rRNA gene as marker to detect unknown bacteria in microbial communities

BMC Bioinformatics
Quang Tran, Vinhthuy Phan

Abstract

Quantifying and identifying microbial genomes from next-generation sequencing data is a challenging problem in metagenomics. Current methods have mostly focused on bacteria whose genomes have been sequenced, but such analyses are complicated by the presence of unknown bacteria, that is, bacteria whose genomes have not been sequenced. We propose a method for detecting unknown bacteria in environmental samples. Our approach is unique in that it uses short reads from 16S rRNA genes only, not from entire genomes. We show that short reads from 16S rRNA genes retain sufficient information for detecting unknown bacteria in oral microbial communities. In experiments with bacterial genomes from the Human Oral Microbiome Database, we found that this method made accurate and robust predictions across different read coverages and percentages of unknown bacteria. Advantages of this approach include not only lower experimental and computational costs but also potentially high accuracy across environmental samples, owing to the strong conservation of the 16S rRNA gene.
