A bioinformatic pipeline for NGS data analysis and mutation calling in human solid tumors

Biomedit︠s︡inskai︠a︡ khimii︠a︡
K Yu TsukanovV V Ilinsky

Abstract

We aimed to develop a pipeline for the bioinformatic analysis and interpretation of NGS data and detection of a wide range of single-nucleotide somatic mutations within tumor DNA. Initially, the NGS reads were submitted to a quality control check by the Cutadapt program. Low-quality 3¢-nucleotides were removed. After that the reads were mapped to the reference genome hg19 (GRCh37.p13) by BWA. The SAMtools program was used for exclusion of duplicates. MuTect was used for SNV calling. The functional effect of SNVs was evaluated using the algorithm, including annotation and evaluation of SNV pathogenicity by SnpEff and analysis of such databases as COSMIC, dbNSFP, Clinvar, and OMIM. The effect of SNV on the protein function was estimated by SIFT and PolyPhen2. Mutation frequencies were obtained from 1000 Genomes and ExAC projects, as well as from our own databases with frequency data. In order to evaluate the pipeline we used 18 breast cancer tumor biopsies. The MYbaits Onconome KL v1.5 Panel ("MYcroarray") was used for targeted enrichment. NGS was performed on the Illumina HiSeq 2500 platform. As a result, we identified alterations in BRCA1, BRCA2, ATM, CDH1, CHEK2, TP53 genes that affected the sequence of encoded proteins. Our p...Continue Reading

References

May 20, 2003·Human Mutation·Peter D StensonDavid N Cooper
Jun 26, 2003·Nucleic Acids Research·Pauline C Ng, Steven Henikoff
May 20, 2009·Bioinformatics·Heng Li, Richard Durbin
Jun 10, 2009·Bioinformatics·Heng LiUNKNOWN 1000 Genome Project Data Processing Subgroup
Jul 16, 2010·La Revue de médecine interne·J MartinM Hamidou
Sep 17, 2010·Human Molecular Genetics·Li DingElaine R Mardis
Jan 15, 2013·Current Protocols in Human Genetics·Ivan AdzhubeiShamil R Sunyaev
Feb 12, 2013·Nature Biotechnology·Kristian CibulskisGad Getz
Jun 3, 2014·BMC Bioinformatics·Su Yeon KimTerence P Speed
Oct 4, 2015·Nature·UNKNOWN 1000 Genomes Project ConsortiumGonçalo R Abecasis
Nov 20, 2015·Nucleic Acids Research·Melissa J LandrumDonna R Maglott
Aug 19, 2016·Nature·Monkol LekUNKNOWN Exome Aggregation Consortium
Oct 12, 2016·Current Protocols in Human Genetics·S A ForbesP J Campbell

❮ Previous
Next ❯

Citations

Aug 24, 2018·Biomedit︠s︡inskai︠a︡ khimii︠a︡·D S MikhaylenkoM V Nemtsova
Aug 18, 2018·Experimental and Therapeutic Medicine·Bo JiaQin Li

❮ Previous
Next ❯

Related Concepts

Related Feeds

Ataxia telangiectasia

Ataxia telangiectasia is a rare neurodegenerative diseases caused by defects in the ATM gene, which is involved in DNA damage recognition and repair pathways. Here is the latest research on this autosomal recessive disease.

Cancer Sequencing

Several sequencing approaches are employed to understand and examine tumor development and progression. These include whole genome as well as RNA sequencing. Here is the latest research on cancer sequencing.

Cadherins and Catenins

Cadherins (named for "calcium-dependent adhesion") are a type of cell adhesion molecule (CAM) that is important in the formation of adherens junctions to bind cells with each other. Catenins are a family of proteins found in complexes with cadherin cell adhesion molecules of animal cells: alpha-catenin can bind to β-catenin and can also bind actin. β-catenin binds the cytoplasmic domain of some cadherins. Discover the latest research on cadherins and catenins here.

Ataxia telangiectasia (MDS)

Ataxia telangiectasia is a rare neurodegenerative diseases caused by defects in the ATM gene, which is involved in DNA damage recognition and repair pathways. Here is the latest research on this autosomal recessive disease.

Breast Cancer: BRCA1 & BRCA2

Mutations involving BRCA1, found on chromosome 17, and BRCA2, found on chromosome 13, increase the risk for specific cancers, such as breast cancer. Discover the last research on breast cancer BRCA1 and BRCA2 here.