Genotyping Allelic and Copy Number Variation in the Immunoglobulin Heavy Chain Locus

BioRxiv : the Preprint Server for Biology
Shishi LuoYun S. Song

Abstract

The study of genomic regions that contain gene copies and structural variation is a major challenge in modern genomics. Unlike variation involving single nucleotide changes, data on the variation of copy number is difficult to collect and few tools exist for analyzing the variation between individuals. The immunoglobulin heavy variable (IGHV) locus, which plays an integral role in the adaptive immune response, is an example of a genomic region that is known to vary in gene copy number. Lack of standard methods to genotype this region prevents it from being included in association studies and is holding back the growing field of antibody repertoire analysis. Here, we establish a convention of representing the locus in terms of a reference panel of operationally distinguishable segments defined by hierarchical clustering. Using this reference set, we develop a pipeline that identifies copy number and allelic variation in the IGHV locus from whole-genome sequencing reads. Tests on simulated reads demonstrate that our approach is feasible and accurate for detecting the presence and absence of gene segments using reads as short as 70 bp. With reads 100 bp and longer, coverage depth can also be used to determine copy number. When app...Continue Reading

Related Concepts

Alleles
Base Sequence
Genes
IGH gene
Genome
Immunoglobulin G
Immunoglobulin Heavy Chains
Patterns
Structure
Genetic Loci

Related Feeds

BioRxiv & MedRxiv Preprints

BioRxiv and MedRxiv are the preprint servers for biology and health sciences respectively, operated by Cold Spring Harbor Laboratory. Here are the latest preprint articles (which are not peer-reviewed) from BioRxiv and MedRxiv.

Antibody Repertoire Diversity

Antibody repertoire diversity and its role during natural infection is a prerequisite for molecular and structural elucidation of functionally protective immunity. Discover the latest insights into antibody diversity here.