DOI: 10.1101/518696Jan 11, 2019Paper

Network-based hierarchical population structure analysis for large genomic datasets

BioRxiv : the Preprint Server for Biology
Gili GreenbaumNoah A. Rosenberg

Abstract

Analysis of population structure in natural populations using genetic data is a common practice in ecological and evolutionary studies. With large genomic datasets of populations now appearing more frequently across the taxonomic spectrum, it is becoming increasingly possible to reveal many hierarchical levels of structure, including fine-scale genetic clusters. To analyze these datasets, methods need to be appropriately suited to the challenges of extracting multi-level structure from whole-genome data. Here, we present a network-based approach for constructing population structure representations from genetic data. The use of community detection algorithms from network theory generates a natural hierarchical perspective on the representation that the method produces. The method is computationally efficient, and it requires relatively few assumptions regarding the biological processes that underlie the data. We demonstrate the approach by analyzing population structure in the model plant species Arabidopsis thaliana and in human populations. These examples illustrates how network-based approaches for population structure analysis are well-suited to extracting valuable ecological and evolutionary information in the era of large...Continue Reading

Related Concepts

Gene Clusters
Arabidopsis thaliana <plant>
Structure
Analysis
Population Group
Detection
Whole Genome Amplification
Species
Study

Related Feeds

BioRxiv & MedRxiv Preprints

BioRxiv and MedRxiv are the preprint servers for biology and health sciences respectively, operated by Cold Spring Harbor Laboratory. Here are the latest preprint articles (which are not peer-reviewed) from BioRxiv and MedRxiv.