Fine-scale Population Structure and Demographic History of Han Chinese Inferred from Haplotype Network of 111,000 Genomes

BioRxiv : the Preprint Server for Biology
A. LanGang Chen


Han Chinese is the most populous ethnic group in the world, with a complex substructure that mirrors its cultural diversification. Previous studies have characterized the genetic polymorphism spectrum of Han Chinese, but high-resolution investigations that unveil its fine-scale substructure and trace the genetic imprints of its demographic history are still missing. Here we construct a haplotype network consisting of 111,000 genome-wide genotyped Han Chinese individuals from direct-to-consumer genetic testing and over 1.3 billion identity-by-descent (IBD) links. We observed a clear separation of northern and southern Han Chinese and, through hierarchical clustering of the haplotype network, captured 5 subclusters and 17 sub-subclusters corresponding to geography (especially mountain ranges), immigration waves, and clans with cultural-linguistic segregation. We inferred differentiated split histories and founder effects for the Cantonese, Hakka, and Minnan-Chaoshanese clans in southern China, and also unveiled more recent demographic events within the past few centuries, such as Zou Xikou and Chuang Guandong. The composition shifts of the native and current residents of four major metropolises (Beijing, Shanghai, Guangzhou, an...
