Southern China is the birthplace of rice-cultivating agriculture and different language families and has also witnessed various human migrations that facilitated cultural diffusions. The fine-scale demographic history in situ that forms present-day local populations, however, remains unclear. To comprehensively cover the genetic diversity in East and Southeast Asia, we generated genome-wide SNP data from 211 present-day Southern Chinese and co-analyzed them with ∼1,200 ancient and modern genomes. In Southern China, language classification is significantly associated with genetic variation but with a different extent of predictability, and there is strong evidence for recent shared genetic history particularly in Hmong–Mien and Austronesian speakers. A geography-related genetic sub-structure that represents the major genetic variation in Southern East Asians is established pre-Holocene and its extremes are represented by Neolithic Fujianese and First Farmers in Mainland Southeast Asia. This sub-structure is largely reduced by admixture in ancient Southern Chinese since > ∼2,000 BP, which forms a “Southern Chinese Cluster” with a high level of genetic homogeneity. Further admixture characterizes the demographic history of the majority of Hmong–Mien speakers and some Kra-Dai speakers in Southwest China happened ∼1,500–1,000 BP, coeval to the reigns of local chiefdoms. In Yellow River Basin, we identify a connection of local populations to genetic sub-structure in Southern China with geographical correspondence appearing > ∼9,000 BP, while the gene flow likely closely related to “Southern Chinese Cluster” since the Longshan period (∼5,000–4,000 BP) forms ancestry profile of Han Chinese Cline.
Sui people, which belong to the Tai-Kadai-speaking family, remain poorly characterized due to a lack of genome-wide data. To infer the fine-scale population genetic structure and putative genetic sources of the Sui people, we genotyped 498,655 genome-wide single-nucleotide polymorphisms (SNPs) using SNP arrays in 68 Sui individuals from seven indigenous populations in Guizhou province and Guangxi Zhuang Autonomous Region in Southwest China and co-analyzed with available East Asians via a series of population genetic methods including principal component analysis (PCA), ADMIXTURE, pairwise Fst genetic distance, f-statistics, qpWave, and qpAdm. Our results revealed that Guangxi and Guizhou Sui people showed a strong genetic affinity with populations from southern China and Southeast Asia, especially Tai-Kadai- and Hmong-Mien-speaking populations as well as ancient Iron Age Taiwan Hanben, Gongguan individuals supporting the hypothesis that Sui people came from southern China originally. The indigenous Tai-Kadai-related ancestry (represented by Li), Northern East Asian-related ancestry, and Hmong-Mien-related lineage contributed to the formation processes of the Sui people. We identified the genetic substructure within Sui groups: Guizhou Sui people were relatively homogeneous and possessed similar genetic profiles with neighboring Tai-Kadai-related populations, such as Maonan. While Sui people in Yizhou and Huanjiang of Guangxi might receive unique, additional gene flow from Hmong-Mien-speaking populations and Northern East Asians, respectively, after the divergence within other Sui populations. Sui people could be modeled as the admixture of ancient Yellow River Basin farmer-related ancestry (36.2–54.7%) and ancient coastal Southeast Asian-related ancestry (45.3–63.8%). We also identified the potential positive selection signals related to the disease susceptibility in Sui people via integrated haplotype score (iHS) and number of segregating sites by length (nSL) scores. These genomic findings provided new insights into the demographic history of Tai-Kadai-speaking Sui people and their interaction with neighboring populations in Southern China.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.