Asia harbors substantial cultural and linguistic diversity, but the geographic structure of genetic variation across the continent remains enigmatic. Here we report a large-scale survey of autosomal variation from a broad geographic sample of Asian human populations. Our results show that genetic ancestry is strongly correlated with linguistic affiliations as well as geography. Most populations show relatedness within ethnic/linguistic groups, despite prevalent gene flow among populations. More than 90% of East Asian (EA) haplotypes could be found in either Southeast Asian (SEA) or Central-South Asian (CSA) populations and show clinal structure with haplotype diversity decreasing from south to north. Furthermore, 50% of EA haplotypes were found in SEA only and 5% were found in CSA only, indicating that SEA was a major geographic source of EA populations.
Indigenous populations of Malaysia known as Orang Asli (OA) show huge morphological, anthropological, and linguistic diversity. However, the genetic history of these populations remained obscure. We performed a high-density array genotyping using over 2 million single nucleotide polymorphisms in three major groups of Negrito, Senoi, and Proto-Malay. Structural analyses indicated that although all OA groups are genetically closest to East Asian (EA) populations, they are substantially distinct. We identified a genetic affinity between Andamanese and Malaysian Negritos which may suggest an ancient link between these two groups. We also showed that Senoi and Proto-Malay may be admixtures between Negrito and EA populations. Formal admixture tests provided evidence of gene flow between Austro-Asiatic-speaking OAs and populations from Southeast Asia (SEA) and South China which suggest a widespread presence of these people in SEA before Austronesian expansion. Elevated linkage disequilibrium (LD) and enriched homozygosity found in OAs reflect isolation and bottlenecks experienced. Estimates based on Ne and LD indicated that these populations diverged from East Asians during the late Pleistocene (14.5 to 8 KYA). The continuum in divergence time from Negritos to Senoi and Proto-Malay in combination with ancestral markers provides evidences of multiple waves of migration into SEA starting with the first Out-of-Africa dispersals followed by Early Train and subsequent Austronesian expansions.
BackgroundSouth East Asia (SEA) is home to over 30 tribes of indigenous population groups who are currently facing rapid socio-economic change. Epidemiological transition and increased prevalence of non-communicable diseases (NCD) has occured. In Peninsular Malaysia, the Orang Asli (OA) indigenous people comprise 0 · 6% (150,000) of the population and live in various settlements. OA comprise three distinct large tribes with smaller sub-tribes. The three large tribes include Proto-Malay (sub-tribes: Orang Seletar and Jakun), Senoi (sub-tribes: Mahmeri and Semai), and Negrito (sub-tribes: Jehai, Mendriq and Batek).MethodsWe studied the health of 636 OA from seven sub-tribes in the Peninsular. Parameters that were assessed included height, weight, BMI and waist circumference whilst blood pressure, cholesterols, fasting blood glucose and HbA1c levels were recorded. We then analysed cardio-metabolic risk factor prevalences and performed multiple pair-wise comparisons among different sub-tribes and socio-economic clusters.ResultsCardio-metabolic risk factors were recorded in the seven sub-tribes.. Prevalence for general and abdominal obesity were highest in the urbanized Orang Seletar (31 · 6 ± 5 · 7%; 66 · 1 ± 5 · 9%). Notably, hunter gatherer Jehai and Batek tribes displayed the highest prevalence for hypertension (43 · 8 ± 9 · 29% and 51 · 2 ± 15 · 3%) despite being the leanest and most remote, while the Mendriq sub-tribe, living in the same jungle area with access to similar resources as the Batek were less hypertensive (16.3 ± 11.0%), but displayed higher prevalence of abdominal obesity (27.30 ± 13.16%).ConclusionsWe describe the cardio-metabolic risk factors of seven indigenous communities in Malaysia. We report variable prevalence of obesity, cholesterol, hypertension and diabetes in the OA in contrast to the larger ethnic majorities such as Malays, Chinese and Indians in Malaysia These differences are likely to be due to socio-economic effects and lifestyle changes. In some sub-tribes, other factors including genetic predisposition may also play a role. It is expected that the cardio-metabolic risk factors may worsen with further urbanization, increase the health burden of these communities and strain the government’s resources.
Tibetan high-altitude adaptation (HAA) has been studied extensively, and many candidate genes have been reported. Subsequent efforts targeting HAA functional variants, however, have not been that successful (e.g., no functional variant has been suggested for the top candidate HAA gene, EPAS1). With WinXPCNVer, a method developed in this study, we detected in microarray data a Tibetan-enriched deletion (TED) carried by 90% of Tibetans; 50% were homozygous for the deletion, whereas only 3% carried the TED and 0% carried the homozygous deletion in 2,792 worldwide samples (p < 10(-15)). We employed long PCR and Sanger sequencing technologies to determine the exact copy number and breakpoints of the TED in 70 additional Tibetan and 182 diverse samples. The TED had identical boundaries (chr2: 46,694,276-46,697,683; hg19) and was 80 kb downstream of EPAS1. Notably, the TED was in strong linkage disequilibrium (LD; r(2) = 0.8) with EPAS1 variants associated with reduced blood concentrations of hemoglobin. It was also in complete LD with the 5-SNP motif, which was suspected to be introgressed from Denisovans, but the deletion itself was absent from the Denisovan sequence. Correspondingly, we detected that footprints of positive selection for the TED occurred 12,803 (95% confidence interval = 12,075-14,725) years ago. We further whole-genome deep sequenced (>60×) seven Tibetans and verified the TED but failed to identify any other copy-number variations with comparable patterns, giving this TED top priority for further study. We speculate that the specific patterns of the TED resulted from its own functionality in HAA of Tibetans or LD with a functional variant of EPAS1.
16Variability in the susceptibility to infectious disease and its clinical manifestation can be determined 17 by variation in the environment and by genetic variation in the pathogen and the host. Despite 18 several successes based on candidate gene studies, defining the host variation affecting infectious 19 disease has not been as successful as other multifactorial diseases. Both single nucleotide variation 20 and copy number variation (CNV) in the host contribute to the host's susceptibility to infectious 21 disease. In this review we focus on CNV, particularly on complex multiallelic CNV that is often not 22 well characterised either directly by hybridisation methods or indirectly by analysis of genotypes and 23 flanking single nucleotide variants. We summarise the well-known examples, such as alpha-globin 24 deletion and susceptibility to severe malaria, as well as more recent controversies, such as the 25 extensive CNV of the chemokine gene CCL3L1 and HIV infection. We discuss the potential biological 26 mechanisms that could underly any genetic association and reflect on the extensive complexity and 27 functional variation generated by a combination of CNV and sequence variation, as illustrated by the 28 Fc gamma receptor genes FCGR3A, FCGR3B and FCGR2C. We also highlight some understudied areas 29 that might prove fruitful areas for further research. 30 31
Peninsular Malaysia is a strategic region which might have played an important role in the initial peopling and subsequent human migrations in Asia. However, the genetic diversity and history of human populations--especially indigenous populations--inhabiting this area remain poorly understood. Here, we conducted a genome-wide study using over 900,000 single nucleotide polymorphisms (SNPs) in four major Malaysian ethnic groups (MEGs; Malay, Proto-Malay, Senoi and Negrito), and made comparisons of 17 world-wide populations. Our data revealed that Peninsular Malaysia has greater genetic diversity corresponding to its role as a contact zone of both early and recent human migrations in Asia. However, each single Orang Asli (indigenous) group was less diverse with a smaller effective population size (N(e)) than a European or an East Asian population, indicating a substantial isolation of some duration for these groups. All four MEGs were genetically more similar to Asian populations than to other continental groups, and the divergence time between MEGs and East Asian populations (12,000--6,000 years ago) was also much shorter than that between East Asians and Europeans. Thus, Malaysian Orang Asli groups, despite their significantly different features, may share a common origin with the other Asian groups. Nevertheless, we identified traces of recent gene flow from non-Asians to MEGs. Finally, natural selection signatures were detected in a batch of genes associated with immune response, human height, skin pigmentation, hair and facial morphology and blood pressure in MEGs. Notable examples include SYN3 which is associated with human height in all Orang Asli groups, a height-related gene (PNPT1) and two blood pressure-related genes (CDH13 and PAX5) in Negritos. We conclude that a long isolation period, subsequent gene flow and local adaptations have jointly shaped the genetic architectures of MEGs, and this study provides insight into the peopling and human migration history in Southeast Asia.
The indigenous populations from Peninsular Malaysia, locally known as Orang Asli, continue to adopt an agro-subsistence nomadic lifestyle, residing primarily within natural jungle habitats. Leading a hunter-gatherer lifestyle in a tropical jungle environment, the Orang Asli are routinely exposed to malaria. Here we surveyed the genetic architecture of individuals from four Orang Asli tribes with high-density genotyping across more than 2.5 million polymorphisms. These tribes reside in different geographical locations in Peninsular Malaysia and belong to three main ethno-linguistic groups, where there is minimal interaction between the tribes. We first dissect the genetic diversity and admixture between the tribes and with neighboring urban populations. Later, by implementing five metrics, we investigated the genome-wide signatures for positive natural selection of these Orang Asli, respectively. Finally, we searched for evidence of genomic adaptation to the pressure of malaria infection. We observed that different evolutionary responses might have emerged in the different Orang Asli communities to mitigate malaria infection.
The Malay people are an important ethnic composition in Southeast Asia, but their genetic make-up and population structure remain poorly studied. Here we conducted a genome-wide study of four geographical Malay populations: Peninsular Malaysian Malay (PMM), Singaporean Malay (SGM), Indonesian Malay (IDM) and Sri Lankan Malay (SLM). All the four Malay populations showed substantial admixture with multiple ancestries. We identified four major ancestral components in Malay populations: Austronesian (17%–62%), Proto-Malay (15%–31%), East Asian (4%–16%) and South Asian (3%–34%). Approximately 34% of the genetic makeup of SLM is of South Asian ancestry, resulting in its distinct genetic pattern compared with the other three Malay populations. Besides, substantial differentiation was observed between the Malay populations from the north and the south, and between those from the west and the east. In summary, this study revealed that the genetic identity of the Malays comprises a mixed entity of multiple ancestries represented by Austronesian, Proto-Malay, East Asian and South Asian, with most of the admixture events estimated to have occurred 175 to 1,500 years ago, which in turn suggests that geographical isolation and independent admixture have significantly shaped the genetic architectures and the diversity of the Malay populations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.