North African populations are distinct from sub-Saharan Africans based on cultural, linguistic, and phenotypic attributes; however, the time and the extent of genetic divergence between populations north and south of the Sahara remain poorly understood. Here, we interrogate the multilayered history of North Africa by characterizing the effect of hypothesized migrations from the Near East, Europe, and sub-Saharan Africa on current genetic diversity. We present dense, genome-wide SNP genotyping array data (730,000 sites) from seven North African populations, spanning from Egypt to Morocco, and one Spanish population. We identify a gradient of likely autochthonous Maghrebi ancestry that increases from east to west across northern Africa; this ancestry is likely derived from “back-to-Africa” gene flow more than 12,000 years ago (ya), prior to the Holocene. The indigenous North African ancestry is more frequent in populations with historical Berber ethnicity. In most North African populations we also see substantial shared ancestry with the Near East, and to a lesser extent sub-Saharan Africa and Europe. To estimate the time of migration from sub-Saharan populations into North Africa, we implement a maximum likelihood dating method based on the distribution of migrant tracts. In order to first identify migrant tracts, we assign local ancestry to haplotypes using a novel, principal component-based analysis of three ancestral populations. We estimate that a migration of western African origin into Morocco began about 40 generations ago (approximately 1,200 ya); a migration of individuals with Nilotic ancestry into Egypt occurred about 25 generations ago (approximately 750 ya). Our genomic data reveal an extraordinarily complex history of migrations, involving at least five ancestral populations, into North Africa.
North Africa is characterized by its diverse cultural and linguistic groups and its genetic heterogeneity. Genomic data has shown an amalgam of components mixed since pre-Holocean times. Though no differences have been found in uniparental and classical markers between Berbers and Arabs, the two main ethnic groups in the region, the scanty genomic data available have highlighted the singularity of Berbers. We characterize the genetic heterogeneity of North African groups, focusing on the putative differences of Berbers and Arabs, and estimate migration dates. We analyze genome-wide autosomal data in five Berber and six Arab groups, and compare them to Middle Easterns, sub-Saharans, and Europeans. Haplotype-based methods show a lack of correlation between geographical and genetic populations, and a high degree of genetic heterogeneity, without strong differences between Berbers and Arabs. Berbers enclose genetically diverse groups, from isolated endogamous groups with high autochthonous component frequencies, large homozygosity runs and low effective population sizes, to admixed groups with high frequencies of sub-Saharan and Middle Eastern components. Admixture time estimates show a complex pattern of recent historical migrations, with a peak around the 7th century C.E. coincident with the Arabization of the region; sub-Saharan migrations since the 1st century B.C. in agreement with Roman slave trade; and a strong migration in the 17th century C.E., coincident with a huge impact of the trans-Atlantic and trans-Saharan trade of sub-Saharan slaves in the Modern Era. The genetic complexity found should be taken into account when selecting reference groups in population genetics and biomedical studies.
SummaryBerbers live in groups scattered across North Africa whose origins and genetic relationships with their neighbours are not well established. The first hypervariable segment of the mitochondrial DNA (mtDNA) control region was sequenced in a total of 155 individuals from three Tunisian Berber groups and compared to other North Africans. The mtDNA lineages found belong to a common set of mtDNA haplogroups already described in North Africa. Besides the autochthonous North African U6 haplogroup, a group of L3 lineages characterized by the transition at position 16041 seems to be restricted to North Africans, suggesting that an expansion of this group of lineages took place around 10500 years ago in North Africa, and spread to neighbouring populations. Principal components and the coordinate analyses show that some Berber groups (the Tuareg, the Mozabite, and the Chenini-Douiret) are outliers within the North African genetic landscape. This outlier position is consistent with an isolation process followed by genetic drift in haplotype frequencies, and with the high heterogeneity displayed by Berbers compared to Arab samples as shown in the AMOVA. Despite this Berber heterogeneity, no significant differences were found between Berber and Arab samples, suggesting that the Arabization was mainly a cultural process rather than a demographic replacement.
During the 15th International Histocompatibility and Immunogenetics Workshop (IHIWS), 14 human leukocyte antigen (HLA) laboratories participated in the Analysis of HLA Population Data (AHPD) project where 18 new population samples were analyzed statistically and compared with data available from previous workshops. To that aim, an original methodology was developed and used (i) to estimate frequencies by taking into account ambiguous genotypic data, (ii) to test for Hardy-Weinberg equilibrium (HWE) by using a nested likelihood ratio test involving a parameter accounting for HWE deviations, (iii) to test for selective neutrality by using a resampling algorithm, and (iv) to provide explicit graphical representations including allele frequencies and basic statistics for each series of data. A total of 66 data series (1-7 loci per population) were analyzed with this standard approach. Frequency estimates were compliant with HWE in all but one population of mixed stem cell donors. Neutrality testing confirmed the observation of heterozygote excess at all HLA loci, although a significant deviation was established in only a few cases. Population comparisons showed that HLA genetic patterns were mostly shaped by geographic and/or linguistic differentiations in Africa and Europe, but not in America where both genetic drift in isolated populations and gene flow in admixed populations led to a more complex genetic structure. Overall, a fruitful collaboration between HLA typing laboratories and population geneticists allowed finding useful solutions to the problem of estimating gene frequencies and testing basic population diversity statistics on highly complex HLA data (high numbers of alleles and ambiguities), with promising applications in either anthropological, epidemiological, or transplantation studies.
Background Population demography and gene flow among African groups, as well as the putative archaic introgression of ancient hominins, have been poorly explored at the genome level. Results Here, we examine 15 African populations covering all major continental linguistic groups, ecosystems, and lifestyles within Africa through analysis of whole-genome sequence data of 21 individuals sequenced at deep coverage. We observe a remarkable correlation among genetic diversity and geographic distance, with the hunter-gatherer groups being more genetically differentiated and having larger effective population sizes throughout most modern-human history. Admixture signals are found between neighbor populations from both hunter-gatherer and agriculturalists groups, whereas North African individuals are closely related to Eurasian populations. Regarding archaic gene flow, we test six complex demographic models that consider recent admixture as well as archaic introgression. We identify the fingerprint of an archaic introgression event in the sub-Saharan populations included in the models (~ 4.0% in Khoisan, ~ 4.3% in Mbuti Pygmies, and ~ 5.8% in Mandenka) from an early divergent and currently extinct ghost modern human lineage. Conclusion The present study represents an in-depth genomic analysis of a Pan African set of individuals, which emphasizes their complex relationships and demographic history at population level. Electronic supplementary material The online version of this article (10.1186/s13059-019-1684-5) contains supplementary material, which is available to authorized users.
Highlights d Paleolithic genetic continuity is found in extant human North African genomes d There is a West-to-East genetic cline of the Paleolithic component in North Africa d Neolithization had a larger demographic impact than Arabization d Differential admixture and genetic drift have modeled North African genomes
Tunisia has experienced a variety of human migrations that have modeled the myriad cultural groups inhabiting the area. Both Arabic and Berber-speaking populations live in Tunisia. Berbers are commonly considered as in situ descendants of peoples who settled roughly in Palaeolithic times, and posterior demographic events such as the arrival of the Neolithic, the Arab migrations, and the expulsion of the "Moors" from Spain, had a strong cultural influence. Nonetheless, the genetic structure and the population relationships of the ethnic groups living in Tunisia have been poorly assessed. In order to gain insight into the paternal genetic landscape and population structure, more than 40 Y-chromosome single nucleotide polymorphisms and 17 short tandem repeats were analyzed in five Tunisian ethnic groups (three Berber-speaking isolates, one Andalusian, and one Cosmopolitan Arab). The most common lineage was the North African haplogroup E-M81 (71%), being fixed in two Berber samples (Chenini-Douiret and Jradou), suggesting isolation and genetic drift. Differential levels of paternal gene flow from the Near East were detected in the Tunisian samples (J-M267 lineage over 30%); however, no major sub-Saharan African or European influence was found. This result contrasts with the high amount of sub-Saharan and Eurasian maternal lineages previously described in Tunisia. Overall, our results reveal a certain genetic inter-population diversity, especially among Berber groups, and sexual asymmetry, paternal lineages being mostly of autochthonous origin. In addition, Andalusians, who are supposed to be migrants from southern Spain, do not exhibit any substantial contribution of European lineages, suggesting a North African origin for this ethnic group.
The geostrategic location of North Africa as a crossroad between three continents and as a stepping-stone outside Africa has evoked anthropological and genetic interest in this region. Numerous studies have described the genetic landscape of the human population in North Africa employing paternal, maternal, and biparental molecular markers. However, information from these markers which have different inheritance patterns has been mostly assessed independently, resulting in an incomplete description of the region. In this study, we analyze uniparental and genome-wide markers examining similarities or contrasts in the results and consequently provide a comprehensive description of the evolutionary history of North Africa populations. Our results show that both males and females in North Africa underwent a similar admixture history with slight differences in the proportions of admixture components. Consequently, genome-wide diversity show similar patterns with admixture tests suggesting North Africans are a mixture of ancestral populations related to current Africans and Eurasians with more affinity towards the out-of-Africa populations than to sub-Saharan Africans. We estimate from the paternal lineages that most North Africans emerged ∼15,000 years ago during the last glacial warming and that population splits started after the desiccation of the Sahara. Although most North Africans share a common admixture history, the Tunisian Berbers show long periods of genetic isolation and appear to have diverged from surrounding populations without subsequent mixture. On the other hand, continuous gene flow from the Middle East made Egyptians genetically closer to Eurasians than to other North Africans. We show that genetic diversity of today's North Africans mostly captures patterns from migrations post Last Glacial Maximum and therefore may be insufficient to inform on the initial population of the region during the Middle Paleolithic period.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.