The African continent is regarded as the cradle of modern humans and African genomes contain more genetic variation than those from any other continent, yet only a fraction of the genetic diversity among African individuals has been surveyed1. Here we performed whole-genome sequencing analyses of 426 individuals—comprising 50 ethnolinguistic groups, including previously unsampled populations—to explore the breadth of genomic diversity across Africa. We uncovered more than 3 million previously undescribed variants, most of which were found among individuals from newly sampled ethnolinguistic groups, as well as 62 previously unreported loci that are under strong selection, which were predominantly found in genes that are involved in viral immunity, DNA repair and metabolism. We observed complex patterns of ancestral admixture and putative-damaging and novel variation, both within and between populations, alongside evidence that Zambia was a likely intermediate site along the routes of expansion of Bantu-speaking populations. Pathogenic variants in genes that are currently characterized as medically relevant were uncommon—but in other genes, variants denoted as ‘likely pathogenic’ in the ClinVar database were commonly observed. Collectively, these findings refine our current understanding of continental migration, identify gene flow and the response to human disease as strong drivers of genome-level population variation, and underscore the scientific imperative for a broader characterization of the genomic diversity of African individuals to understand human ancestry and improve health.
Tsetse flies are the sole vectors of human African trypanosomiasis throughout sub-Saharan Africa. Both sexes of adult tsetse feed exclusively on blood and contribute to disease transmission. Notable differences between tsetse and other disease vectors include obligate microbial symbioses, viviparous reproduction, and lactation. Here, we describe the sequence and annotation of the 366-megabase Glossina morsitans morsitans genome. Analysis of the genome and the 12,308 predicted protein–encoding genes led to multiple discoveries, including chromosomal integrations of bacterial (Wolbachia) genome sequences, a family of lactation-specific proteins, reduced complement of host pathogen recognition proteins, and reduced olfaction/chemosensory associated genes. These genome data provide a foundation for research into trypanosomiasis prevention and yield important insights with broad implications for multiple aspects of tsetse biology.
The primary differentiation event during mammalian development occurs at the blastocyst stage and leads to the delineation of the inner cell mass (ICM) and the trophectoderm (TE). We provide the first global mRNA expression data from immunosurgically dissected ICM cells, TE cells, and intact human blastocysts. Using a cDNA microarray composed of 15,529 cDNAs from known and novel genes, we identify marker transcripts specific to the ICM (e.g., OCT4/POU5F1, NANOG, HMGB1, and DPPA5) and TE (e.g., CDX2, ATP1B3, SFN, and IPL), in addition to novel ICM-and TE-specific expressed sequence tags. The expression patterns suggest that the emergence of pluripotent ICM and TE cell lineages from the morula is controlled by metabolic and signaling pathways, which include inter alia, WNT, mitogen-activated protein kinase, transforming growth factor-beta, NOTCH, integrinmediated cell adhesion, phosphatidylinositol 3-kinase, and apoptosis. These data enhance our understanding of the first step in human cellular differentiation and, hence, the derivation of both embryonic stem cells and trophoblastic stem cells from these lineages. Stem Cells 2005;23:1514-1525
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.