The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
There are well-established disparities in cancer incidence and outcomes by race/ethnicity that result from the interplay between structural, socioeconomic, socio-environmental, behavioural and biological factors. However, large research studies designed to investigate factors contributing to cancer aetiology and progression have mainly focused on populations of European origin. The limitations in clinicopathological and genetic data, as well as the reduced availability of biospecimens from diverse populations, contribute to the knowledge gap and have the potential to widen cancer health disparities. In this review, we summarise reported disparities and associated factors in the United States of America (USA) for the most common cancers (breast, prostate, lung and colon), and for a subset of other cancers that highlight the complexity of disparities (gastric, liver, pancreas and leukaemia). We focus on populations commonly identified and referred to as racial/ethnic minorities in the USA—African Americans/Blacks, American Indians and Alaska Natives, Asians, Native Hawaiians/other Pacific Islanders and Hispanics/Latinos. We conclude that even though substantial progress has been made in understanding the factors underlying cancer health disparities, marked inequities persist. Additional efforts are needed to include participants from diverse populations in the research of cancer aetiology, biology and treatment. Furthermore, to eliminate cancer health disparities, it will be necessary to facilitate access to, and utilisation of, health services to all individuals, and to address structural inequities, including racism, that disproportionally affect racial/ethnic minorities in the USA.
There is great scientific and popular interest in understanding the genetic history of populations in the Americas. We wish to understand when different regions of the continent were inhabited, where settlers came from, and how current inhabitants relate genetically to earlier populations. Recent studies unraveled parts of the genetic history of the continent using genotyping arrays and uniparental markers. The 1000 Genomes Project provides a unique opportunity for improving our understanding of population genetic history by providing over a hundred sequenced low coverage genomes and exomes from Colombian (CLM), Mexican-American (MXL), and Puerto Rican (PUR) populations. Here, we explore the genomic contributions of African, European, and especially Native American ancestry to these populations. Estimated Native American ancestry is in MXL, in CLM, and in PUR. Native American ancestry in PUR is most closely related to populations surrounding the Orinoco River basin, confirming the Southern America ancestry of the Taíno people of the Caribbean. We present new methods to estimate the allele frequencies in the Native American fraction of the populations, and model their distribution using a demographic model for three ancestral Native American populations. These ancestral populations likely split in close succession: the most likely scenario, based on a peopling of the Americas thousand years ago (kya), supports that the MXL Ancestors split kya, with a subsequent split of the ancestors to CLM and PUR kya. The model also features effective populations of in Mexico, in Colombia, and in Puerto Rico. Modeling Identity-by-descent (IBD) and ancestry tract length, we show that post-contact populations also differ markedly in their effective sizes and migration patterns, with Puerto Rico showing the smallest effective size and the earlier migration from Europe. Finally, we compare IBD and ancestry assignments to find evidence for relatedness among European founders to the three populations.
The population of Argentina is the result of the intermixing between several groups, including Indigenous American, European and African populations. Despite the commonly held idea that the population of Argentina is of mostly European origin, multiple studies have shown that this process of admixture had an impact in the entire Argentine population. In the present study we characterized the distribution of Indigenous American, European and African ancestry among individuals from different regions of Argentina and evaluated the level of discrepancy between self-reported grandparental origin and genetic ancestry estimates. A set of 99 autosomal ancestry informative markers (AIMs) was genotyped in a sample of 441 Argentine individuals to estimate genetic ancestry. We used non-parametric tests to evaluate statistical significance. The average ancestry for the Argentine sample overall was 65% European (95%CI: 63–68%), 31% Indigenous American (28–33%) and 4% African (3–4%). We observed statistically significant differences in European ancestry across Argentine regions [Buenos Aires province (BA) 76%, 95%CI: 73–79%; Northeast (NEA) 54%, 95%CI: 49–58%; Northwest (NWA) 33%, 95%CI: 21–41%; South 54%, 95%CI: 49–59%; p<0.0001] as well as between the capital and immediate suburbs of Buenos Aires city compared to more distant suburbs [80% (95%CI: 75–86%) versus 68% (95%CI: 58–77%), p = 0.01]. European ancestry among individuals that declared all grandparents born in Europe was 91% (95%CI: 88–94%) compared to 54% (95%CI: 51–57%) among those with no European grandparents (p<0.001). Our results demonstrate the range of variation in genetic ancestry among Argentine individuals from different regions in the country, highlighting the importance of taking this variation into account in genetic association and admixture mapping studies in this population.
Hereditary cancer predisposition gene testing allows the identification of individuals at high risk of cancer that may benefit from increased surveillance, chemoprevention, and prophylactic surgery. In order to implement clinical genetic strategies adapted to each population’s needs and intrinsic genetic characteristic, this review aims to present the current status of knowledge about the spectrum of BRCA pathogenic variants in Latin American populations. We have conducted a comprehensive review of 33 studies published between 1994 and 2015 reporting the prevalence and/or spectrum of BRCA1 (OMIM 113705) and BRCA2 (OMIM 600185) variants. The combined sample size for these studies consisted of 4835 individuals from 13 countries in Latin America and the Caribbean, as well as in Hispanics in the United States. A total of 167 unique pathogenic variants have been reported in the existing literature. In unselected breast cancer cases, the prevalence ranged from 1.2 to 27.1 %. Some countries presented a few recurrent pathogenic variants, while others were characterized by diverse, non-recurrent variants. The proportion of BRCA pathogenic variants shared between Hispanics in the United States and Latin American populations was estimated at 10.4 %. Within Latin America and the Caribbean, 8.2 % of the BRCA variants reported were present in more than one country. Countries with high prevalence of BRCA pathogenic variants may benefit from more aggressive testing strategies, while testing of recurrent variant panels might present a cost-effective solution for improving genetic testing in some, but not all, countries.Electronic supplementary materialThe online version of this article (doi:10.1007/s10549-015-3629-3) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.