Here, we give a historical overview of the search for genetic variants that influence the susceptibility of an individual to a chronic disease, from RA Fisher's seminal work to the current excitement of whole-genome association studies (WGAS). We then discuss the concepts behind the identification of common variants as disease causal factors and contrast them to the basic ideas that underlie the rare variant hypothesis. The identification of rare variants involves the careful selection of candidate genes to examine, the availability of highly efficient resequencing techniques and the appropriate assessment of the functional consequences of the implicated variant. We believe that this strategy can be successfully applied at present in order to unravel the contribution of rare variants to the multifactorial inheritance of common diseases, which could lead to the implementation of much needed preventative screening schemes.
To control for hidden population stratification in genetic-association studies, statistical methods that use marker genotype data to infer population structure have been proposed as a possible alternative to family-based designs. In principle, it is possible to infer population structure from associations between marker loci and from associations of markers with the trait, even when no information about the demographic background of the population is available. In a model in which the total population is formed by admixture between two or more subpopulations, confounding can be estimated and controlled. Current implementations of this approach have limitations, the most serious of which is that they do not allow for uncertainty in estimations of individual admixture proportions or for lack of identifiability of subpopulations in the model. We describe methods that overcome these limitations by a combination of Bayesian and classical approaches, and we demonstrate the methods by using data from three admixed populations--African American, African Caribbean, and Hispanic American--in which there is extreme confounding of trait-genotype associations because the trait under study (skin pigmentation) varies with admixture proportions. In these data sets, as many as one-third of marker loci show crude associations with the trait. Control for confounding by population stratification eliminates these associations, except at loci that are linked to candidate genes for the trait. With only 32 markers informative for ancestry, the efficiency of the analysis is 70%. These methods can deal with both confounding and selection bias in genetic-association studies, making family-based designs unnecessary.
Ancestry informative markers (AIMs) are genetic loci showing alleles with large frequency differences between populations. AIMs can be used to estimate biogeographical ancestry at the level of the population, subgroup (e.g. cases and controls) and individual. Ancestry estimates at both the subgroup and individual level can be directly instructive regarding the genetics of the pheno-types that differ qualitatively or in frequency between populations. These estimates can provide a compelling foundation for the use of admixture mapping (AM) methods to identify the genes underlying these traits. We present details of a panel of 34 AIMs and demonstrate how such studies can proceed, by using skin pigmentation as a model phenotype. We have genotyped these markers in two population samples with primarily African ancestry, viz. African Americans from Washington D.C. and an African Caribbean sample from Britain, and in a sample of European Americans from Pennsylvania. In the two African population samples, we observed significant correlations between estimates of individual ancestry and skin pigmentation as measured by reflectometry (R 2 =0.21, P<0.0001 for the African-American sample and R 2 =0.16, P<0.0001 for the British African-Caribbean sample). These correlations confirm the validity of the ancestry estimates and also indicate the high level of population structure related to admixture, a level that characterizes these populations and that is detectable by using other tests to identify genetic structure. We have also applied two methods of admixture mapping to test for the effects of three candidate genes (TYR, OCA2, MC1R) on pigmentation. We show that TYR and OCA2 have measurable effects on skin pigmentation differences between the west African and west European parental populations. This work indicates that it is possible to estimate the individual ancestry of a person based on DNA analysis with a reasonable number of welldefined genetic markers. The implications and applications of ancestry estimates in biomedical research are discussed.
Gene flow between genetically distinct populations creates linkage disequilibrium (admixture linkage disequilibrium [ALD]) among all loci (linked and unlinked) that have different allele frequencies in the founding populations. We have explored the distribution of ALD by using computer simulation of two extreme models of admixture: the hybrid-isolation (HI) model, in which admixture occurs in a single generation, and the continuous-gene-flow (CGF) model, in which admixture occurs at a steady rate in every generation. Linkage disequilibrium patterns in African American population samples from Jackson, MS, and from coastal South Carolina resemble patterns observed in the simulated CGF populations, in two respects. First, significant association between two loci (FY and AT3) separated by 22 cM was detected in both samples. The retention of ALD over relatively large (>10 cM) chromosomal segments is characteristic of a CGF pattern of admixture but not of an HI pattern. Second, significant associations were also detected between many pairs of unlinked loci, as observed in the CGF simulation results but not in the simulated HI populations. Such a high rate of association between unlinked markers in these populations could result in false-positive linkage signals in an admixture-mapping study. However, we demonstrate that by conditioning on parental admixture, we can distinguish between true linkage and association resulting from shared ancestry. Therefore, populations with a CGF history of admixture not only are appropriate for admixture mapping but also have greater power for detection of linkage disequilibrium over large chromosomal regions than do populations that have experienced a pattern of admixture more similar to the HI model, if methods are employed that detect and adjust for disequilibrium caused by continuous admixture.
Fission yeast centromeric repeats are transcribed into small interfering RNA (siRNA) precursors (pre-siRNAs), which are processed by Dicer to direct heterochromatin formation. Recently, Rpb1 and Rpb2 subunits of RNA polymerase II (RNA Pol II) were shown to mediate RNA interference (RNAi)-directed chromatin modification but did not affect pre-siRNA levels. Here we show that another Pol II subunit, Rpb7 has a specific role in presiRNA transcription. We define a centromeric presiRNA promoter from which initiation is exquisitely sensitive to the rpb7-G150D mutation. In contrast to other Pol II subunits, Rpb7 promotes pre-siRNA transcription required for RNAi-directed chromatin silencing. In RNA interference (RNAi), Dicer processes doublestranded RNA to produce short small interfering RNAs (siRNAs) that act to silence gene expression post-transcriptionally (Fire et al. 1998;Zamore et al. 2000;Bernstein et al. 2001). RNAi also mediates transcriptional silencing via formation of heterochromatin. Centromeres in many organisms are composed of repetitive DNAs, which act as a substrate for the formation of kinetochores, flanked by domains of centromeric heterochromatin (Cleveland et al. 2003). In fission yeast (Schizosaccharomyces pombe) genes inserted within this heterochromatin are transcriptionally silenced (Allshire et al. 1995). Paradoxically, the centromeric repeats themselves are transcribed to form long siRNA precursors (pre-siRNAs), which are required for the RNAi-mediated chromatin modifications resulting in transcriptional silencing (Provost et al. 2002;Volpe et al. 2002). Defective RNAi leads to loss of silencing and defective heterochromatin formation in several organisms (Mochizuki et al. 2002;Pal-Bhadra et al. 2002;Zilberman et al. 2003) and transcripts homologous to centromere repeats have been detected in chicken cells as well as mice (Fukagawa et al. 2004;Kanellopoulou et al. 2005). Interestingly, in plants, specialized RNA polymerase IV (RNA Pol IV) subunits have been implicated in RNAidirected silencing (Herr et al. 2005;Onodera et al. 2005). Animal and fungal genomes lack such Pol IV subunits. Very recently, however, two subunits of RNA pol II (Rpb1 and Rpb2) were shown to have specific roles in chromatin modification and siRNA production, respectively, but did not affect pre-siRNA levels (Kato et al. 2005;Schramke et al. 2005).Here, we have characterized the temperature sensitive csp3 (centromere: suppressor of position effect) mutant isolated in a screen for trans-acting mutants that are required for centromeric silencing (Ekwall et al. 1999). csp3 is a G150D missense mutation in the rpb7 gene, a subunit of Pol II. We show that Rpb7 has a specific defect in centromeric pre-siRNA transcription. We define a centromeric pre-siRNA promoter from which initiation is exquisitely sensitive to rpb7-G150D. Thus, in contrast to the previously characterized silencing defective Pol II subunits, which affect siRNA production and/or downstream events, Rpb7 has a distinct role in generating centromeric pre-siRNAs ne...
We analyzed admixture in samples of six different African-American populations from South Carolina: Gullah-speaking Sea Islanders in coastal South Carolina, residents of four counties in the "Low Country" (Berkeley, Charleston, Colleton, and Dorchester), and persons living in the city of Columbia, located in central South Carolina. We used a battery of highly informative autosomal, mtDNA, and Y-chromosome markers. Two of the autosomal markers (FY and AT3) are linked and lie 22 cM apart on chromosome 1. The results of this study indicate, in accordance with previous historical, cultural, and anthropological evidence, a very low level of European admixture in the Gullah Sea Islanders (m = 3.5 +/- 0.9%). The proportion of European admixture is higher in the Low Country (m ranging between 9. 9 +/- 1.8% and 14.0 +/- 1.9%), and is highest in Columbia (m = 17.7 +/- 3.1%). A sex-biased European gene flow and a small Native American contribution to the African-American gene pool are also evident in these data. We studied the pattern of pairwise allelic associations between the FY locus and the nine other autosomal markers in our samples. In the combined sample from the Low Country (N = 548), a high level of linkage disequilibrium was observed between the linked markers, FY and AT3. Additionally, significant associations were also detected between FY and 4 of the 8 unlinked markers, suggesting the existence of significant genetic structure in this population. A continuous gene flow model of admixture could explain the observed pattern of genetic structure. A test conditioning on the overall admixture of each individual showed association of ancestry between the two linked markers (FY and AT3), but not between any of the unlinked markers, as theory predicts. Thus, even in the presence of genetic structure due to continuous gene flow or some other factor, it is possible to differentiate associations due to linkage from spurious associations due to genetic structure.
SummaryA defining feature of centromeres is the presence of the histone H3 variant CENP-ACnp1. It is not known how CENP-ACnp1 is specifically delivered to, and assembled into, centromeric chromatin. Through a screen for factors involved in kinetochore integrity in fission yeast, we identified Sim3. Sim3 is homologous to known histone binding proteins NASPHuman and N1/N2Xenopus and aligns with Hif1S. cerevisiae, defining the SHNi-TPR family. Sim3 is distributed throughout the nucleoplasm, yet it associates with CENP-ACnp1 and also binds H3. Cells defective in Sim3 function have reduced levels of CENP-ACnp1 at centromeres (and increased H3) and display chromosome segregation defects. Sim3 is required to allow newly synthesized CENP-ACnp1 to accumulate at centromeres in S and G2 phase-arrested cells in a replication-independent mechanism. We propose that one function of Sim3 is to act as an escort that hands off CENP-ACnp1 to chromatin assembly factors, allowing its incorporation into centromeric chromatin.
SummaryHispanic populations are a valuable resource that can and should facilitate the identification of complex trait genes by means of admixture mapping (AM). In this paper we focus on a particular Hispanic population living in the San Luis Valley (SLV) in Southern Colorado.We used a set of 22 Ancestry Informative Markers (AIMs) to describe the admixture process and dynamics in this population. AIMs are defined as genetic markers that exhibit allele frequency differences between parental populations ≥ 30%, and are more informative for studying admixed populations than random markers. The ancestral proportions of the SLV Hispanic population are estimated as 62.7 ± 2.1% European, 34.1 ± 1.9% Native American and 3.2 ± 1.5% West African. We also estimated the ancestral proportions of individuals using these AIMs. Population structure was demonstrated by the excess association of unlinked markers, the correlation between estimates of admixture based on unlinked marker sets, and by a highly significant correlation between individual Native American ancestry and skin pigmentation (R 2 = 0.082, p < 0.001). We discuss the implications of these findings in disease gene mapping efforts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.