The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome.
The identification of nuclease-hypersensitive sites in an active globin gene and in the 5' regions of fruit fly heat shock genes first suggested that chromatin changes accompany gene regulation in vivo. Here we present evidence that the basic repeating units of eukaryotic chromatin, nucleosomes, are depleted from active regulatory elements throughout the Saccharomyces cerevisiae genome in vivo. We found that during rapid mitotic growth, the level of nucleosome occupancy is inversely proportional to the transcriptional initiation rate at the promoter. We also observed a partial loss of histone H3 and H4 tetramers from the coding regions of the most heavily transcribed genes. Alterations in the global transcriptional program caused by heat shock or a change in carbon source resulted in an increased nucleosome occupancy at repressed promoters, and a decreased nucleosome occupancy at promoters that became active. Nuclease-hypersensitive sites occur in species from yeast to humans and result from chromatin perturbation. Given the conservation of sequence and function among components of both chromatin and the transcriptional machinery, nucleosome depletion at promoters may be a fundamental feature of eukaryotic transcriptional regulation.
The human body contains thousands of unique cell types, each with specialized functions. Cell identity is governed in large part by gene transcription programs, which are determined by regulatory elements encoded in DNA. To identify regulatory elements active in seven cell lines representative of diverse human cell types, we used DNase-seq and FAIRE-seq (Formaldehyde Assisted Isolation of Regulatory Elements) to map ''open chromatin.'' Over 870,000 DNaseI or FAIRE sites, which correspond tightly to nucleosome-depleted regions, were identified across the seven cell lines, covering nearly 9% of the genome. The combination of DNaseI and FAIRE is more effective than either assay alone in identifying likely regulatory elements, as judged by coincidence with transcription factor binding locations determined in the same cells. Open chromatin common to all seven cell types tended to be at or near transcription start sites and to be coincident with CTCF binding sites, while open chromatin sites found in only one cell type were typically located away from transcription start sites and contained DNA motifs recognized by regulators of cell-type identity. We show that open chromatin regions bound by CTCF are potent insulators. We identified clusters of open regulatory elements (COREs) that were physically near each other and whose appearance was coordinated among one or more cell types. Gene expression and RNA Pol II binding data support the hypothesis that COREs control gene activity required for the maintenance of cell-type identity. This publicly available atlas of regulatory elements may prove valuable in identifying noncoding DNA sequence variants that are causally linked to human disease.
Histone methylation and the enzymes that mediate it are important regulators of chromatin structure and gene transcription. In particular, the histone H3 lysine 36 (K36) methyltransferase Set2 has recently been shown to associate with the phosphorylated C-terminal domain (CTD) of RNA polymerase II (RNAPII), implying that this enzyme has an important role in the transcription elongation process. Here we show that a novel domain in the C terminus of Set2 is responsible for interaction between Set2 and RNAPII. This domain, termed the Set2 Rpb1 interacting (SRI) domain, is encompassed by amino acid residues 619 to 718 in Set2 and is found to occur in a number of putative Set2 homologs from Schizosaccharomyces pombe to humans. Unexpectedly, BIACORE analysis reveals that the SRI domain binds specifically, and with high affinity, to CTD repeats that are doubly modified (serine 2 and serine 5 phosphorylated), indicating that Set2 association across the body of genes requires a specific pattern of phosphorylated RNAPII. Deletion of the SRI domain not only abolishes Set2-RNAPII interaction but also abolishes K36 methylation in vivo, indicating that this interaction is required for establishing K36 methylation on chromatin. Using 6-azauracil (6AU) as an indicator of transcription elongation defects, we found that deletion of the SRI domain conferred a strong resistance to this compound, which was identical to that observed with set2 deletion mutants. Furthermore, yeast strains carrying set2 alleles that are catalytically inactive or yeast strains bearing point mutations at K36 were also found to be resistant to 6AU. These data suggest that it is the methylation by Set2 that affects transcription elongation. In agreement with this, we have determined that deletion of SET2, its SRI domain, or amino acid substitutions at K36 result in an alteration of RNAPII occupancy levels over transcribing genes. Taken together, these data indicate K36 methylation, established by the SRI domain-mediated association of Set2 with RNAPII, plays an important role in the transcription elongation process.Successful synthesis of mRNA by RNA polymerase II (RNAPII) requires tight regulation of the initiation, elongation, and termination processes of transcription. The process of transcription elongation is affected in part by the binding of regulatory factors to the phosphorylated C-terminal domain (CTD) of the RNAPII subunit Rpb1. Recent studies have highlighted an important role for histone methylation in the process of transcription elongation. In particular, studies have shown that the Set1 and Set2 methyltransferases, which catalyze methylation of histone H3 lysines 4 (K4) and 36 (K36), respectively, are associated with RNAPII at different stages of the transcription elongation cycle. While Set1 associates with RNAPII via the Paf1 transcription elongation complex in a manner that is dependent on the serine 5 (Ser5)-phosphorylated CTD, Set2 is recruited to RNAPII in a manner that is dependent on the CTD and the Ctk1 kinase (CTDK-I) that effects...
Histone methylation is now realized to be a pivotal regulator of gene transcription. Although recent studies have shed light on a trans-histone regulatory pathway that controls H3 Lys 4 and H3 Lys 79 methylation in Saccharomyces cerevisiae, the regulatory pathway that affects Set2-mediated H3 Lys 36 methylation is unknown. To determine the functions of Set2, and identify factors that regulate its site of methylation, we genomically tagged Set2 and identified its associated proteins. Here, we show that Set2 is associated with Rbp1 and Rbp2, the two largest subunits of RNA polymerase II (RNA pol II). Moreover, we find that this association is specific for the interaction of Set2 with the hyperphosphorylated form of RNA pol II. We further show that deletion of the RNA pol II C-terminal domain (CTD) kinase Ctk1, or partial deletion of the CTD, results in a selective abolishment of H3 Lys 36 methylation, implying a pathway of Set2 recruitment to chromatin and a role for H3 Lys 36 methylation in transcription elongation. In support, chromatin immunoprecipitation assays demonstrate the presence of Set2 methylation in the coding regions, as well as promoters, of genes regulated by Ctk1 or Set2. These data document a new link between histone methylation and the transcription apparatus and uncover a regulatory pathway that is selective for H3 Lys 36 methylation.
Phenotypic plasticity of T helper 17 (Th17) cells suggests instability of chromatin structure of key genes of this lineage. Here we identify epigenetic modifications across the clustered Il17a and Il17f, and Ifng loci before and after differential IL-12 or TGFβ signaling, which induce divergent fates of Th17 cell precursors. We find that Th17 precursors have substantial remodeling of the Ifng locus but undergo critical additional modifications to enable high-level expression when stimulated by IL-12. Permissive modifications across the Il17a-Il17f locus are amplified by TGFβ signaling in Th17 cells, but are rapidly reversed downstream of IL-12–induced, STAT4– and T-bet–mediated silencing of the Rorc gene. These findings reveal substantial chromatin instability of key transcription factor and cytokine genes of Th17 cells and support a model of Th17 lineage plasticity in which cell-extrinsic factors modulate Th17 cell fates through differential effects on the epigenetic status of Th17 lineage factors.
Interplays among lineage-specific nuclear proteins, chromatin modifying enzymes, and the basal transcription machinery govern cellular differentiation, but their dynamics of action and coordination with transcriptional control are not fully understood. Alterations in chromatin structure appear to establish a permissive state for gene activation at some loci, but they play an integral role in activation at other loci. To determine the predominant roles of chromatin states and factor occupancy in directing gene regulation during differentiation, we mapped chromatin accessibility, histone modifications, and nuclear factor occupancy genome-wide during mouse erythroid differentiation dependent on the master regulatory transcription factor GATA1. Notably, despite extensive changes in gene expression, the chromatin state profiles (proportions of a gene in a chromatin state dominated by activating or repressive histone modifications) and accessibility remain largely unchanged during GATA1-induced erythroid differentiation. In contrast, gene induction and repression are strongly associated with changes in patterns of transcription factor occupancy. Our results indicate that during erythroid differentiation, the broad features of chromatin states are established at the stage of lineage commitment, largely independently of GATA1. These determine permissiveness for expression, with subsequent induction or repression mediated by distinctive combinations of transcription factors.
Histone H3.3 is a highly conserved histone H3 replacement variant in metazoans and has been implicated in many important biological processes, including cell differentiation and reprogramming. Germline and somatic mutations in H3.3 genomic incorporation pathway components or in H3.3 encoding genes have been associated with human congenital diseases and cancers, respectively. However, the role of H3.3 in mammalian development remains unclear. To address this question, we generated H3.3-null mouse models through classical genetic approaches. We found that H3.3 plays an essential role in mouse development. Complete depletion of H3.3 leads to developmental retardation and early embryonic lethality. At the cellular level, H3.3 loss triggers cell cycle suppression and cell death. Surprisingly, H3.3 depletion does not dramatically disrupt gene regulation in the developing embryo. Instead, H3.3 depletion causes dysfunction of heterochromatin structures at telomeres, centromeres, and pericentromeric regions of chromosomes, leading to mitotic defects. The resulting karyotypical abnormalities and DNA damage lead to p53 pathway activation. In summary, our results reveal that an important function of H3.3 is to support chromosomal heterochromatic structures, thus maintaining genome integrity during mammalian development.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.