Cellular development, morphology and function are governed by precise patterns of gene expression. These are established by the coordinated action of genomic regulatory elements known as enhancers or cis-regulatory modules. More than 30 years after the initial discovery of enhancers, many of their properties have been elucidated; however, despite major efforts, we only have an incomplete picture of enhancers in animal genomes. In this Review, we discuss how properties of enhancer sequences and chromatin are used to predict enhancers in genome-wide studies. We also cover recently developed high-throughput methods that allow the direct testing and identification of enhancers on the basis of their activity. Finally, we discuss recent technological advances and current challenges in the field of regulatory genomics.
Transcriptional enhancers are crucial regulators of gene expression and animal development and the characterization of their genomic organization, spatiotemporal activities and sequence properties is a key goal in modern biology. Here we characterize the in vivo activity of 7,705 Drosophila melanogaster enhancer candidates covering 13.5% of the non-coding non-repetitive genome throughout embryogenesis. 3,557 (46%) candidates are active, suggesting a high density with 50,000 to 100,000 developmental enhancers genome-wide. The vast majority of enhancers display specific spatial patterns that are highly dynamic during development. Most appear to regulate their neighbouring genes, suggesting that the cis-regulatory genome is organized locally into domains, which are supported by chromosomal domains, insulator binding and genome evolution. However, 12 to 21 per cent of enhancers appear to skip non-expressed neighbours and regulate a more distal gene. Finally, we computationally identify cis-regulatory motifs that are predictive and required for enhancer activity, as we validate experimentally. This work provides global insights into the organization of an animal regulatory genome and the make-up of enhancer sequences and confirms and generalizes principles from previous studies. All enhancer patterns are annotated manually with a controlled vocabulary and all results are available through a web interface (http://enhancers.starklab.org), including the raw images of all microscopy slides for manual inspection at arbitrary zoom levels.
Our results reveal how universal clock circuits can regulate tissue-specific rhythms and, more generally, provide insights into the mechanism by which universal TFs can be modulated to drive tissue-specific programs of gene expression.
One of the most important questions in biology is how transcription factors (TFs) and cofactors control enhancer function and thus gene expression. Enhancer activation usually requires combinations of several TFs, indicating that TFs function synergistically and combinatorially. However, while TF binding has been extensively studied, little is known about how combinations of TFs and cofactors control enhancer function once they are bound. It is typically unclear which TFs participate in combinatorial enhancer activation, whether different TFs form functionally distinct groups, or if certain TFs might substitute for each other in defined enhancer contexts. Here we assess the potential regulatory contributions of TFs and cofactors to combinatorial enhancer control with enhancer complementation assays. We recruited GAL4-DNA-binding-domain fusions of 812 Drosophila TFs and cofactors to 24 enhancer contexts and measured enhancer activities by 82,752 luciferase assays in S2 cells. Most factors were functional in at least one context, yet their contributions differed between contexts and varied from repression to activation (up to 289-fold) for individual factors. Based on functional similarities across contexts, we define 15 groups of TFs that differ in developmental functions and protein sequence features. Similar TFs can substitute for each other, enabling enhancer re-engineering by exchanging TF motifs, and TF-cofactor pairs cooperate during enhancer control and interact physically. Overall, we show that activators and repressors can have diverse regulatory functions that typically depend on the enhancer context. The systematic functional characterization of TFs and cofactors should further our understanding of combinatorial enhancer control and gene regulation.
HOT (highly occupied target) regions bound by many transcription factors are considered to be one of the most intriguing findings of the recent modENCODE reports, yet their functions have remained unclear. We tested 108 Drosophila melanogaster HOT regions in transgenic embryos with site-specifically integrated transcriptional reporters. In contrast to prior expectations, we found 102 (94%) to be active enhancers during embryogenesis and to display diverse spatial and temporal patterns, reminiscent of expression patterns for important developmental genes. Remarkably, HOT regions strongly activate nearby genes and are required for endogenous gene expression, as we show using bacterial artificial chromosome (BAC) transgenesis. HOT enhancers have a distinct cis-regulatory signature with enriched sequence motifs for the global activators Vielfaltig, also known as Zelda, and Trithoraxlike, also known as GAGA. This signature allows the prediction of HOT versus control regions from the DNA sequence alone. Here, we show that Drosophila HOT regions function as transcriptional enhancers with diverse activity patterns. While a large number of bound TFs is the defining feature of HOT regions, many TFs seem to be bound neutrally without any apparent contribution to enhancer activity. HOT enhancers are characterized by a distinct and predictive cis-regulatory signature, which includes motifs for Vielfaltig/Zelda (ZLD), a recently reported activator of the early Drosophila genome (Liang et al. 2008;Harrison et al. 2011;Nien et al. 2011), and Trithoraxlike/GAGA (GAGA), a TF known to form homomeric and heteromeric complexes (Bardwell and Treisman 1994) and to be required for the generation and maintenance of nucleosome-free regions (Croston et al. 1991;Nakayama et al. 2007). Results and Discussion HOT regions function as early embryonic enhancers with diverse patternsWe tested a representative set of 108 D. melanogaster HOT regions (see the Materials and Methods; Supplemental Tables 1, 2) in transgenic embryos with sitespecifically integrated transcriptional reporters (Fig. 1A). Strikingly, 94% (102) of these HOT regions drove reporter expression in a specific pattern during embryogenesis. In contrast, only 39% of control regions (16 of 41) functioned as enhancers, including 11 of 21 regions chosen to contain TF-binding sites and five of 20 regions chosen to contain no known binding sites (see the Materials and Methods; Supplemental Fig. S1; Supplemental Table 1). This enrichment of HOT over control regions is highest at stages 3-10 (corresponding to 1-5 h after fertilization) (Fig. 1B), at which most of the chromatin immunoprecipitation (ChIP) experiments leading to the definition of the HOT regions had been performed (MacArthur et al. 2009; The modENCODE Consortium 2010).Contrary to expectations that HOT enhancers might constitute a particular class of enhancers (for example, with ubiquitous activity), they display highly diverse spatial activity patterns in all major presumptive tissues of the blastoderm embryo ( Fig. 1C; Sup...
Gene expression is determined by genomic elements called enhancers, which contain short motifs bound by different transcription factors (TFs). However, how enhancer sequences and TF motifs relate to enhancer activity is unknown, and general sequence requirements for enhancers or comprehensive sets of important enhancer sequence elements have remained elusive. Here, we computationally dissect thousands of functional enhancer sequences from three different Drosophila cell lines. We find that the enhancers display distinct cis-regulatory sequence signatures, which are predictive of the enhancers' cell type-specific or broad activities. These signatures contain transcription factor motifs and a novel class of enhancer sequence elements, dinucleotide repeat motifs (DRMs). DRMs are highly enriched in enhancers, particularly in enhancers that are broadly active across different cell types. We experimentally validate the importance of the identified TF motifs and DRMs for enhancer function and show that they can be sufficient to create an active enhancer de novo from a nonfunctional sequence. The function of DRMs as a novel class of general enhancer features that are also enriched in human regulatory regions might explain their implication in several diseases and provides important insights into gene regulation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.