We describe Hi-C, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. We constructed spatial proximity maps of the human genome with Hi-C at a resolution of 1Mb. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free conformation that enables maximally dense packing while preserving the ability to easily fold and
SummaryAs the premier model organism in biomedical research, the laboratory mouse shares the majority of protein-coding genes with humans, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications, and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of other sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Regulatory factor binding to genomic DNA protects the underlying sequence from cleavage by DNaseI, leaving nucleotide-resolution footprints. Using genomic DNaseI footprinting across 41 diverse cell and tissue types, we detected 45 million factor occupancy events within regulatory regions, representing differential binding to 8.4 million distinct short sequence elements. Here we show that this small genomic sequence compartment, roughly twice the size of the exome, encodes an expansive repertoire of conserved recognition sequences for DNA-binding proteins that nearly doubles the size of the human cis-regulatory lexicon. We find that genetic variants affecting allelic chromatin states are concentrated in footprints, and that these elements are preferentially sheltered from DNA methylation. High-resolution DNaseI cleavage patterns mirror nucleotide-level evolutionary conservation and track the crystallographic topography of protein-DNA interfaces, indicating that transcription factor structure has been evolutionarily imprinted on the human genome sequence. We identify a stereotyped 50 base-pair footprint that precisely defines the site of transcript origination within thousands of human promoters. Finally, we describe a large collection of novel regulatory factor recognition motifs that are highly conserved in both sequence and function, and exhibit cell-selective occupancy patterns that closely parallel major regulators of development, differentiation, and pluripotency.
Recent evidence suggests that long-range enhancers and gene promoters are in close proximity, which might reflect the formation of chromatin loops. Here, we examined the mechanism for DNA looping at the beta-globin locus. By using chromosome conformation capture (3C), we show that the hematopoietic transcription factor GATA-1 and its cofactor FOG-1 are required for the physical interaction between the beta-globin locus control region (LCR) and the beta-major globin promoter. Kinetic studies reveal that GATA-1-induced loop formation correlates with the onset of beta-globin transcription and occurs independently of new protein synthesis. GATA-1 occupies the beta-major globin promoter normally in fetal liver erythroblasts from mice lacking the LCR, suggesting that GATA-1 binding to the promoter and LCR are independent events that occur prior to loop formation. Together, these data demonstrate that GATA-1 and FOG-1 are essential anchors for a tissue-specific chromatin loop, providing general insights into long-range enhancer function.
The contribution of changes in cis-regulatory elements or trans-acting factors to interspecies differences in gene expression is not well understood. The mammalian β-globin loci have served as a paradigm for gene regulation during development. Transgenic mice harboring the human β-globin locus, consisting of the linked embryonic (ε), fetal (γ) and adult (β) genes, have been used as a model system to study the temporal switch from fetal to adult hemoglobin, as occurs in humans. We show that the human γ-globin genes in these mice behave as murine embryonic globin genes, revealing a limitation of the model and demonstrating that critical differences in the trans-acting milieu have arisen during mammalian evolution. We show that the expression of BCL11A, a repressor of human γ-globin expression identified through genome-wide association studies, differs between mouse and human. Developmental silencing of the mouse embryonic globin and human γ-globin genes fails to occur in mice in the absence of BCL11A. Thus, BCL11A is a critical mediator of species-divergent globin switching. By comparing the ontogeny of β-globin gene regulation in mice and humans, we have shown that alterations in expression of a trans-acting factor constitute a critical driver of gene expression changes during evolution.
To study the evolutionary dynamics of regulatory DNA, we mapped >1.3 million deoxyribonuclease I-hypersensitive sites (DHSs) in 45 mouse cell and tissue types, and systematically compared these with human DHS maps from orthologous compartments. We found that the mouse and human genomes have undergone extensive cis-regulatory rewiring that combines branch-specific evolutionary innovation and loss with widespread repurposing of conserved DHSs to alternative cell fates, and that this process is mediated by turnover of transcription factor (TF) recognition elements. Despite pervasive evolutionary remodeling of the location and content of individual cis-regulatory regions, within orthologous mouse and human cell types the global fraction of regulatory DNA bases encoding recognition sites for each TF has been strictly conserved. Our findings provide new insights into the evolutionary forces shaping mammalian regulatory DNA landscapes.
We have examined the relationship between nuclear localization and transcriptional activity of the endogenous murine -globin locus during erythroid differentiation. Murine fetal liver cells were separated into distinct erythroid maturation stages by fluorescence-activated cell sorting, and the nuclear position of the locus was determined at each stage. We find that the -globin locus progressively moves away from the nuclear periphery with increasing maturation. Contrary to the prevailing notion that the nuclear periphery is a repressive compartment in mammalian cells,  major -globin expression begins at the nuclear periphery prior to relocalization. However, relocation of the locus to the nuclear interior with maturation is accompanied by an increase in  major -globin transcription. The distribution of nuclear polymerase II (Pol II) foci also changes with erythroid differentiation: Transcription factories decrease in number and contract toward the nuclear interior. Moreover, both efficient relocalization of the -globin locus from the periphery and its association with hyperphosphorylated Pol II transcription factories require the locus control region (LCR). These results suggest that the LCR-dependent association of the -globin locus with transcriptionally engaged Pol II foci provides the driving force for relocalization of the locus toward the nuclear interior during erythroid maturation.[Keywords: -globin locus; nuclear organization; nuclear periphery; erythroid differentiation; RNA polymerase II; fluorescence in situ hybridization] Supplemental material is available at http://www.genesdev.org.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.