The genome is organised via CTCF/Cohesin binding sites, which partition chromosomes into 1-5Mb topologically associated domains (TADs), and further into smaller sub-domains (sub-TADs). Here we examined in vivo an ~80kb sub-TAD, containing the mouse α-globin gene cluster, lying within a ~1Mb TAD. We find that the sub-TAD is flanked by predominantly convergent CTCF/cohesin sites which are ubiquitously bound by CTCF but only interact during erythropoiesis, defining a self-interacting erythroid compartment. Whereas the α-globin regulatory elements normally act solely on promoters downstream of the enhancers, removal of a conserved upstream CTCF/cohesin boundary extends the sub-TAD to adjacent upstream CTCF/cohesin binding sites. The α-globin enhancers now interact with the flanking chromatin, upregulating expression of genes within this extended sub-TAD. Rather than acting solely as a barrier to chromatin modification, CTCF/cohesin boundaries in this sub-TAD delimit the region of chromatin to which enhancers have access and within which they interact with receptive promoters.
Predicting the impact of non-coding genetic variation requires interpreting it in the context of 3D genome architecture. We have developed deepC, a transfer learning based deep neural network that accurately predicts genome folding from megabase-scale DNA sequence. DeepC predicts domain boundaries at high-resolution, learns the sequence determinants of genome folding and predicts the impact of both large-scale structural and single base pair variations.
Mammalian gene expression patterns are controlled by regulatory elements, which interact within topologically associating domains (TADs). The relationship between activation of regulatory elements, formation of structural chromatin interactions and gene expression during development is unclear. Here, we present Tiled-C, a low-input chromosome conformation capture (3C) technique. We use this approach to study chromatin architecture at high spatial and temporal resolution through in vivo mouse erythroid differentiation. Integrated analysis of chromatin accessibility and single-cell expression data shows that regulatory elements gradually become accessible within pre-existing TADs during early differentiation. This is followed by structural reorganization within the TAD and formation of specific contacts between enhancers and promoters. Our high-resolution data show that these enhancer-promoter interactions are not established prior to gene expression, but formed gradually during differentiation, concomitant with progressive upregulation of gene activity. Together, these results provide new insight into the close, interdependent relationship between chromatin architecture and gene regulation during development.
β-Thalassemia is one of the most common inherited anemias, with no effective cure for most patients. The pathophysiology reflects an imbalance between α- and β-globin chains with an excess of free α-globin chains causing ineffective erythropoiesis and hemolysis. When α-thalassemia is co-inherited with β-thalassemia, excess free α-globin chains are reduced significantly ameliorating the clinical severity. Here we demonstrate the use of CRISPR/Cas9 genome editing of primary human hematopoietic stem/progenitor (CD34+) cells to emulate a natural mutation, which deletes the MCS-R2 α-globin enhancer and causes α-thalassemia. When edited CD34+ cells are differentiated into erythroid cells, we observe the expected reduction in α-globin expression and a correction of the pathologic globin chain imbalance in cells from patients with β-thalassemia. Xenograft assays show that a proportion of the edited CD34+ cells are long-term repopulating hematopoietic stem cells, demonstrating the potential of this approach for translation into a therapy for β-thalassemia.
Mouse embryonic stem cells (mESCs) can be manipulated in vitro to recapitulate the process of erythropoiesis, during which multipotent cells undergo lineage specification, differentiation and maturation to produce erythroid cells. Although useful for identifying specific progenitors and precursors, this system has not been fully exploited as a source of cells to analyse erythropoiesis. Here, we establish a protocol in which characterised erythroblasts can be isolated in a scalable manner from differentiated embryoid bodies (EBs). Using transcriptional and epigenetic analysis, we demonstrate that this system faithfully recapitulates normal primitive erythropoiesis and fully reproduces the effects of natural and engineered mutations seen in primary cells obtained from mouse models. We anticipate this system to be of great value in reducing the time and costs of generating and maintaining mouse lines in a number of research scenarios.
Chromosome conformation capture (3C) provides an adaptable tool for studying diverse biological questions. Current 3C methods generally provide either low-resolution interaction profiles across the entire genome, or high-resolution interaction profiles at limited numbers of loci. Due to technical limitations, generation of reproducible high-resolution interaction profiles has not been achieved at genome-wide scale. Here, to overcome this barrier, we systematically test each step of 3C and report two improvements over current methods. We show that up to 30% of reporter events generated using the popular in situ 3C method arise from ligations between two individual nuclei, but this noise can be almost entirely eliminated by isolating intact nuclei after ligation. Using Nuclear-Titrated Capture-C, we generate reproducible high-resolution genome-wide 3C interaction profiles by targeting 8055 gene promoters in erythroid cells. By pairing high-resolution 3C interaction calls with nascent gene expression we interrogate the role of promoter hubs and super-enhancers in gene regulation.
The T-box transcription factor (TF) Eomes is a key regulator of cell fate decisions during early mouse development. The cis-acting regulatory elements that direct expression in the anterior visceral endoderm (AVE), primitive streak (PS) and definitive endoderm (DE) have yet to be defined. Here, we identified three gene-proximal enhancer-like sequences (PSE_a, PSE_b and VPE) that faithfully activate tissue-specific expression in transgenic embryos. However, targeted deletion experiments demonstrate that PSE_a and PSE_b are dispensable, and only VPE is required for optimal Eomes expression in vivo. Embryos lacking this enhancer display variably penetrant defects in anterior-posterior axis orientation and DE formation. Chromosome conformation capture experiments reveal VPE-promoter interactions in embryonic stem cells (ESCs), prior to gene activation. The locus resides in a large (500 kb) pre-formed compartment in ESCs and activation during DE differentiation occurs in the absence of 3D structural changes. ATAC-seq analysis reveals that VPE, PSE_a and four additional putative enhancers display increased chromatin accessibility in DE that is associated with Smad2/3 binding coincident with transcriptional activation. By contrast, activation of the Eomes target genes Foxa2 and Lhx1 is associated with higher order chromatin reorganisation. Thus, diverse regulatory mechanisms govern activation of lineage specifying TFs during early development.
Several distinct differentiation protocols for deriving pancreatic progenitors (PPs) from human pluripotent stem cells have been described, but it remains to be shown how similar the PPs are across protocols and how well they resemble their in vivo counterparts. Here, we evaluated three differentiation protocols, performed RNA and assay for transposase-accessible chromatin using sequencing on isolated PPs derived with these, and compared them with fetal human pancreas populations. This enabled us to define a shared transcriptional and epigenomic signature of the PPs, including several genes not previously implicated in pancreas development. Furthermore, we identified a significant and previously unappreciated cross-protocol variation of the PPs through multi-omics analysis and demonstrate how such information can be applied to refine differentiation protocols for derivation of insulin-producing beta-like cells. Together, our study highlights the importance of a detailed characterization of defined cell populations derived from distinct differentiation protocols and provides a valuable resource for exploring human pancreatic development.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.