Lineage-ambiguous leukemias are high-risk malignancies of poorly understood genetic basis. Here, we describe a distinct subgroup of acute leukemia with expression of myeloid, T lymphoid, and stem cell markers driven by aberrant allele-specific deregulation of BCL11B, a master transcription factor responsible for thymic T-lineage commitment and specification. Mechanistically, this deregulation was driven by chromosomal rearrangements that juxtapose BCL11B to superenhancers active in hematopoietic progenitors, or focal amplifications that generate a superenhancer from a noncoding element distal to BCL11B. Chromatin conformation analyses demonstrated long-range interactions of rearranged enhancers with the expressed BCL11B allele and association of BCL11B with activated hematopoietic progenitor cell cis-regulatory elements, suggesting BCL11B is aberrantly co-opted into a gene regulatory network that drives transformation by maintaining a progenitor state. These data support a role for ectopic BCL11B expression in primitive hematopoietic cells mediated by enhancer hijacking as an oncogenic driver of human lineage-ambiguous leukemia. Significance: Lineage-ambiguous leukemias pose significant diagnostic and therapeutic challenges due to a poorly understood molecular and cellular basis. We identify oncogenic deregulation of BCL11B driven by diverse structural alterations, including de novo superenhancer generation, as the driving feature of a subset of lineage-ambiguous leukemias that transcend current diagnostic boundaries. This article is highlighted in the In This Issue feature, p. 2659
SUMMARY Early B cell development is characterized by large scale Igh locus contraction prior to V(D)J recombination to facilitate a highly diverse Ig repertoire. However, an understanding of the molecular architecture that mediates locus contraction remains unclear. We have combined high resolution chromosome conformation capture (3C) techniques with 3D DNA FISH to identify three conserved topological sub-domains. Each of these topological folds encompasses a major VH gene family that become juxtaposed in pro-B cells via Mb-scale chromatin looping. The transcription factor Pax5 organizes the sub-domain that spans the VHJ558 gene family. In its absence the J558 VH genes fail to associate with the proximal VH genes, thereby providing a plausible explanation for reduced VHJ558 gene rearrangements in Pax5-deficient pro-B cells. We propose that Igh locus contraction is the cumulative effect of several independently controlled chromatin sub-domains that provide the structural infrastructure to coordinate optimal antigen receptor assembly.
Over 500 genetic loci have been associated with risk of cardiovascular diseases (CVDs); however, most loci are located in gene-distal non-coding regions and their target genes are not known. Here, we generated high-resolution promoter capture Hi-C (PCHi-C) maps in human induced pluripotent stem cells (iPSCs) and iPSC-derived cardiomyocytes (CMs) to provide a resource for identifying and prioritizing the functional targets of CVD associations. We validate these maps by demonstrating that promoters preferentially contact distal sequences enriched for tissue-specific transcription factor motifs and are enriched for chromatin marks that correlate with dynamic changes in gene expression. Using the CM PCHi-C map, we linked 1999 CVD-associated SNPs to 347 target genes. Remarkably, more than 90% of SNP-target gene interactions did not involve the nearest gene, while 40% of SNPs interacted with at least two genes, demonstrating the importance of considering long-range chromatin interactions when interpreting functional targets of disease loci.
24Over 500 genetic loci have been associated with risk of cardiovascular diseases (CVDs), 25 however most loci are located in gene-distal non-coding regions and their target genes are not known. 26Here, we generated high-resolution promoter capture Hi-C (PCHi-C) maps in human induced 27 pluripotent stem cells (iPSCs) and iPSC-derived cardiomyocytes (CMs) to provide a resource for 28 identifying and prioritizing the functional targets of CVD associations. We validate these maps by 29 demonstrating that promoters preferentially contact distal sequences enriched for tissue-specific 30 transcription factor motifs and are enriched for chromatin marks that correlate with dynamic changes in 31 was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
Disease-associated genetic variants that lie in non-coding regions found by genome-wide association studies are thought to alter the functionality of transcription regulatory elements and target gene expression. To uncover causal genetic variants, variant regulatory elements and their target genes, here we cross-reference human transcriptomic, epigenomic and chromatin conformation datasets. Of 104 genetic variant regions associated with atrial fibrillation candidate target genes are prioritized. We optimize EMERGE enhancer prediction and use accessible chromatin profiles of human atrial cardiomyocytes to more accurately predict cardiac regulatory elements and identify hundreds of sub-threshold variants that co-localize with regulatory elements. Removal of mouse homologues of atrial fibrillation-associated regions in vivo uncovers a distal regulatory region involved in Gja1 (Cx43) expression. Our analyses provide a shortlist of genes likely affected by atrial fibrillation-associated variants and provide variant regulatory elements in each region that link genetic variation and target gene regulation, helping to focus future investigations.
Conformation of antigen receptor gene loci spatially juxtaposes rearranging gene segments in the appropriate cell lineage and developmental stage. We describe a three-step pathway that establishes the structure of the 2.8-Mb immunoglobulin heavy chain gene (IgH) locus in pro-B cells. Each step uses a different transcription factor and leads to increasing levels of structural organization. CTCF mediates one level of compaction that folds the locus into several 250- to 400-kb subdomains, and Pax5 further compacts the 2-Mb region that encodes variable (VH) gene segments. The 5′ and 3′ domains are brought together by the transcription factor YY1 to establish the configuration within which gene recombination initiates. Such stepwise mechanisms may apply more generally to establish regulatory fine structure within megabase-sized topologically associated domains.
Supplemental Digital Content is available in the text.
ATAC-seq is a high-throughput sequencing technique that identifies open chromatin. Depending on the cell type, ATAC-seq samples may contain ~20–80% of mitochondrial sequencing reads. As the regions of open chromatin of interest are usually located in the nuclear genome, mitochondrial reads are typically discarded from the analysis. We tested two approaches to decrease wasted sequencing in ATAC-seq libraries generated from lymphoblastoid cell lines: targeted cleavage of mitochondrial DNA fragments using CRISPR technology and removal of detergent from the cell lysis buffer. We analyzed the effects of these treatments on the number of usable (unique, non-mitochondrial) reads and the number and quality of peaks called, including peaks identified in enhancers and transcription start sites. Both treatments resulted in considerable reduction of mitochondrial reads (1.7 and 3-fold, respectively). The removal of detergent, however, resulted in increased background and fewer peaks. The highest number of peaks and highest quality data was obtained by preparing samples with the original ATAC-seq protocol (using detergent) and treating them with CRISPR. This strategy reduced the amount of sequencing required to call a high number of peaks, which could lead to cost reduction when performing ATAC-seq on large numbers of samples and in cell types that contain a large amount of mitochondria.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.