Increasing evidence suggests that interactions between regulatory genomic elements play an important role in regulating gene expression. We generated a genome-wide interaction map of regulatory elements in human cells (ENCODE tier 1 cells, K562, GM12878) using Chromatin Interaction Analysis by Paired-End Tag sequencing (ChIA-PET) experiments targeting six broadly distributed factors. Bound regions covered 80% of DNase I hypersensitive sites including 99.7% of TSS and 98% of enhancers. Correlating this map with ChIP-seq and RNA-seq data sets revealed cohesin, CTCF, and ZNF143 as key components of three-dimensional chromatin structure and revealed how the distal chromatin state affects gene transcription. Comparison of interactions between cell types revealed that enhancer-promoter interactions were highly cell-type-specific. Construction and comparison of distal and proximal regulatory networks revealed stark differences in structure and biological function. Proximal binding events are enriched at genes with housekeeping functions, while distal binding events interact with genes involved in dynamic biological processes including response to stimulus. This study reveals new mechanistic and functional insights into regulatory region organization in the nucleus.
Alternative splicing shapes mammalian transcriptomes, with many RNA molecules undergoing multiple distant alternative splicing events. Comprehensive transcriptome analysis, including analysis of exon co-association in the same molecule, requires deep, long-read sequencing. Here we introduce an RNA sequencing method, synthetic long-read RNA sequencing (SLR-RNA-seq), in which small pools (≤1,000 molecules/pool, ≤1 molecule/gene for most genes) of full-length cDNAs are amplified, fragmented and short-read-sequenced. We demonstrate that these RNA sequences reconstructed from the short reads from each of the pools are mostly close to full length and contain few insertion and deletion errors. We report many previously undescribed isoforms (human brain: ∼13,800 affected genes, 14.5% of molecules; mouse brain ∼8,600 genes, 18% of molecules) and up to 165 human distant molecularly associated exon pairs (dMAPs) and distant molecularly and mutually exclusive pairs (dMEPs). Of 16 associated pairs detected in the mouse brain, 9 are conserved in human. Our results indicate conserved mechanisms that can produce distant but phased features on transcript and proteome isoforms.
Author contributions L.X. and A.M contributed equally to this study. S.M., E.M. and A.B. planned the study, with input from M.P.S. and J.W. S.M. and E.M. performed the reprogramming experiments, and analysed and interpreted data. S.M., E.M. and L.X. wrote the manuscript with the help of A.B. S.M. generated, processed and analysed bulk and single-cell RNA-seq datasets, analysed the metabolomics data, and performed most THY1-related and conditioned medium experiments. E.M. generated and propagated transgene-free iPS cell lines. All other studies were done by both E.M and S.M., unless otherwise noted. A.M. and S.M. performed wound healing experiments under the supervision of M.T.L. L.X. helped with reprogramming, FACS, and immunofluorescence experiments. F.J. generated the in vitro single-cell RNA-seq data under the supervision of M.P.S. R.S. generated the ChIP-seq libraries under the supervision of J.W. K.H. helped with statistics and PAGODA analysis. X.L. performed metabolomics experiments and helped with metabolomics data analysis and validation under the supervision of M.P.S. K.D. helped with reprogramming and western blotting experiments. L.P. helped with reprogramming and RT-qPCR experiments. C.E.A. and Y.S. performed the induced neuron reprogramming experiment under the supervision of M.W. B.A.B. helped with analysis of the epigenomic data. A.L.S.C. identified and collected the human samples. All authors discussed the results and commented on the manuscript. Data availabilityAll raw sequencing reads for population RNA-seq, ChIP-seq and single-cell RNA-seq data can be found under BioProject PRJNA316110. The command and configuration files, in addition to a list of all versioned dependencies present in the running environment, are available on the Github repository for this paper (https://github.com/brunetlab/Mahmoudi_et_al_2018) (except for the code for the processing of metabolomics data, which is available upon request).
Left ventricular non-compaction (LVNC) is the third most prevalent cardiomyopathy in children and its pathogenesis has been associated with the developmental defect of the embryonic myocardium. We show that patient-specific induced pluripotent stem cell-derived cardiomyocytes (iPSC-CMs) generated from LVNC patients carrying a mutation in the cardiac transcription factor TBX20 recapitulate a key aspect of the pathological phenotype at the single-cell level and was associated with perturbed transforming growth factor beta (TGFβ) signaling. LVNC iPSC-CMs have decreased proliferative capacity due to abnormal activation of TGFβ signaling. TBX20 regulates the expression of TGFβ signaling modifiers including a known genetic cause of LVNC, PRDM16, and genome editing of PRDM16 caused proliferation defects in iPSC-CMs. Inhibition of TGFβ signaling and genome correction of the TBX20 mutation were sufficient to reverse the disease phenotype. Our study demonstrates that iPSC-CMs are a useful tool for the exploration of pathological mechanisms underlying poorly understood cardiomyopathies including LVNC.
The ubiquitous presence of long noncoding RNAs (lncRNAs) in eukaryotes points to the importance of understanding how their sequences impact function. As many lncRNAs regulate nuclear events and thus must localize to nuclei, we analyzed the sequence requirements for nuclear localization in an intergenic lncRNA named BORG (BMP2-OP1-responsive gene), which is both spliced and polyadenylated but is strictly localized in nuclei. Subcellular localization of BORG was not dependent on the context or level of its expression or decay but rather depended on the sequence of the mature, spliced transcript. Mutational analyses indicated that nuclear localization of BORG was mediated through a novel RNA motif consisting of the pentamer sequence AGCCC with sequence restrictions at positions ؊8 (T or A) and ؊3 (G or C) relative to the first nucleotide of the pentamer. Mutation of the motif to a scrambled sequence resulted in complete loss of nuclear localization, while addition of even a single copy of the motif to a cytoplasmically localized RNA was sufficient to impart nuclear localization. Further, the presence of this motif in other cellular RNAs showed a direct correlation with nuclear localization, suggesting that the motif may act as a general nuclear localization signal for cellular RNAs.
Identifying bacterial strains in metagenome and microbiome samples using computational analyses of short-read sequence remains a difficult problem. Here, we present an analysis of a human gut microbiome using on Tru-seq synthetic long reads combined with new computational tools for metagenomic long-read assembly, variant-calling and haplotyping (Nanoscope and Lens). Our analysis identifies 178 bacterial species of which 51 were not found using short sequence reads alone. We recover bacterial contigs that comprise multiple operons, including 22 contigs of >1Mbp. Extensive intraspecies variation among microbial strains in the form of haplotypes that span up to hundreds of Kbp can be observed using our approach. Our method incorporates synthetic long-read sequencing technology with standard shotgun approaches to move towards rapid, precise and comprehensive analyses of metagenome and microbiome samples.
Long noncoding RNAs (lncRNAs) have emerged as potent regulators of breast cancer development and progression, including the metastatic spread of disease. Through in silico and biological analyses, we identified a novel lncRNA, BMP/OP-Responsive Gene (BORG), whose expression directly correlates with aggressive breast cancer phenotypes, as well as with metastatic competence and disease recurrence in multiple clinical cohorts. Mechanistically, BORG elicits the metastatic outgrowth of latent breast cancer cells by promoting the localization and transcriptional repressive activity of TRIM28, which binds BORG and induces substantial alterations in carcinoma proliferation and survival. Moreover, inhibiting BORG expression in metastatic breast cancer cells impedes their metastatic colonization of the lungs of mice, implying that BORG acts as a novel driver of the genetic and epigenetic alterations that underlie the acquisition of metastatic and recurrent phenotypes by breast cancer cells.
Summary Insulin-like growth factor 2 mRNA binding protein 3 (IGF2BP3) expression correlates with malignancy. But its role(s) in pathogenesis remain enigmatic. Here, we interrogated the IGF2BP3-RNA interaction network in pancreatic ductal adenocarcinoma (PDAC) cells. Using a combination of genome-wide approaches we identify 164 direct mRNA targets of IGF2BP3. These transcripts encode proteins enriched for functions such as cell migration, proliferation and adhesion. Loss of IGF2BP3 reduced PDAC cell invasiveness and remodeled focal adhesion junctions. Individual-nucleotide resolution crosslinking immunoprecipitation (iCLIP) revealed significant overlap of IGF2BP3 and miRNA binding sites. IGF2BP3 promotes association of the RNA induced silencing complex (RISC) with specific transcripts. Our results show that IGF2BP3 influences a malignancy-associated RNA regulon by modulating miRNA-mRNA interactions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.