SUMMARY Enhancers control the correct temporal and cell type-specific activation of gene expression in higher eukaryotes. Knowing their properties, regulatory activity and targets is crucial to understand the regulation of differentiation and homeostasis. We use the FANTOM5 panel of samples covering the majority of human tissues and cell types to produce an atlas of active, in vivo transcribed enhancers. We show that enhancers share properties with CpG-poor mRNA promoters but produce bidirectional, exosome-sensitive, relatively short unspliced RNAs, the generation of which is strongly related to enhancer activity. The atlas is used to compare regulatory programs between different cells at unprecedented depth, identify disease-associated regulatory single nucleotide polymorphisms, and classify cell type-specific and ubiquitous enhancers. We further explore the utility of enhancer redundancy, which explains gene expression strength rather than expression patterns. The online FANTOM5 enhancer atlas represents a unique resource for studies on cell type-specific enhancers and gene regulation.
Ntini et al. 3The non-protein-coding part of the human genome is pervasively transcribed into a large diversity of non-coding (nc) RNA 1 . A substantial fraction of this material derives from, or near, active gene promoters, that are producing a range of small-( 1-6 ) and long non-coding RNA (lncRNA) 7 . Indeed, it has been estimated that >60% of lncRNAs in human embryonic stem cells derive from promoters of active proteincoding genes 8 . Although some lncRNAs have reported functions, these species are generally kept at low abundance by cellular degradation activities 9,10 . For example, we previously coupled depletion of the major nuclear 3'-5' exonucleolytic activity, the RNA exosome, with tiling microarrays to reveal PROMPTs closely upstream of active human gene promoters 9 . PROMPTs are 5'capped, >100nt long and 3'end adenylated in the absence of exosome activity 11 . The mechanism underlying the efficient exosome-mediated suppression of these lncRNAs, while preserving the promoter-downstream mRNA, remains enigmatic.Here, we couple exosome-depletion to high-throughput 5'end-, 3'end-and regular RNA-sequencing (RNAseq) to create a genome-wide map of PROMPTs. Our results demonstrate that PROMPT transcription initiates antisense with respect to the downstream gene. We suggest that such initiating RNAPII, if stalled at a PROMPT-TSS proximal position, can elicit the production of previously reported TSSa-RNA.Sequence motifs around PROMPT 3'ends adhere to a pA site consensus and are significantly more abundant upstream than downstream of gene promoters. This provides a directional RNA output from human promoters by rapidly terminating antisense transcription and enforcing degradation of its RNA product. RESULTS PROMPTs initiate from bi-directional promoter activityTo obtain strand-specific and positional information of PROMPTs, we first subjected total RNA from HeLa cells, that had been treated with either a control (ctrl) eGFP siRNA or RRP40 siRNA ( Supplementary Fig. 1a), to regular RNA sequencing (RNAseq) as well as cap-selected RNA 5'end sequencing (Cap Analysis of Gene Expression (CAGE)). We focused our analysis on protein-coding genes and therefore considered reads mapping to the -3kb to +1kb regions of 2428 UCSC gene promoters, which were selected not to overlap any other annotated mRNAs. When aligned to the TSSs of these promoters, both RNAseq-and CAGE-data disclosed a strong presence of exosome-sensitive transcripts originating closely upstream of the gene TSS Ntini et al. 4(average peak CAGE position at -110bp) and commencing in the antisense direction relative to the neighboring mRNA TSS (Fig. 1a bottom panel, compare 'ctrl' and 'RRP40' plots). Only minor signal was detected in the sense direction of the same region. Whereas the abundance of antisense PROMPT (asPROMPT) CAGE tags increased by an average of 8-fold upon RRP40 depletion, the corresponding sense CAGE signals of the same region were largely unaffected (Fig. 1b, P<2e-16, twosided t-test). This predominant occurrence of asPROMPTs was also visi...
Mammalian genomes are pervasively transcribed, yielding a complex transcriptome with high variability in composition and cellular abundance. Although recent efforts have identified thousands of new long non-coding (lnc) RNAs and demonstrated a complex transcriptional repertoire produced by protein-coding (pc) genes, limited progress has been made in distinguishing functional RNA from spurious transcription events. This is partly due to present RNA classification, which is typically based on technical rather than biochemical criteria. Here we devise a strategy to systematically categorize human RNAs by their sensitivity to the ribonucleolytic RNA exosome complex and by the nature of their transcription initiation. These measures are surprisingly effective at correctly classifying annotated transcripts, including lncRNAs of known function. The approach also identifies uncharacterized stable lncRNAs, hidden among a vast majority of unstable transcripts. The predictive power of the approach promises to streamline the functional analysis of known and novel RNAs.
Mammalian genomes are pervasively transcribed, yielding a complex transcriptome with high variability in composition and cellular abundance. While recent efforts have identified thousands of new long non-coding (lnc) RNAs and demonstrated a complex transcriptional repertoire produced by protein-coding (pc) genes, limited progress has been made in distinguishing functional RNA from spurious transcription events. This is partly due to present RNA classification, which is typically based on technical rather than biochemical criteria. Here we devise a strategy to systematically categorize human RNAs by their sensitivity to the ribonucleolytic RNA exosome complex and by the nature of their transcription initiation. These measures are surprisingly effective at correctly classifying annotated transcripts, including lncRNAs of known function. The approach also identifies uncharacterized stable lncRNAs, hidden among a vast majority of unstable transcripts. The predictive power of the approach promises to streamline the functional analysis of known and novel RNAs.
BackgroundThe role of HNF4α has been extensively studied in hepatocytes and pancreatic β-cells, and HNF4α is also regarded as a key regulator of intestinal epithelial cell differentiation. The aim of the present work is to identify novel HNF4α target genes in the human intestinal epithelial cells in order to elucidate the role of HNF4α in the intestinal differentiation progress.MethodsWe have performed a ChIP-chip analysis of the human intestinal cell line Caco-2 in order to make a genome-wide identification of HNF4α binding to promoter regions. The HNF4α ChIP-chip data was matched with gene expression and histone H3 acetylation status of the promoters in order to identify HNF4α binding to actively transcribed genes with an open chromatin structure.Results1,541 genes were identified as potential HNF4α targets, many of which have not previously been described as being regulated by HNF4α. The 1,541 genes contributed significantly to gene ontology (GO) pathways categorized by lipid and amino acid transport and metabolism. An analysis of the homeodomain transcription factor Cdx-2 (CDX2), the disaccharidase trehalase (TREH), and the tight junction protein cingulin (CGN) promoters verified that these genes are bound by HNF4α in Caco2 cells. For the Cdx-2 and trehalase promoters the HNF4α binding was verified in mouse small intestine epithelium.ConclusionThe HNF4α regulation of the Cdx-2 promoter unravels a transcription factor network also including HNF1α, all of which are transcription factors involved in intestinal development and gene expression.
Inflammatory bowel disease (IBD) is a chronic intestinal disorder, with two main types: Crohn’s disease (CD) and ulcerative colitis (UC), whose molecular pathology is not well understood. The majority of IBD-associated SNPs are located in non-coding regions and are hard to characterize since regulatory regions in IBD are not known. Here we profile transcription start sites (TSSs) and enhancers in the descending colon of 94 IBD patients and controls. IBD-upregulated promoters and enhancers are highly enriched for IBD-associated SNPs and are bound by the same transcription factors. IBD-specific TSSs are associated to genes with roles in both inflammatory cascades and gut epithelia while TSSs distinguishing UC and CD are associated to gut epithelia functions. We find that as few as 35 TSSs can distinguish active CD, UC, and controls with 85% accuracy in an independent cohort. Our data constitute a foundation for understanding the molecular pathology, gene regulation, and genetics of IBD.
Metabolically healthy obese subjects display preserved insulin sensitivity and a beneficial white adipose tissue gene expression pattern. However, this observation stems from fasting studies when insulin levels are low. We investigated adipose gene expression by 5'Cap-mRNA sequencing in 17 healthy non-obese (NO), 21 insulin-sensitive severely obese (ISO), and 30 insulin-resistant severely obese (IRO) subjects, before and 2 hr into a hyperinsulinemic euglycemic clamp. ISO and IRO subjects displayed a clear but globally similar transcriptional response to insulin, which differed from the small effects observed in NO subjects. In the obese, 231 genes were altered; 71 were enriched in ISO subjects (e.g., phosphorylation processes), and 52 were enriched in IRO subjects (e.g., cellular stimuli). Common cardio-metabolic risk factors and gender do not influence these findings. This study demonstrates that differences in the acute transcriptional response to insulin are primarily driven by obesity per se, challenging the notion of healthy obese adipose tissue, at least in severe obesity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.