We mapped Polycomb-associated H3K27 trimethylation (H3K27me3) and Trithorax-associated H3K4 trimethylation (H3K4me3) across the whole genome in human embryonic stem (ES) cells. The vast majority of H3K27me3 colocalized on genes modified with H3K4me3. These commodified genes displayed low expression levels and were enriched in developmental function. Another significant set of genes lacked both modifications and was also expressed at low levels in ES cells but was enriched for gene function in physiological responses rather than development. Commodified genes could change expression levels rapidly during differentiation, but so could a substantial number of genes in other modification categories. SOX2, POU5F1, and NANOG, pluripotency-associated genes, shifted from modification by H3K4me3 alone to colocalization of both modifications as they were repressed during differentiation. Our results demonstrate that H3K27me3 modifications change during early differentiation, both relieving existing repressive domains and imparting new ones, and that colocalization with H3K4me3 is not restricted to pluripotent cells.
The core promoter of eukaryotic genes is the minimal DNA region that recruits the basal transcription machinery to direct efficient and accurate transcription initiation. The fraction of human and yeast genes that contain specific core promoter elements such as the TATA box and the initiator (INR) remains unclear and core promoter motifs specific for TATA-less genes remain to be identified. Here, we present genome-scale computational analyses indicating that ∼76% of human core promoters lack TATA-like elements, have a high GC content, and are enriched in Sp1 binding sites. We further identify two motifs -M3 (SCGGAAGY) and M22 (TGCGCANK) -that occur preferentially in human TATA-less core promoters. About 24% of human genes have a TATA-like element and their promoters are generally AT-rich; however, only ∼10% of these TATA-containing promoters have the canonical TATA box (TATAWAWR). In contrast, ∼46% of human core promoters contain the consensus INR (YYANWYY) and ∼30% are INR-containing TATA-less genes. Significantly, ∼46% of human promoters lack both TATA-like and consensus INR elements. Surprisingly, mammalian-type INR sequences are present -and tend to cluster -in the transcription start site (TSS) region of ∼40% of yeast core promoters and the frequency of specific core promoter types appears to be conserved in yeast and human genomes. Gene Ontology analyses reveal that TATA-less genes in humans, as in yeast, are frequently involved in basic "housekeeping" processes, while TATAcontaining genes are more often highly regulated, such as by biotic or stress stimuli. These results reveal unexpected similarities in the occurrence of specific core promoter types and in their associated biological processes in yeast and humans and point to novel vertebrate-specific DNA motifs that might play a selective role in TATA-independent transcription.
Epithelial formation is a central facet of organogenesis that relies on intercellular junction assembly to create functionally distinct apical and basal cell surfaces. How this process is regulated during embryonic development remains obscure. Previous studies using conditional knockout mice have shown that loss of hepatocyte nuclear factor 4␣ (HNF4␣) blocks the epithelial transformation of the fetal liver, suggesting that HNF4␣ is a central regulator of epithelial morphogenesis. Although HNF4␣-null hepatocytes do not express E-cadherin (also called CDH1), we show here that E-cadherin is dispensable for liver development, implying that HNF4␣ regulates additional aspects of epithelial formation. Microarray and molecular analyses reveal that HNF4␣ regulates the developmental expression of a myriad of proteins required for cell junction assembly and adhesion. Our findings define a fundamental mechanism through which generation of tissue epithelia during development is coordinated with the onset of organ function.cell junctions ͉ organogenesis ͉ transcription
Hepatocyte nuclear factor 4 alpha (HNF4α), a member of the nuclear receptor superfamily, is essential for liver function and is linked to several diseases including diabetes, hemophilia, atherosclerosis, and hepatitis. Although many DNA response elements and target genes have been identified for HNF4α the complete repertoire of binding sites and target genes in the human genome is unknown. Here, we adapt protein binding microarrays (PBMs) to examine the DNA-binding characteristics of two HNF4α species (rat and human) and isoforms (HNF4α2 and HNF4α8) in a high-throughput fashion. We identified ~1400 new binding sequences and used this dataset to successfully train a Support Vector Machine (SVM)model that predicts an additional ~10,000 unique HNF4α-binding sequences; we also identify new rules for HNF4α DNA binding. We performed expression profiling of an HNF4α RNA interference knockdown in HepG2 cells and compared the results to a search of the promoters of all human genes with the PBM and SVM models, as well as published genome-wide location analysis. Using this integrated approach, we identified ~240 new direct HNF4α human target genes, including new functional categories of genes not typically associated with HNF4α, such as cell cycle, immune function, apoptosis, stress response, and other cancer-related genes. Conclusion We report the first use of PBMs with a full-length liver-enriched transcription factor and greatly expand the repertoire of HNF4α-binding sequences and target genes, thereby identifying new functions for HNF4α. We also establish a web-based tool, HNF4 Motif Finder, that can be used to identify potential HNF4α-binding sites in any sequence.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.