Patterns of gene expression are primarily determined by proteins that locally enhance or repress transcription. While many transcription factors target a restricted number of genes, others appear to modulate transcription levels globally. An example is MeCP2, an abundant methylated-DNA binding protein that is mutated in the neurological disorder Rett syndrome. Despite much research, the molecular mechanism by which MeCP2 regulates gene expression is not fully resolved. Here, we integrate quantitative, multidimensional experimental analysis and mathematical modeling to indicate that MeCP2 is a global transcriptional regulator whose binding to DNA creates “slow sites” in gene bodies. We hypothesize that waves of slowed-down RNA polymerase II formed behind these sites travel backward and indirectly affect initiation, reminiscent of defect-induced shockwaves in nonequilibrium physics transport models. This mechanism differs from conventional gene-regulation mechanisms, which often involve direct modulation of transcription initiation. Our findings point to a genome-wide function of DNA methylation that may account for the reversibility of Rett syndrome in mice. Moreover, our combined theoretical and experimental approach provides a general method for understanding how global gene-expression patterns are choreographed.
Mammalian genomes contain long domains with distinct average compositions of A/T versus G/C base pairs. In a screen for proteins that might interpret base composition by binding to AT-rich motifs, we identified the stem cell factor SALL4, which contains multiple zinc fingers. Mutation of the domain responsible for AT binding drastically reduced SALL4 genome occupancy and prematurely upregulated genes in proportion to their AT content. Inactivation of this single AT-binding zinc-finger cluster mimicked defects seen in Sall4 null cells, including precocious differentiation of embryonic stem cells (ESCs) and embryonic lethality in mice. In contrast, deletion of two other zinc-finger clusters was phenotypically neutral. Our data indicate that loss of pluripotency is triggered by downregulation of SALL4, leading to de-repression of a set of AT-rich genes that promotes neuronal differentiation. We conclude that base composition is not merely a passive byproduct of genome evolution and constitutes a signal that aids control of cell fate.
DNA methylation is implicated in neuronal biology via the protein MeCP2, the mutation of which causes Rett syndrome. MeCP2 recruits the NCOR1/2 co-repressor complexes to methylated cytosine in the CG dinucleotide, but also to sites of non-CG methylation, which are abundant in neurons. To test the biological significance of the dual-binding specificity of MeCP2, we replaced its DNA binding domain with an orthologous domain from MBD2, which can only bind mCG motifs. Knockin mice expressing the domain-swap protein displayed severe Rett-syndrome-like phenotypes, indicating that normal brain function requires the interaction of MeCP2 with sites of non-CG methylation, specifically mCAC. The results support the notion that the delayed onset of Rett syndrome is due to the simultaneous post-natal accumulation of mCAC and its reader MeCP2. Intriguingly, genes dysregulated in both Mecp2 null and domain-swap mice are implicated in other neurological disorders, potentially highlighting targets of relevance to the Rett syndrome phenotype.
MeCP2 is an abundant protein in mature nerve cells, where it binds to DNA sequences containing methylated cytosine. Mutations in the MECP2 gene cause the severe neurological disorder Rett syndrome (RTT), provoking intensive study of the underlying molecular mechanisms. Multiple functions have been proposed, one of which involves a regulatory role in splicing. Here we leverage the recent availability of high-quality transcriptomic data sets to probe quantitatively the potential influence of MeCP2 on alternative splicing. Using a variety of machine learning approaches that can capture both linear and non-linear associations, we show that widely different levels of MeCP2 have a minimal effect on alternative splicing in three different systems. Alternative splicing was also apparently indifferent to developmental changes in DNA methylation levels. Our results suggest that regulation of splicing is not a major function of MeCP2. They also highlight the importance of multi-variate quantitative analyses in the formulation of biological hypotheses.
Gene expression patterns depend on the interaction of diverse transcription factors with their target genes. While many factors have a restricted number of targets, some appear to affect transcription globally. An example of the latter is MeCP2, an abundant chromatin-associated protein that is mutated in the neurological disorder Rett syndrome. To understand how MeCP2 affects transcription, we integrated mathematical modelling with quantitative experimental analysis of human neurons expressing graded levels of MeCP2. We first used a model of MeCP2-DNA binding to demonstrate that changes in gene expression reflect MeCP2 density downstream of transcription initiation. We then tested five biologically plausible hypotheses for the effect of MeCP2 on transcription. The only model compatible with the data involved slowing down of RNA polymerase II by MeCP2, causing reduced transcript output due to polymerase queueing. Our general approach may prove fruitful in deciphering the mechanisms by which other global regulators choreograph gene expression.
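The queueing idea in this abstract can be illustrated with a toy totally asymmetric simple exclusion process (TASEP), a standard nonequilibrium transport model in which particles hop in one direction along a lattice and cannot overtake one another. The sketch below is not the authors' model: the function name `simulate_tasep`, the lattice size, and every rate are hypothetical values chosen only for illustration. It shows how a single slow site can build a queue that reaches back to the entry point and lowers the output flux.

```python
import random


def simulate_tasep(n_sites=50, alpha=0.3, beta=1.0, slow_site=None,
                   slow_rate=0.1, sweeps=5000, burn_in=1000, seed=0):
    """Toy TASEP with random sequential updates (illustrative only).

    alpha: entry ("initiation") probability per attempt.
    beta: exit ("termination") probability per attempt.
    slow_site: index of a site that particles leave with probability
        slow_rate instead of 1.0 (ignored if it is the last site,
        because leaving the last site is governed by beta).
    Returns (exit flux per sweep, mean occupancy of each site).
    """
    rng = random.Random(seed)
    hop = [1.0] * n_sites              # probability of hopping out of each site
    if slow_site is not None:
        hop[slow_site] = slow_rate
    lattice = [0] * n_sites            # 1 = site occupied by a polymerase
    exits = 0
    occupancy = [0] * n_sites

    for sweep in range(sweeps):
        # One sweep = n_sites + 1 randomly chosen update attempts.
        for _ in range(n_sites + 1):
            i = rng.randrange(n_sites + 1)
            if i == 0:
                # Entry attempt at the first site.
                if lattice[0] == 0 and rng.random() < alpha:
                    lattice[0] = 1
            elif i == n_sites:
                # Exit attempt from the last site.
                if lattice[-1] == 1 and rng.random() < beta:
                    lattice[-1] = 0
                    if sweep >= burn_in:
                        exits += 1
            else:
                # Hop from site i-1 to site i if the target is empty.
                if (lattice[i - 1] == 1 and lattice[i] == 0
                        and rng.random() < hop[i - 1]):
                    lattice[i - 1] = 0
                    lattice[i] = 1
        if sweep >= burn_in:
            for k in range(n_sites):
                occupancy[k] += lattice[k]

    measured = sweeps - burn_in
    return exits / measured, [o / measured for o in occupancy]


if __name__ == "__main__":
    flux_free, dens_free = simulate_tasep()
    flux_slow, dens_slow = simulate_tasep(slow_site=30)
    print("flux without slow site:", round(flux_free, 3))
    print("flux with slow site:   ", round(flux_slow, 3))
    print("mean upstream density without slow site:",
          round(sum(dens_free[:30]) / 30, 3))
    print("mean upstream density with slow site:   ",
          round(sum(dens_slow[:30]) / 30, 3))
```

In this sketch the slow site caps the flux below what entry alone would allow, so particles accumulate upstream until the first site is usually occupied and entry attempts fail. That is the sense in which a defect in the gene body can indirectly reduce initiation in a model of this kind.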
Spalt-like 4 (SALL4) maintains vertebrate embryonic stem cell identity and is required for the development of multiple organs, including limbs. Mutations in SALL4 are associated with Okihiro syndrome, and SALL4 is also a known target of thalidomide. SALL4 protein has a distinct preference for AT-rich sequences, recognised by a pair of zinc fingers at the C-terminus. However, unlike many characterised zinc finger proteins, SALL4 shows flexible recognition, with many different combinations of AT-rich sequences being targeted. SALL4 interacts with the NuRD corepressor complex, which potentially mediates repression of AT-rich genes. We present a crystal structure of SALL4 C-terminal zinc fingers with an AT-rich DNA sequence, which shows that SALL4 uses small hydrophobic and polar side chains to provide flexible recognition in the major groove. Missense mutations reported in patients that lie within the C-terminal zinc fingers reduced overall binding to DNA but not the preference for AT-rich sequences. Furthermore, these mutations altered association of SALL4 with AT-rich genomic sites, providing evidence that these mutations are likely pathogenic.