We have isolated and analyzed human CTCF cDNA clones and show here that the ubiquitously expressed 11-zinc-finger factor CTCF is an exceptionally highly conserved protein displaying 93% identity between avian and human amino acid sequences. It binds specifically to regulatory sequences in the promoter-proximal regions of chicken, mouse, and human c-myc oncogenes. CTCF contains two transcription repressor domains transferable to a heterologous DNA binding domain. One CTCF binding site, conserved in mouse and human c-myc genes, is found immediately downstream of the major P2 promoter at a sequence which maps precisely within the region of RNA polymerase II pausing and release. Gel shift assays of nuclear extracts from mouse and human cells show that CTCF is the predominant factor binding to this sequence. Mutational analysis of the P2-proximal CTCF binding site and transient-cotransfection experiments demonstrate that CTCF is a transcriptional repressor of the human c-myc gene. Although there is 100% sequence identity in the DNA binding domains of the avian and human CTCF proteins, the regulatory sequences recognized by CTCF in chicken and human c-myc promoters are clearly diverged. Mutating the contact nucleotides confirms that CTCF binding to the human c-myc P2 promoter requires a number of unique contact DNA bases that are absent in the chicken c-myc CTCF binding site. Moreover, proteolytic-protection assays indicate that several more CTCF Zn fingers are involved in contacting the human CTCF binding site than the chicken site. Gel shift assays utilizing successively deleted Zn finger domains indicate that CTCF Zn fingers 2 to 7 are involved in binding to the chicken c-myc promoter, while fingers 3 to 11 mediate CTCF binding to the human promoter. This flexibility in Zn finger usage reveals CTCF to be a unique "multivalent" transcriptional factor and provides the first feasible explanation of how certain homologous genes (i.e., c-myc) of different vertebrate species are regulated by the same factor and maintain similar expression patterns despite significant promoter sequence divergence.
CTCF, a conserved, ubiquitous, and highly versatile 11-zinc-finger factor involved in various aspects of gene regulation, forms methylation-sensitive insulators that regulate X chromosome inactivation and expression of imprinted genes. We document here the existence of a paralogous gene with the same exons encoding the 11-zincfinger domain as mammalian CTCF genes and thus the same DNAbinding potential, but with distinct amino and carboxy termini. We named this gene BORIS for Brother of the Regulator of Imprinted Sites. BORIS is present only in the testis, and expressed in a mutually exclusive manner with CTCF during male germ cell development. We show here that erasure of methylation marks during male germ-line development is associated with dramatic up-regulation of BORIS and down-regulation of CTCF expression. Because BORIS bears the same DNA-binding domain that CTCF employs for recognition of methylation marks in soma, BORIS is a candidate protein for the elusive epigenetic reprogramming factor acting in the male germ line.
Chromatin insulators demarcate expression domains by blocking the cis effects of enhancers or silencers in a positiondependent manner 1,2 . We show that the chromatin insulator protein CTCF carries a post-translational modification: poly(ADP-ribosyl)ation. Chromatin immunoprecipitation analysis showed that a poly(ADP-ribosyl)ation mark, which exclusively segregates with the maternal allele of the insulator domain in the H19 imprinting control region, requires the bases that are essential for interaction with CTCF 3 . Chromatin immunoprecipitation-on-chip analysis documented that the link between CTCF and poly(ADP-ribosyl)ation extended to more than 140 mouse CTCF target sites. An insulator trap assay showed that the insulator function of most of these CTCF target sites is sensitive to 3-aminobenzamide, an inhibitor of poly(ADP-ribose) polymerase activity. We suggest that poly(ADP-ribosyl)ation imparts chromatin insulator properties to CTCF at both imprinted and nonimprinted loci, which has implications for the regulation of expression domains and their demise in pathological lesions.Poly(ADP-ribosyl)ation is traditionally associated with DNA repair and apoptosis 4 , but this view may be too limited 5,6 . For example, one of the poly(ADP-ribose) (PAR) polymerases, PARP-1, is associated both with formation of heterochromatin and with regions of high transcriptional activity in fruit flies 7 . To explore a potential correlation between poly(ADP-ribosyl)ation and expression domains in the mouse, we analyzed the allelic distribution of poly(ADP-ribosyl)ated protein complexes on the chromatin insulator at the H19 imprinting control region (ICR), which partitions expression domains in a parent of origin-specific manner 8 . We analyzed chromatin-immunoprecipitated DNA of fetal liver of M. musculus domesticus  M. musculus musculus intraspecific hybrid crosses by a PCR assay, which exploited a polymorphic BsmAI restriction site at the second CTCF target site 9 .Only the maternally inherited allele was specifically captured using a specific antibody that detects polymers containing ten or more ADP-ribose units (Fig. 1a).As the chromatin insulator protein CTCF is the only factor known to interact preferentially with the maternal H19 ICR allele in vivo 3 , we examined the interaction between poly(ADP-ribosyl)ated proteins and the H19 ICR with point-mutated CTCF target sites 10 . We carried out chromatin immunoprecipitation (ChIP) analysis of primary mouse fibroblast cultures, with the mutation inherited maternally or paternally, followed by PCR of the H19 ICR. The H19 ICR was associated with a poly(ADP-ribosyl)ation mark only if the wild-type allele was inherited maternally (Fig. 1b). Although this result suggested that the poly(ADP-ribosyl)ation mark of the maternal H19 ICR allele requires functional CTCF target sites, we could not rule out indirect effects from de novo methylation 3 . We therefore mixed equimolar amounts of plasmids containing the wild-type H19 ICR and plasmids containing the H19 ICR with mutations of CTC...
A novel sequence-specific DNA-binding protein, CTCF, which interacts with the chicken c-myc gene promoter, has been identified and partially characterized (V. V
The multifunctional zinc-finger protein CCCTC-binding factor (CTCF) is a very strong candidate for the role of coordinating the expression level of coding sequences with their three-dimensional position in the nucleus, apparently responding to a "code" in the DNA itself. Dynamic interactions between chromatin fibers in the context of nuclear architecture have been implicated in various aspects of genome functions. However, the molecular basis of these interactions still remains elusive and is a subject of intense debate. Here we discuss the nature of CTCF-DNA interactions, the CTCF-binding specificity to its binding sites and the relationship between CTCF and chromatin, and we examine data linking CTCF with gene regulation in the three-dimensional nuclear space. We discuss why these features render CTCF a very strong candidate for the role and propose a unifying model, the "CTCF code," explaining the mechanistic basis of how the information encrypted in DNA may be interpreted by CTCF into diverse nuclear functions.
CTCF is a transcription factor with highly versatile functions ranging from gene activation and repression to the regulation of insulator function and imprinting. Although many of these functions rely on CTCF-DNA interactions, it is an emerging realization that CTCF-dependent molecular processes involve CTCF interactions with other proteins. In this study, we report the association of a subpopulation of CTCF with the RNA polymerase II (Pol II) protein complex. We identified the largest subunit of Pol II (LS Pol II) as a protein significantly colocalizing with CTCF in the nucleus and specifically interacting with CTCF in vivo and in vitro. The role of CTCF as a link between DNA and LS Pol II has been reinforced by the observation that the association of LS Pol II with CTCF target sites in vivo depends on intact CTCF binding sequences. “Serial” chromatin immunoprecipitation (ChIP) analysis revealed that both CTCF and LS Pol II were present at the β-globin insulator in proliferating HD3 cells but not in differentiated globin synthesizing HD3 cells. Further, a single wild-type CTCF target site (N-Myc-CTCF), but not the mutant site deficient for CTCF binding, was sufficient to activate the transcription from the promoterless reporter gene in stably transfected cells. Finally, a ChIP-on-ChIP hybridization assay using microarrays of a library of CTCF target sites revealed that many intergenic CTCF target sequences interacted with both CTCF and LS Pol II. We discuss the possible implications of our observations with respect to plausible mechanisms of transcriptional regulation via a CTCF-mediated direct link of LS Pol II to the DNA.
The differentially methylated imprinting control region (ICR) region upstream of the H19 gene regulates allelic Igf2 expression by means of a methylation-sensitive chromatin insulator function. We have previously shown that maternal inheritance of mutated (three of the four) target sites for the 11-zinc finger protein CTCF leads to loss of Igf2 imprinting. Here we show that a mutation in only CTCF site 4 also leads to robust activation of the maternal Igf2 allele despite a noticeably weaker interaction in vitro of site 4 DNA with CTCF compared to other ICR sites, sites 1 and 3. Moreover, maternally inherited sites 1 to 3 become de novo methylated in complex patterns in subpopulations of liver and heart cells with a mutated site 4, suggesting that the methylation privilege status of the maternal H19 ICR allele requires an interdependence between all four CTCF sites. In support of this conclusion, we show that CTCF molecules bind to each other both in vivo and in vitro, and we demonstrate strong interaction between two CTCF-DNA complexes, preassembled in vitro with sites 3 and 4. We propose that the CTCF sites may cooperate to jointly maintain both methylation-free status and insulator properties of the maternal H19 ICR allele. Considering many other CTCF targets, we propose that site-specific interactions between various DNA-bound CTCF molecules may provide general focal points in the organization of looped chromatin domains involved in gene regulation.The neighboring Igf2 and H19 genes are expressed monoallelically from opposite chromosomes and are generally recognized as the paradigm of mammalian genomic imprinting (1, 11). The cis elements underlying their parent-of-origin-dependent expression patterns were traced to a differentially methylated region (imprinting control region [ICR]) located 2 kb upstream of the H19 transcriptional unit (5, 26-28). Subsequently, it was documented that repression of the paternal H19 allele was initiated, but not maintained by the H19 ICR, while the continuous presence of this region was required to suppress the maternal Igf2 allele (25). Arguments that this region contained a methylation-sensitive chromatin insulator to block communication between downstream enhancers and Igf2 promoters (3, 9, 12, 13) were later confirmed by experiments (10).Despite similar functions, the mouse and human H19 ICRs display little sequence similarity with the exception of a CpGrich element repeated four times in the mouse H19 ICR and seven times in the human H19 ICR (9). We and others have noted that the 11-zinc finger protein CTCF, which associates with all known vertebrate chromatin insulator cis elements (2, 19), interacted with these repeat elements in a methylationsensitive manner (3, 9, 13). Importantly, CTCF interacts in vivo with only the maternal H19 ICR allele (13), which is consistent with an involvement with the in vivo insulation of the maternal Igf2 from downstream lineage-specific enhancers. To confirm an in vivo role of the CTCF target sites at the Igf2/H19 locus, we previously cr...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.