Transposable elements (TEs) are a prolific source of tightly regulated, biochemically active non-coding elements, such as transcription factor binding sites and non-coding RNAs. A wealth of recent studies reinvigorates the idea that these elements are pervasively co-opted for the regulation of host genes. We argue that the inherent genetic properties of TEs and conflicting relationships with their hosts facilitate their recruitment for regulatory functions in diverse genomes. We review recent findings supporting the long-standing hypothesis that the waves of TE invasions endured by organisms for eons have catalyzed the evolution of gene regulatory networks. We also discuss the challenges of dissecting and interpreting the phenotypic impact of regulatory activities encoded by TEs in health and disease.
Endogenous retroviruses (ERVs) are abundant in mammalian genomes and contain sequences modulating transcription. How ERV propagation impacts the evolution of gene regulation remains poorly understood. Here we show that ERVs have shaped the evolution of a transcriptional network underlying the interferon (IFN) response, a major branch of innate immunity. We found that lineage-specific ERVs have dispersed numerous IFN-inducible enhancers independently in diverse mammalian genomes. CRISPR-Cas9 deletion of a subset of these ERV elements in the human genome impaired expression of adjacent IFN-induced genes and revealed their involvement in the regulation of essential immune functions, including activation of the AIM2 inflammasome. While these regulatory sequences likely arose in ancient viruses, they now constitute a dynamic reservoir of IFN-inducible enhancers fueling genetic innovation in mammalian immune defenses.
The mammalian placenta is remarkably distinct between species, suggesting a history of rapid evolutionary diversification1. To gain insight into the molecular drivers of placental evolution, we compared biochemically predicted enhancers between mouse and rat trophoblast stem cells (TSCs) and find that species-specific enhancers are highly enriched for endogenous retroviruses (ERVs) on a genome-wide level. One of these ERV families, RLTR13D5, contributes hundreds of mouse-specific H3K4me1/H3K27ac-defined enhancers that functionally bind Cdx2, Eomes, and Elf5 - core factors that define the TSC regulatory network. Furthermore, we demonstrate that RLTR13D5 is capable of driving gene expression in rat placental cells. Comparison with other tissues revealed that species-specific ERV enhancer activity is generally restricted to hypomethylated tissues, suggesting that tissues permissive to ERV activity gain access to an otherwise silenced source of regulatory variation. Overall, our results implicate ERV enhancer cooption as a mechanism underlying the striking evolutionary diversification of placental development.
The first stages of embryonic differentiation are driven by signaling pathways hardwired to induce particular fates. Endoderm commitment is controlled by the TGF-β superfamily member, Nodal, which utilizes the transcription factors, SMAD2/3, SMAD4 and FOXH1, to drive target gene expression. While the role of Nodal is well defined within the context of endoderm commitment, mechanistically it is unknown how this signal interacts with chromatin on a genome wide scale to trigger downstream responses. To elucidate the Nodal transcriptional network that governs endoderm formation, we used ChIP-seq to identify genomic targets for SMAD2/3, SMAD3, SMAD4, FOXH1 and the active and repressive chromatin marks, H3K4me3 and H3K27me3, in human embryonic stem cells (hESCs) and derived endoderm. We demonstrate that while SMAD2/3, SMAD4 and FOXH1 associate with DNA in a highly dynamic fashion, there is an optimal bivalent signature at 32 gene loci for driving endoderm commitment. Initially, this signature is marked by both H3K4me3 and H3K27me3 as a very broad bivalent domain in hESCs. Within the first 24h, SMAD2/3 accumulation coincides with H3K27me3 reduction so that these loci become monovalent marked by H3K4me3. JMJD3, a histone demethylase, is simultaneously recruited to these promoters, suggesting a conservation of mechanism at multiple promoters genome-wide. The correlation between SMAD2/3 binding, monovalent formation and transcriptional activation suggests a mechanism by which SMAD proteins coordinate with chromatin at critical promoters to drive endoderm specification.
Discovery of lineage-specific somatic copy number variation (CNV) in mammals has led to debate over whether CNVs are mutations that propagate disease or whether they are a normal, and even essential, aspect of cell biology. We show that 1,000N polyploid trophoblast giant cells (TGCs) of the mouse placenta contain 47 regions, totaling 138 Megabases, where genomic copies are underrepresented (UR). UR domains originate from a subset of late-replicating heterochromatic regions containing gene deserts and genes involved in cell adhesion and neurogenesis. While lineage-specific CNVs have been identified in mammalian cells, classically in the immune system where V(D)J recombination occurs, we demonstrate that CNVs form during gestation in the placenta by an underreplication mechanism, not by recombination nor deletion. Our results reveal that large scale CNVs are a normal feature of the mammalian placental genome, which are regulated systematically during embryogenesis and are propagated by a mechanism of underreplication.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.