Background To investigate the mechanisms driving regulatory evolution across tissues, we experimentally mapped promoters, enhancers, and gene expression in the liver, brain, muscle, and testis from ten diverse mammals. Results The regulatory landscape around genes included both tissue-shared and tissue-specific regulatory regions, where tissue-specific promoters and enhancers evolved most rapidly. Genomic regions switching between promoters and enhancers were more common across species, and less common across tissues within a single species. Long Interspersed Nuclear Elements (LINEs) played recurrent evolutionary roles: LINE L1s were associated with tissue-specific regulatory regions, whereas more ancient LINE L2s were associated with tissue-shared regulatory regions and with those switching between promoter and enhancer signatures across species. Conclusions Our analyses of the tissue-specificity and evolutionary stability among promoters and enhancers reveal how specific LINE families have helped shape the dynamic mammalian regulome.
Cancers arise through the acquisition of oncogenic mutations and grow through clonal expansion 1,2 . Here we reveal that most mutagenic DNA lesions are not resolved as mutations within a single cell-cycle. Instead, DNA lesions segregate unrepaired into daughter cells for multiple cell generations, resulting in the chromosome-scale phasing of subsequent mutations. We characterise this process in mutagen-induced mouse liver tumours and show that DNA replication across persisting lesions can produce multiple alternative alleles in successive cell divisions, thereby generating both multi-allelic and combinatorial genetic diversity. The phasing of lesions enables the accurate measurement of strand biased repair processes, the quantification of oncogenic selection, and the fine mapping of sister chromatid exchange events. Finally, we demonstrate that lesion segregation is a unifying property of exogenous mutagens, including UV light and chemotherapy agents in human cells and tumours, which has profound implications for the evolution and adaptation of cancer genomes.
Genome stability relies on proper coordination of mitosis and cytokinesis, where dynamic microtubules capture and faithfully segregate chromosomes into daughter cells. With a high-content RNAi imaging screen targeting more than 2,000 human lncRNAs, we identify numerous lncRNAs involved in key steps of cell division such as chromosome segregation, mitotic duration and cytokinesis. Here, we provide evidence that the chromatin-associated lncRNA, linc00899, leads to robust mitotic delay upon its depletion in multiple cell types. We perform transcriptome analysis of linc00899-depleted cells and identify the neuronal microtubule-binding protein, TPPP/p25, as a target of linc00899. We further show that linc00899 binds TPPP/p25 and suppresses its transcription. In cells depleted of linc00899, upregulation of TPPP/p25 alters microtubule dynamics and delays mitosis. Overall, our comprehensive screen uncovers several lncRNAs involved in genome stability and reveals a lncRNA that controls microtubule behaviour with functional implications beyond cell division.
Maximum lifespan of a species is the oldest that individuals can survive, reflecting the genetic limit of longevity in an ideal environment. Here we report methylation-based models that accurately predict maximum lifespan (r=0.89), gestational time (r=0.96), and age at sexual maturity (r=0.87), using cytosine methylation patterns collected from over 12,000 samples derived from 192 mammalian species. Our epigenetic maximum lifespan predictor corroborated the extended lifespan in growth hormone receptor knockout mice and rapamycin treated mice. Across dog breeds, epigenetic maximum lifespan correlates positively with breed lifespan but negatively with breed size. Lifespan-related cytosines are located in transcriptional regulatory regions, such as bivalent chromatin promoters and polycomb-repressed regions, which were hypomethylated in long-lived species. The epigenetic estimators of maximum lifespan and other life history traits will be useful for characterizing understudied species and for identifying interventions that extend lifespan.
Pseudogenes are ideal markers of genome remodelling. In turn, the mouse is an ideal platform for studying them, particularly with the recent availability of strain-sequencing and transcriptional data. Here, combining both manual curation and automatic pipelines, we present a genome-wide annotation of the pseudogenes in the mouse reference genome and 18 inbred mouse strains (available via the mouse.pseudogene.org resource). We also annotate 165 unitary pseudogenes in mouse, and 303, in human. The overall pseudogene repertoire in mouse is similar to that in human in terms of size, biotype distribution, and family composition (e.g. with GAPDH and ribosomal proteins being the largest families). Notable differences arise in the pseudogene age distribution, with multiple retrotranspositional bursts in mouse evolutionary history and only one in human. Furthermore, in each strain about a fifth of all pseudogenes are unique, reflecting strain-specific evolution. Finally, we find that~15% of the mouse pseudogenes are transcribed, and that highly transcribed parent genes tend to give rise to many processed pseudogenes.
Background The introduction of novel CTCF binding sites in gene regulatory regions in the rodent lineage is partly the effect of transposable element expansion, particularly in the murine lineage. The exact mechanism and functional impact of evolutionarily novel CTCF binding sites are not yet fully understood. We investigated the impact of novel subspecies-specific CTCF binding sites in two Mus genus subspecies, Mus musculus domesticus and Mus musculus castaneus, that diverged 0.5 million years ago. Results CTCF binding site evolution is influenced by the action of the B2-B4 family of transposable elements independently in both lineages, leading to the proliferation of novel CTCF binding sites. A subset of evolutionarily young sites may harbour transcriptional functionality as evidenced by the stability of their binding across multiple tissues in M. musculus domesticus (BL6), while overall the distance of subspecies-specific CTCF binding to the nearest transcription start sites and/or topologically associated domains (TADs) is largely similar to musculus-common CTCF sites. Remarkably, we discovered a recurrent regulatory architecture consisting of a CTCF binding site and an interferon gene that appears to have been tandemly duplicated to create a 15-gene cluster on chromosome 4, thus forming a novel BL6 specific immune locus in which CTCF may play a regulatory role. Conclusions Our results demonstrate that thousands of CTCF binding sites show multiple functional signatures rapidly after incorporation into the genome.
Two-thirds of gene promoters in mammals are associated with regions of non-methylated DNA, called CpG islands (CGIs), which counteract the repressive effects of DNA methylation on chromatin. In cold-blooded vertebrates, computational CGI predictions often reside away from gene promoters, suggesting a major divergence in gene promoter architecture across vertebrates. By experimentally identifying non-methylated DNA in the genomes of seven diverse vertebrates, we instead reveal that non-methylated islands (NMIs) of DNA are a central feature of vertebrate gene promoters. Furthermore, NMIs are present at orthologous genes across vast evolutionary distances, revealing a surprising level of conservation in this epigenetic feature. By profiling NMIs in different tissues and developmental stages we uncover a unifying set of features that are central to the function of NMIs in vertebrates. Together these findings demonstrate an ancient logic for NMI usage at gene promoters and reveal an unprecedented level of epigenetic conservation across vertebrate evolution.
Aberrant expression of transcription factor oncogenes is a common feature in the pathogenesis of T-cell acute lymphoblastic leukemia (T-ALL), however, oncogenic transcriptional programs downstream of T-ALL oncogenes are mostly unknown. We used chromatin immunoprecipitation combined with promoter microarrays (ChIP on chip) to identify direct transcriptional targets of TAL1/SCL, a bHLH transcription factor aberrantly expressed in 60% of patients with T-ALL. Using the Hu12K arrays that contain over 12,000 human promoter regions, we have identified 71 direct targets of TAL1 in the Jurkat T-ALL cell line by ChIP on chip, and have been able to validate 80% of them by conventional chromatin immunoprecipitation followed by quantitative real time PCR. It has previously been demonstrated that TAL1 needs to bind to E proteins (E2A, HEB or E2-2) to achieve an efficient interaction with DNA. We verified that promoters occupied by TAL1 are bound by both the E-proteins E2A and HEB in at least 50% of TAL1-bound promoter regions, suggesting that TAL1/E2A as well as TAL1/HEB heterodimers play a role in the transformation of T-cell precursors. Using RNA interference to knock down TAL1, we have demonstrated functional regulation through TAL1-binding of a significant percentage of the targets identified by ChIP on chip. Results demonstrate that some TAL1 direct targets are repressed by the binding of this transcription factor, while others seem to be activated or are not altered, indicating a high level of complexity in the organization of TAL1-mediated transcriptional regulatory networks. Also, specific association of the expression levels of TAL1 targets in TAL1 positive primary leukemia samples has been shown by comparing gene expression arrays from the different T-ALL subsets. Our results indicate that TAL1 acts as a master transcriptional regulator at the top of a complex regulatory network that contributes to leukemogenesis by regulating T-cell differentiation, proliferation and survival.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.