Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.
The ADAR RNA-editing enzymes deaminate adenosine bases to inosines in cellular RNAs. Aberrant interferon expression occurs in patients in whom ADAR1 mutations cause Aicardi-Goutières syndrome (AGS) or dystonia arising from striatal neurodegeneration. Adar1 mutant mouse embryos show aberrant interferon induction and die by embryonic day E12.5. We demonstrate that Adar1 embryonic lethality is rescued to live birth in Adar1; Mavs double mutants in which the antiviral interferon induction response to cytoplasmic double-stranded RNA (dsRNA) is prevented. Aberrant immune responses in Adar1 mutant mouse embryo fibroblasts are dramatically reduced by restoring the expression of editing-active cytoplasmic ADARs. We propose that inosine in cellular RNA inhibits antiviral inflammatory and interferon responses by altering RLR interactions. Transfecting dsRNA oligonucleotides containing inosine-uracil base pairs into Adar1 mutant mouse embryo fibroblasts reduces the aberrant innate immune response. ADAR1 mutations causing AGS affect the activity of the interferon-inducible cytoplasmic isoform more severely than the nuclear isoform.
Argonaute proteins play a major part in transcriptional gene silencing in many organisms, but their role in the nucleus of somatic mammalian cells remains elusive. Here, we have immunopurified human Argonaute-1 and Argonaute-2 (AGO1 and AGO2) chromatin-embedded proteins and found them associated with chromatin modifiers and, notably, with splicing factors. Using the CD44 gene as a model, we show that AGO1 and AGO2 facilitate spliceosome recruitment and modulate RNA polymerase II elongation rate, thereby affecting alternative splicing. Proper AGO1 and AGO2 recruitment to CD44 transcribed regions required the endonuclease Dicer and the chromobox protein HP1γ, and resulted in increased histone H3 lysine 9 methylation on variant exons. Our data thus uncover a new model for the regulation of alternative splicing, in which Argonaute proteins couple RNA polymerase II elongation to chromatin modification.
The functional repertoire of long intergenic noncoding RNA (lincRNA) molecules has begun to be elucidated in mammals. Determining the biological relevance and potential gene regulatory mechanisms of these enigmatic molecules would be expedited in a more tractable model organism, such as Drosophila melanogaster. To this end, we defined a set of 1,119 putative lincRNA genes in D. melanogaster using modENCODE whole transcriptome (RNA-seq) data. A large majority (1.1 of 1.3 Mb; 85%) of these bases were not previously reported by modENCODE as being transcribed. Significant selective constraint on the sequences of these loci predicts that virtually all have sustained functionality across the Drosophila clade. We observe biases in lincRNA genomic locations and expression profiles that are consistent with some of these lincRNAs being involved in the regulation of neighboring protein-coding genes with developmental functions. We identify lincRNAs that may be important in the developing nervous system and in male-specific organs, such as the testes. LincRNA loci were also identified whose positions, relative to nearby protein-coding loci, are equivalent between D. melanogaster and mouse. This study predicts that the genomes of not only vertebrates, such as mammals, but also an invertebrate (fruit fly) harbor large numbers of lincRNA loci. Our findings now permit exploitation of Drosophila genetics for the investigation of lincRNA mechanisms, including lincRNAs with potential functional analogues in mammals.
Long noncoding RNAs (lncRNAs) constitute the majority of transcripts in the mammalian genomes, and yet, their functions remain largely unknown. As part of the FANTOM6 project, we systematically knocked down the expression of 285 lncRNAs in human dermal fibroblasts and quantified cellular growth, morphological changes, and transcriptomic responses using Capped Analysis of Gene Expression (CAGE). Antisense oligonucleotides targeting the same lncRNAs exhibited global concordance, and the molecular phenotype, measured by CAGE, recapitulated the observed cellular phenotypes while providing additional insights on the affected genes and pathways. Here, we disseminate the largest-to-date lncRNA knockdown data set with molecular phenotyping (over 1000 CAGE deep-sequencing libraries) for further exploration and highlight functional roles for ZNF213-AS1 and lnc-KHDC3L-2.
Promoters are central to the regulation of gene expression. Changes in gene regulation are thought to underlie much of the adaptive diversification between species and phenotypic variation within populations. In contrast to earlier work emphasizing the importance of enhancer evolution and subtle sequence changes at promoters, we show that dramatic changes such as the complete gain and loss (collectively, turnover) of functional promoters are common. Using quantitative measures of transcription initiation in both humans and mice across 52 matched tissues, we discriminate promoter sequence gains from losses and resolve the lineage of changes. We also identify expression divergence and functional turnover between orthologous promoters, finding only the latter is associated with local sequence changes. Promoter turnover has occurred at the majority (>56%) of protein-coding genes since humans and mice diverged. Tissue-restricted promoters are the most evolutionarily volatile where retrotransposition is an important, but not the sole, source of innovation. There is considerable heterogeneity of turnover rates between promoters in different tissues, but the consistency of these in both lineages suggests that the same biological systems are similarly inclined to transcriptional rewiring. The genes affected by promoter turnover show evidence of adaptive evolution. In mice, promoters are primarily lost through deletion of the promoter-containing sequence, whereas in humans, many promoters appear to be gradually decaying with weak transcriptional output and relaxed selective constraint. Our results suggest that promoter gain and loss is an important process in the evolutionary rewiring of gene regulation and may be a significant source of phenotypic diversification.
Phenotypic differences between individuals and species are controlled in part through differences in expression of a relatively conserved set of genes. Genes expressed in the immune system are subject to especially powerful selection. We have investigated the evolution of both gene expression and candidate enhancers in human and mouse macrophages exposed to glucocorticoid (GC), a regulator of innate immunity and an important therapeutic agent. Our analyses revealed a very limited overlap in the repertoire of genes responsive to GC in human and mouse macrophages. Peaks of inducible binding of the glucocorticoid receptor (GR) detected by ChIP-Seq correlated with induction, but not repression, of target genes in both species, occurred at distal regulatory sites rather than promoters, and were strongly enriched for the consensus GR binding motif. Turnover of GR binding between mouse and human was associated with gain and loss of the motif. There was no detectable signal of positive selection at species-specific GR binding sites, but clear evidence of purifying selection at the small number of conserved sites. We conclude that enhancer divergence underlies the difference in transcriptional activation after GC treatment between mouse and human macrophages. Only the shared inducible loci show evidence of selection and therefore these loci may be important for the subset of responses to GC that is shared between species.