Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.
Most of the mammalian genome is transcribed. This generates a vast repertoire of transcripts that includes protein-coding messenger RNAs, long non-coding RNAs (lncRNAs) and repetitive sequences, such as SINEs (short interspersed nuclear elements). A large percentage of ncRNAs are nuclear-enriched with unknown function. Antisense lncRNAs may form sense-antisense pairs by pairing with a protein-coding gene on the opposite strand to regulate epigenetic silencing, transcription and mRNA stability. Here we identify a nuclear-enriched lncRNA antisense to mouse ubiquitin carboxy-terminal hydrolase L1 (Uchl1), a gene involved in brain function and neurodegenerative diseases. Antisense Uchl1 increases UCHL1 protein synthesis at a post-transcriptional level, thereby identifying a new functional class of lncRNAs. Antisense Uchl1 activity depends on the presence of a 5' overlapping sequence and an embedded inverted SINEB2 element. These features are shared by other natural antisense transcripts and can confer regulatory activity to an artificial antisense to green fluorescent protein. Antisense Uchl1 function is under the control of stress signalling pathways, as mTORC1 inhibition by rapamycin causes an increase in UCHL1 protein that is associated with the shuttling of antisense Uchl1 RNA from the nucleus to the cytoplasm. Antisense Uchl1 RNA is then required for the association of the overlapping sense protein-coding mRNA with active polysomes for translation. These data reveal another layer of gene expression control at the post-transcriptional level.
In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at single base-pair resolution, and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousand samples, consisting of a variety of primary cells, tissues, cell lines, and time-series samples taken during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The pipeline started with measuring RNA extracts to assess their quality, and continued through CAGE library production using a robotic or a manual workflow, single-molecule sequencing, and computational processing to generate frequencies of transcription initiation. The resulting data represent the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, respectively, were identified and annotated to provide the precise locations of known promoters as well as novel ones, and to quantify their activities.
The genetic determinism of type-1 diabetes in NOD mice likely involves complementary defects in central T cell tolerance induction and peripheral immunoregulation. To study the contribution of the NOD genetic background to central tolerance, we followed the behavior of BDC2.5 clonotype thymocytes in fetal thymic organ cultures (FTOC). The NOD genetic background encodes a quantitative deficiency in the ability to delete these self-reactive thymocytes and to divert them to the CD8αα lineage. In genetic analyses, comparing NOD and B6.H2g7 FTOCs, the NOD defect incorporated the influence of several loci (notably ones on chr1 and 3). Microarray analyses assessing FTOCs from the same two strains argued that the NOD abnormality reflects the combined effects of turning down the gene expression program that provokes apoptosis and turning on a new program promoting cell survival. Intersection of the data from the two approaches points to a small set of attractive candidate genes.