Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.
This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
The aims of our study were to verify whether it was possible to generate in vitro, from different adult human tissues, a population of cells that behaved, in culture, as multipotent stem cells and if these latter shared common properties. To this purpose, we grew and cloned finite cell lines obtained from adult human liver, heart, and bone marrow and named them human multipotent adult stem cells (hMASCs). Cloned hMASCs, obtained from the 3 different tissues, expressed the pluripotent state-specific transcription factors Oct-4, NANOG, and REX1, displayed telomerase activity, and exhibited a wide range of differentiation potential, as shown both at a morphologic and functional level. hMASCs maintained a human diploid DNA content, and shared a common gene expression signature, compared with several somatic cell lines and irrespectively of the tissue of isolation. In particular, the pathways regulating stem cell self-renewal/maintenance, such as Wnt, Hedgehog, and Notch, were transcriptionally active. Our findings demonstrate that we have optimized an in vitro protocol to generate and expand cells from multiple organs that could be induced to acquire morphologic and func- IntroductionThe presently accumulated evidence indicates that adult bone marrow (BM) contains at least 2 populations of stem cells: hematopoietic stem cells (HSCs) and mesenchymal stem cells (MSCs), responsible for the generation of the BM microenvironment. 1 Intriguingly, several reports have demonstrated the ability of MSCs to differentiate toward derivatives of germ layers other than mesoderm. [2][3][4][5][6] Although it is still unclear whether widely multipotent cells do exist in vivo and if they play a significant role in tissue repair and turnover, the ability to generate in vitro cells that, under defined culture conditions, display a very high developmental plasticity is nonetheless of important clinical relevance.Until now, the most convincing evidence, although debated, 7 of the possibility to grow in culture a population of widely multipotent cells in humans has been obtained only for BM, 8 while a similar feature has been just postulated for other adult human tissues. 9 We therefore planned to verify if human multipotent adult stem cells (hMASCs) could be produced from other adult human organs on top of BM, and we used this latter as a control/reference tissue.By systematically using a highly reproducible method, we were able to grow in culture cell lines from adult human liver, heart, and BM. These cell lines, once cloned at single-cell level, maintained the in vitro properties of parental lines, including the capability to differentiate into morphologically mature and functionally competent cells, even of tissues embryologically not related to the one of origin.Finally, we performed a comparative in vitro analysis on hMASCs originated from the 3 different sources with respect to immunophenotype, growth kinetics, specific transcriptional settings, telomerase activity, and global gene expression profile. Altogether the obtained result...
In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.