Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.
High-throughput mapping of cellular differentiation hierarchies from single-cell data promises to empower systematic interrogations of vertebrate development and disease. Here, we applied single-cell RNA sequencing to >92,000 cells from zebrafish embryos during the first day of development. Using a graph-based approach, we mapped a cell state landscape that describes axis patterning, germ layer formation, and organogenesis. We tested how clonally related cells traverse this landscape by developing a transposon-based barcoding approach (“TracerSeq”) for reconstructing single-cell lineage histories. Clonally related cells were often restricted by the state landscape, including a case in which two independent lineages converge on similar fates. Cell fates remained restricted to this landscape in chordin-deficient embryos. We provide web-based resources for further analysis of the single-cell data.
Although it is generally accepted that cellular differentiation requires changes to transcriptional networks, dynamic regulation of promoters and enhancers at specific sets of genes has not been previously studied en masse. Exploiting the fact that active promoters and enhancers are transcribed, we simultaneously measured their activity in 19 human and 14 mouse time courses covering a wide range of cell types and biological stimuli. Enhancer RNAs, then messenger RNAs encoding transcription factors, dominated the earliest responses. Binding sites for key lineage transcription factors were simultaneously overrepresented in enhancers and promoters active in each cellular system. Our data support a highly generalizable model in which enhancer transcription is the earliest event in successive waves of transcriptional change during cellular differentiation or activation.
Time series of single-cell transcriptome measurements can reveal dynamic features of cell differentiation pathways. From measurements of whole frog embryos spanning zygotic genome activation through early organogenesis, we derived a detailed catalog of cell states in vertebrate development and a map of differentiation across all lineages over time. The inferred map recapitulates most if not all developmental relationships and associates new regulators and marker genes with each cell state. We find that many embryonic cell states appear earlier than previously appreciated. We also assess conflicting models of neural crest development. Incorporating a matched time series of zebrafish development from a companion paper, we reveal conserved and divergent features of vertebrate early developmental gene expression programs.
Schizophrenia (SZ) is a complex disease characterized by impaired neuronal functioning. Although defective alternative splicing has been linked to SZ, the molecular mechanisms responsible are unknown. Additionally, there is limited understanding of the early transcriptomic responses to neuronal activation. Here, we profile these transcriptomic responses and show that long non-coding RNAs (lncRNAs) are dynamically regulated by neuronal activation, including acute downregulation of the lncRNA Gomafu, previously implicated in brain and retinal development. Moreover, we demonstrate that Gomafu binds directly to the splicing factors QKI and SRSF1 (serine/arginine-rich splicing factor 1) and dysregulation of Gomafu leads to alternative splicing patterns that resemble those observed in SZ for the archetypal SZ-associated genes DISC1 and ERBB4. Finally, we show that Gomafu is downregulated in post-mortem cortical gray matter from the superior temporal gyrus in SZ. These results functionally link activity-regulated lncRNAs and alternative splicing in neuronal function and suggest that their dysregulation may contribute to neurological disorders.
Only relatively recently has it become clear that mammalian genomes encode tens of thousands of long non-coding RNAs (lncRNAs). A striking 40% of these are expressed specifically in the brain, where they show precisely regulated temporal and spatial expression patterns. This begs the question, what is the functional role of these many lncRNA transcripts in the brain? Here we canvass a growing number of mechanistic studies that have elucidated central roles for lncRNAs in the regulation of nervous system development and function. We also survey studies indicating that neurological and psychiatric disorders may ensue when these mechanisms break down. Finally, we synthesize these insights with evidence from comparative genomics to argue that lncRNAs may have played important roles in brain evolution, by virtue of their abundant sequence innovation in mammals and plausible mechanistic connections to the adaptive processes that occurred recently in the primate and human lineages.
In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities.
Down syndrome (DS) is the most frequent cause of human congenital mental retardation. Cognitive deficits in DS result from perturbations of normal cellular processes both during development and in adult tissues, but the mechanisms underlying DS etiology remain poorly understood. To assess the ability of induced pluripotent stem cells (iPSCs) to model DS phenotypes, as a prototypical complex human disease, we generated bona fide DS and wild-type (WT) nonviral iPSCs by episomal reprogramming. DS iPSCs selectively overexpressed chromosome 21 genes, consistent with gene dosage, which was associated with deregulation of thousands of genes throughout the genome. DS and WT iPSCs were neurally converted at >95% efficiency and had remarkably similar lineage potency, differentiation kinetics, proliferation, and axon extension at early time points. However, at later time points DS cultures showed a twofold bias toward glial lineages. Moreover, DS neural cultures were up to two times more sensitive to oxidative stress-induced apoptosis, and this could be prevented by the antioxidant N-acetylcysteine. Our results reveal a striking complexity in the genetic alterations caused by trisomy 21 that are likely to underlie DS developmental phenotypes, and indicate a central role for defective early glial development in establishing developmental defects in DS brains. Furthermore, oxidative stress sensitivity is likely to contribute to the accelerated neurodegeneration seen in DS, and we provide proof of concept for screening corrective therapeutics using DS iPSCs and their derivatives. Nonviral DS iPSCs can therefore model features of complex human disease in vitro and provide a renewable and ethically unencumbered discovery platform.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.