The primary motor cortex (M1) is essential for voluntary fine-motor control and is functionally conserved across mammals1. Here, using high-throughput transcriptomic and epigenomic profiling of more than 450,000 single nuclei in humans, marmoset monkeys and mice, we demonstrate a broadly conserved cellular makeup of this region, with similarities that mirror evolutionary distance and are consistent between the transcriptome and epigenome. The core conserved molecular identities of neuronal and non-neuronal cell types allow us to generate a cross-species consensus classification of cell types, and to infer conserved properties of cell types across species. Despite the overall conservation, however, many species-dependent specializations are apparent, including differences in cell-type proportions, gene expression, DNA methylation and chromatin state. Few cell-type marker genes are conserved across species, revealing a short list of candidate genes and regulatory mechanisms that are responsible for conserved features of homologous cell types, such as the GABAergic chandelier cells. This consensus transcriptomic classification allows us to use patch–seq (a combination of whole-cell patch-clamp recordings, RNA sequencing and morphological characterization) to identify corticospinal Betz cells from layer 5 in non-human primates and humans, and to characterize their highly specialized physiology and anatomy. These findings highlight the robust molecular underpinnings of cell-type diversity in M1 across mammals, and point to the genes and regulatory pathways responsible for the functional identity of cell types and their species-specific adaptations.
Analysis of chromatin accessibility can reveal transcriptional regulatory sequences, but heterogeneity of primary tissues poses a significant challenge in mapping the precise chromatin landscape in specific cell types. Here we report single-nucleus ATAC-seq, a combinatorial barcoding-assisted single-cell assay for transposase-accessible chromatin that is optimized for use on flash-frozen primary tissue samples. We apply this technique to the mouse forebrain through eight developmental stages. Through analysis of more than 15,000 nuclei, we identify 20 distinct cell populations corresponding to major neuronal and non-neuronal cell types. We further define cell-type-specific transcriptional regulatory sequences, infer potential master transcriptional regulators and delineate developmental changes in forebrain cellular composition. Our results provide insight into the molecular and cellular dynamics that underlie forebrain development in the mouse and establish technical and analytical frameworks that are broadly applicable to other heterogeneous tissues.
Chromatin architecture has been implicated in cell-type-specific gene regulatory programs; yet, how chromatin remodels during development remains to be fully elucidated. Here, by interrogating chromatin reorganization during human pluripotent stem cell (PSC) differentiation, we discover a role for the primate-specific endogenous retrotransposon HERV-H in creating topologically associating domains (TAD) in human PSCs. Deleting these HERV-H elements eliminates their corresponding TAD boundaries and reduces transcription of upstream genes, while de novo insertion of HERV-Hs can introduce new TAD boundaries. HERV-H’s ability to create these TAD boundaries depends on high transcription, as transcriptional repression of HERV-H elements prevents formation of these boundaries. This ability is not limited to human PSCs, as these actively transcribed HERV-Hs and their corresponding TAD boundaries also appear in PSCs from other hominids but not in more distantly related species lacking HERV-Hs. Overall, our results provide direct evidence for retrotransposons in actively shaping cell-type- and species-specific chromatin architecture.
Identification of the cis-regulatory elements controlling cell-type specific gene expression patterns is essential for understanding the origin of cellular diversity. Conventional assays to map regulatory elements via open chromatin analysis of primary tissues is hindered by sample heterogeneity. Single cell analysis of accessible chromatin (scATAC-seq) can overcome this limitation. However, the high-level noise of each single cell profile and the large volume of data pose unique computational challenges. Here, we introduce SnapATAC, a software package for analyzing scATAC-seq datasets. SnapATAC dissects cellular heterogeneity in an unbiased manner and map the trajectories of cellular states. Using the Nyström method, SnapATAC can process data from up to a million cells. Furthermore, SnapATAC incorporates existing tools into a comprehensive package for analyzing single cell ATAC-seq dataset. As demonstration of its utility, SnapATAC is applied to 55,592 single-nucleus ATAC-seq profiles from the mouse secondary motor cortex. The analysis reveals ~370,000 candidate regulatory elements in 31 distinct cell populations in this brain region and inferred candidate cell-type specific transcriptional regulators.
Focal chromosomal amplification is an important route to generating cancer through mediating over-expression of oncogenes 1 – 3 or to developing cancer therapy resistance by increasing expression of a gene whose action diminishes efficacy of an anti-cancer drug. Here we used whole-genome sequencing of clonal isolates developing chemotherapeutic resistance to identify chromothripsis as a major driver of extrachromosomal DNA (ecDNA) amplification into circular double minutes (DMs) through PARP- and DNA-PKcs-dependent mechanisms. Longitudinal analyses revealed that DMs undergo continuing structural evolution to promote increased drug tolerance through additional chromothriptic events. In-situ Hi-C sequencing is used to demonstrate that DMs preferentially tether near chromosome ends where they re-integrate when DNA damage is present. Intrachromosomal amplifications formed initially under low-level drug selection undergo continuing breakage-fusion-bridge cycles, generating >100 megabase-long amplicons that we show become trapped within interphase bridges and then shattered, producing micronuclei that mediate DM formation. Similar genome rearrangement profiles linked to localized gene amplification are identified in human cancers with acquired drug resistance or with oncogene amplifications. We propose that chromothripsis is a primary mechanism accelerating genomic DNA amplification and which enables rapid acquisition of tolerance to altered growth conditions.
Millions of cis-regulatory elements are predicted in the human genome, but direct evidence for their biological function is still scarce. Here we report a high-throughput method, Cis-Regulatory Element Scan by Tiling-deletion and sequencing (CREST-seq), for unbiased discovery and functional assessment of cis regulatory sequences in the genome. We use it to interrogate the 2Mbp POU5F1 locus in the human embryonic stem cells and identify 45 cis-regulatory elements of POU5F1. A majority of these elements display active chromatin marks, DNase hypersensitivity and occupancy by multiple transcription factors, confirming the utility of chromatin signatures in cis elements mapping. Notably, 17 of them are previously annotated promoters of functionally unrelated genes, and like typical enhancers, they form extensive spatial contacts with the POU5F1 promoter. Taken together, these results support the utility of CREST-seq for large-scale cis regulatory element discovery and point to commonality of enhancer-like promoters in the human genome.
Non-coding genetic variation is a major driver of phenotypic diversity and allows the investigation of mechanisms that control gene expression. Here, we systematically investigated the effects of >50 million variations from five strains of mice on mRNA, nascent transcription, transcription start sites, and transcription factor binding in resting and activated macrophages. We observed substantial differences associated with distinct molecular pathways. Evaluating genetic variation provided evidence for roles of ∼100 TFs in shaping lineage-determining factor binding. Unexpectedly, a substantial fraction of strain-specific factor binding could not be explained by local mutations. Integration of genomic features with chromatin interaction data provided evidence for hundreds of connected cis-regulatory domains associated with differences in transcription factor binding and gene expression. This system and the >250 datasets establish a substantial new resource for investigation of how genetic variation affects cellular phenotypes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.