Single cell transcriptomics has transformed the characterization of brain cell identity by providing quantitative molecular signatures for large, unbiased samples of brain cell populations. With the proliferation of taxonomies based on individual datasets, a major challenge is to integrate and validate results toward defining biologically meaningful cell types. We used a battery of single-cell transcriptome and epigenome measurements generated by the BRAIN Initiative Cell Census Network (BICCN) to comprehensively assess the molecular signatures of cell types in the mouse primary motor cortex (MOp). We further developed computational and statistical methods to integrate these multimodal data and quantitatively validate the reproducibility of the cell types. The reference atlas, based on more than 600,000 high quality single-cell or -nucleus samples assayed by six molecular modalities, is a comprehensive molecular account of the diverse neuronal and non-neuronal cell types in MOp.Collectively, our study indicates that the mouse primary motor cortex contains over 55 neuronal cell types that are highly replicable across analysis methods, sequencing technologies, and modalities. We find many concordant multimodal markers for each cell type, as well as thousands of genes and gene regulatory elements with discrepant transcriptomic and epigenomic signatures. These data highlight the complex molecular regulation of brain cell types and will directly enable design of reagents to target specific MOp cell types for functional analysis.
IntroductionNeural circuits are characterized by extraordinary diversity of their cellular components 1,2 . Single-cell molecular assays, especially transcriptomic measurements by RNA-Seq, have accelerated the discovery and characterization of cell types across brain regions and in diverse species. Recent advances include single-cell transcriptome datasets with >10 5 individual cells, identifying hundreds of neuronal and non-neuronal cell types across the mouse nervous system 3-5 . As the number of profiled cells grows into the millions, a key question is whether these data will converge toward a comprehensive and coherent taxonomy of cell types with broad utility for organizing knowledge of brain cells and their function. Data from different modalities, including transcriptomic and epigenomic data, must be cross-referenced and integrated to establish robust and consistent cell type classifications.Although a comprehensive atlas should incorporate anatomical and physiological information, the high throughput of single cell sequencing assays makes integration of molecular data a particularly urgent challenge. A rigorous and reproducible consensus molecular atlas of brain cell types would drive progress across modalities, including obtaining functional information.Single cell sequencing technologies can measure multiple molecular signatures of cell identity. The core molecular identity of a cell is largely established during development and maintained by a combination of gene regulatory proteins...