The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologies. Here, real biological differences are interspersed with technical differences. We present Harmony, an algorithm that projects cells into a shared embedding in which cells group by cell type rather than dataset-specific conditions. Harmony simultaneously accounts for multiple experimental and biological factors. In six analyses, we demonstrate the superior performance of Harmony to previously published algorithms. We show that Harmony requires dramatically fewer computational resources. It is the only currently available algorithm that makes the integration of ~10 6 cells feasible on a personal computer. We apply Harmony to PBMCs from datasets with large experimental differences, 5 studies of pancreatic islet cells, mouse embryogenesis datasets, and cross-modality spatial integration. Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:
The rapidly emerging diversity of single cell RNAseq datasets allows us to characterize the transcriptional behav-1 ior of cell types across a wide variety of biological and clinical conditions. With this comprehensive breadth comes a major 2 analytical challenge. The same cell type across tissues, from different donors, or in different disease states, may appear 3 to express different genes. A joint analysis of multiple datasets requires the integration of cells across diverse conditions. 4 This is particularly challenging when datasets are assayed with different technologies in which real biological differences 5 are interspersed with technical differences. We present Harmony, an algorithm that projects cells into a shared embedding 6 in which cells group by cell type rather than dataset-specific conditions. Unlike available single-cell integration methods, 7 Harmony can simultaneously account for multiple experimental and biological factors. We develop objective metrics to 8 evaluate the quality of data integration. In four separate analyses, we demonstrate the superior performance of Harmony to 9 four single-cell-specific integration algorithms. Moreover, we show that Harmony requires dramatically fewer computational 10 resources. It is the only available algorithm that makes the integration of ∼ 10 6 cells feasible on a personal computer. We 11 demonstrate that Harmony identifies both broad populations and fine-grained subpopulations of PBMCs from datasets with 12 large experimental differences. In a meta-analysis of 14,746 cells from 5 studies of human pancreatic islet cells, Harmony 13 accounts for variation among technologies and donors to successfully align several rare subpopulations. In the resulting in-14 tegrated embedding, we identify a previously unidentified population of potentially dysfunctional alpha islet cells, enriched 15 for genes active in the Endoplasmic Reticulum (ER) stress response. The abundance of these alpha cells correlates across 16 donors with the proportion of dysfunctional beta cells also enriched in ER stress response genes. Harmony is a fast and 17 flexible general purpose integration algorithm that enables the identification of shared fine-grained subpopulations across a 18 variety of experimental and biological conditions. 19Recent technological advances 1 have enabled unbiased single cell transcriptional profiling of thousands of cells in a 20 single experiment. Projects such as the Human Cell Atlas 2 (HCA) and Accelerating Medicines Partnership 3, 4 exemplify 21 the growing body of reference datasets of primary human tissues. While individual experiments contribute incrementally 22 to our understanding of cell types, a comprehensive catalogue of healthy and diseased cells will require the integration of 23 multiple datasets across donors, studies, and technological platforms. Moreover, in translational research, joint analyses 24 across tissues and clinical conditions will be essential to identify disease expanded populations. However, meaningful 25 biological variatio...
Summary The identification of lymphocyte subsets with non-overlapping effector functions has been pivotal to the development of targeted therapies in immune mediated inflammatory diseases (IMIDs)1,2. However it remains unclear whether fibroblast subclasses with non-overlapping functions also exist and are responsible for the wide variety of tissue driven processes observed in IMIDs such as inflammation and damage3–5. Here we identify and describe the biology of distinct subsets of fibroblasts responsible for mediating either inflammation or tissue damage in arthritis. We show that deletion of FAPα+ fibroblasts suppressed both inflammation and bone erosions in murine models of resolving and persistent arthritis. Single cell transcriptional analysis identified two distinct fibroblast subsets within the FAPα+ population: FAPα+ THY1+ immune effector fibroblasts located in the synovial sub-lining, and FAPα+ THY1- destructive fibroblasts restricted to the synovial lining layer. When adoptively transferred into the joint, FAPα+ THY1- fibroblasts selectively mediate bone and cartilage damage with little effect on inflammation, whereas transfer of FAPα+ THY1+ fibroblasts resulted in a more severe and persistent inflammatory arthritis, with minimal effect on bone and cartilage. Our findings describing anatomically discrete, functionally distinct fibroblast subsets with non-overlapping functions have important implications for cell based therapies aimed at modulating inflammation and tissue damage.
The synovium is a mesenchymal tissue composed mainly of fibroblasts with a lining and sublining that surrounds the joints. In rheumatoid arthritis (RA), the synovial tissue undergoes marked hyperplasia, becomes inflamed and invasive and destroys the joint 1 , 2 . Recently, we and others found that a subset of fibroblasts located in the sublining undergoes major expansion in RA and is linked to disease activity 3 , 4 , 5 . However, the molecular mechanism by which these fibroblasts differentiate and expand in RA remains unknown. Here, we identified a critical role for NOTCH3 signaling in the differentiation of perivascular and sublining CD90( THY1 )+ fibroblasts. Using single cell RNA-sequencing and synovial tissue organoids, we found that NOTCH3 signaling drives both transcriptional and spatial gradients in fibroblasts emanating from vascular endothelial cells outward. In active RA, NOTCH3 and NOTCH target genes are markedly upregulated in synovial fibroblasts. Importantly, genetic deletion of Notch3 or monoclonal antibody-blockade of NOTCH3 signaling attenuates inflammation and prevents joint damage in inflammatory arthritis. Our results indicate that synovial fibroblasts exhibit positional identity regulated by endothelium-derived Notch signaling and that this stromal crosstalk pathway underlies inflammation and pathology in inflammatory arthritis.
Stable BioNetGen releases (Linux, Mac OS/X and Windows), with documentation, are available at http://bionetgen.org Source code is available at http://github.com/RuleWorld/bionetgen CONTACT: bionetgen.help@gmail.comSupplementary information: Supplementary data are available at Bioinformatics online.
Supplementary data are available at Bioinformatics online.
How innate T cells (ITC), including invariant natural killer T (iNKT) cells, mucosal-associated invariant T (MAIT) cells, and γδ T cells, maintain a poised effector state has been unclear. Here we address this question using low-input and single-cell RNA-seq of human lymphocyte populations. Unbiased transcriptomic analyses uncover a continuous ‘innateness gradient’, with adaptive T cells at one end, followed by MAIT, iNKT, γδ T and natural killer cells at the other end. Single-cell RNA-seq reveals four broad states of innateness, and heterogeneity within canonical innate and adaptive populations. Transcriptional and functional data show that innateness is characterized by pre-formed mRNA encoding effector functions, but impaired proliferation marked by decreased baseline expression of ribosomal genes. Together, our data shed new light on the poised state of ITC, in which innateness is defined by a transcriptionally-orchestrated trade-off between rapid cell growth and rapid effector function.
High dimensional single-cell analyses have improved the ability to resolve complex mixtures of cells from human disease samples; however, identifying disease-associated cell types or cell states in patient samples remains challenging due to technical and inter-individual variation. Here we present Mixed-effects modeling of Associations of Single Cells (MASC), a reverse single cell association strategy for testing whether case-control status influences the membership of single cells in any of multiple cellular subsets while accounting for technical confounders and biological variation. Applying MASC to mass cytometry analyses of CD4+ T cells from the blood of rheumatoid arthritis (RA) patients and controls revealed a significantly expanded population of CD4+ T cells, identified as CD27- HLA-DR+ effector memory cells, in RA patients (OR = 1.7; p = 1.1 × 10−3). The frequency of CD27- HLA-DR+ cells was similarly elevated in blood samples from a second RA patient cohort, and CD27- HLA-DR+ cell frequency decreased in RA patients who responded to immunosuppressive therapy. Mass cytometry and flow cytometry analyses indicated that CD27- HLA-DR+ cells were associated with RA (meta-analysis p = 2.3 × 10−4). Compared to peripheral blood, synovial fluid and synovial tissue samples from RA patients contained ~5-fold higher frequencies of CD27- HLA-DR+ cells, which comprised ~10% of synovial CD4+ T cells. CD27- HLA-DR+ cells expressed a distinctive effector memory transcriptomic program with Th1- and cytotoxicity-associated features, and produced abundant IFN-γ and granzyme A protein upon stimulation. We propose that MASC is a broadly applicable method to identify disease-associated cell populations in high-dimensional single cell data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.