Multi-modal single-cell technologies capable of simultaneously assaying gene expression and surface phenotype across large numbers of immune cells have described extensive heterogeneity within these complex populations, in healthy and diseased states. In the case of T cells, these technologies have made it possible to profile clonotype, defined by T cell receptor (TCR) sequence, and phenotype, as reflected in gene expression (GEX) profile, surface protein expression, and peptide:MHC (pMHC) binding, across large and diverse cell populations. These rich, high-dimensional datasets have the potential to reveal new relationships between TCR sequence and T cell phenotype that go beyond identification of features shared by clonally related cells. In order to uncover these connections in an unbiased way, we developed a graph-theoretic approach---clonotype neighbor-graph analysis or "CoNGA"---that identifies correlations between GEX profile and TCR sequence through statistical analysis of a pair of T cell similarity graphs, one in which cells are linked based on gene expression similarity and another in which cells are linked by similarity of TCR sequence. Applying CoNGA across diverse human and mouse T cell datasets uncovered known and novel associations between TCR sequence features and cellular phenotype including the classical invariant T cell subsets; a novel defined population of human blood CD8+ T cells expressing the transcription factors HOBIT and HELIOS, NK-associated receptors, and a biased TCR repertoire, representing a potential previously undescribed lineage of "natural lymphocytes"; a striking association between usage of a specific V-beta gene segment and expression of the EPHB6 gene that is conserved between mouse and human; and TCR sequence determinants of differentiation in developing thymocytes. As the size and scale of single-cell datasets continue to grow, we expect that CoNGA will prove to be a useful tool for deconvolving complex relationships between TCR sequence and cellular state in single-cell applications..
Multi-modal single-cell technologies capable of simultaneously assaying gene expression and surface phenotype across large numbers of immune cells have described extensive heterogeneity within these complex populations, in healthy and diseased states. In the case of T cells, these technologies have made it possible to profile clonotype, defined by T cell receptor (TCR) sequence, and phenotype, as reflected in gene expression (GEX) profile, surface protein expression, and peptide:MHC (pMHC) binding, across large and diverse cell populations. These rich, high-dimensional datasets have the potential to reveal new relationships between TCR sequence and T cell phenotype that go beyond identification of features shared by clonally related cells. In order to uncover these connections in an unbiased way, we developed a graph-theoretic approach---clonotype neighbor-graph analysis or "CoNGA"---that identifies correlations between GEX profile and TCR sequence through statistical analysis of a pair of T cell similarity graphs, one in which cells are linked based on gene expression similarity and another in which cells are linked by similarity of TCR sequence. Applying CoNGA across diverse human and mouse T cell datasets uncovered known and novel associations between TCR sequence features and cellular phenotype including the classical invariant T cell subsets; a novel defined population of human blood CD8+ T cells expressing the transcription factors HOBIT and HELIOS, NK-associated receptors, and a biased TCR repertoire, representing a potential previously undescribed lineage of "natural lymphocytes"; a striking association between usage of a specific V-beta gene segment and expression of the EPHB6 gene that is conserved between mouse and human; and TCR sequence determinants of differentiation in developing thymocytes. As the size and scale of single-cell datasets continue to grow, we expect that CoNGA will prove to be a useful tool for deconvolving complex relationships between TCR sequence and cellular state in single-cell applications.
Recent advances now allow for deep simultaneous profiling of T cell clonotypes, defined by T cell receptor (TCR) sequence, and phenotype, as reflected in gene expression (GEX) profile, surface protein expression, and epitope binding at the single-cell-level. However, there currently few tools available for unsupervised discovery of relationships between TCR sequence and cell phenotype. We hypothesized that by identifying correlations between “TCR neighborhoods”, defined by shared TCR sequence and GEX features, we could move beyond simply measuring GEX variation within clonal descendants and identify novel associations between T cell specificities and states. Previously, we introduced TCRdist, a measure for assessing inter-TCR similarity capable of identifying closely related clonotypes based on shared sequence features. Using TCRdist to quantify TCR similarity, we developed a graph-theoretic approach—clonotype neighbor-graph analysis or “CoNGA”—that identifies correlations between GEX profile and TCR sequence in an unbiased and automated manner through statistical analysis of GEX and TCR similarity graphs. Applying CoNGA, we uncovered novel associations between TCR and GEX space including a previously undescribed “natural lymphocyte” population of human blood CD8+ T cells; an association between TRBV gene usage and EPHB6 expression; and TCR sequence determinants of differentiation in thymocytes. These examples demonstrate that CoNGA is able to effectively deconvolve complex relationships between TCR sequence and cellular state. Conceptually, CoNGA could be extended to other clonally-related populations (e.g. B cells, tumors), and can easily incorporate other measurable features (e.g. ATAC-Seq).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.