Unsupervised machine learning reveals risk stratifying glioblastoma tumor cells

Leelatian, Nalin; Sinnaeve, Justine; Mistry, Akshitkumar M.; Barone, Sierra; Brockman, Asa; Diggins, Kirsten E.; Greenplate, Allison R.; Weaver, Kyle D.; Thompson, Reid C.; Chambless, Lola B.; Mobley, Bret C.; Ihrie, Rebecca A.; Irish, Jonathan M.

doi:10.7554/elife.56879

Cited by 24 publications

(45 citation statements)

References 76 publications


“…Here we used the t-SNE algorithm as a core method to reduce the dimensionality of the dataset and to visualize our data. t-SNE has been widely used in the unsupervised analysis of many types of biological data (Berman et al, 2014; Kollmorgen et al, 2020; Chen et al, 2020; Macosko et al, 2015; Kobak and Berens, 2019; Leelatian et al, 2020), including neural recordings (Dimitriadis et al, 2018). t-SNE minimizes the Kullback-Leibler divergence between a Gaussian distribution modeling pairwise distances between data points and a Student t-distribution modeling distances between the same points in a low (typically two) dimensional embedding (Van der Maaten and Hinton, 2008; Linderman and Steinerberger, 2019).…”

Section: Discussion (mentioning)

confidence: 99%

Mapping circuit dynamics during function and dysfunction

Gorur-Shandilya, Cronin, et al. 2021 (Preprint)


Neural circuits can generate many spike patterns, but only some are functional. The study of how circuits generate and maintain functional dynamics is hindered by a poverty of description of circuit dynamics across functional and dysfunctional states. For example, although the regular oscillation of a central pattern generator is well characterized by its frequency and the phase relationships between its neurons, these metrics are ineffective descriptors of the irregular and aperiodic dynamics that circuits can generate under perturbation or in disease states. By recording the circuit dynamics of the well-studied pyloric circuit in C. borealis, we used statistical features of spike times from neurons in the circuit to visualize the spike patterns generated by this circuit under a variety of conditions. This unsupervised approach captures both the variability of functional rhythms and the diversity of atypical dynamics in a single map. Clusters in the map identify qualitatively different spike patterns hinting at different dynamical states in the circuit. State probability and the statistics of the transitions between states varied with environmental perturbations, removal of descending neuromodulation, and the addition of exogenous neuromodulators. This analysis reveals strong mechanistically interpretable links between complex changes in the collective behavior of a neural circuit and specific experimental manipulations, and can constrain hypotheses of how circuits generate functional dynamics despite variability in circuit architecture and environmental perturbations.
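For reference, the t-SNE objective summarized in the citation statement above can be written out as a short sketch, following the standard formulation of Van der Maaten and Hinton (2008). The symbols are assumptions introduced here and do not appear in the quoted text: x_i are the high-dimensional data points, y_i their low-dimensional embedding coordinates, sigma_i a per-point Gaussian bandwidth set by the perplexity, and n the number of points.

```latex
% Gaussian similarities in the original space (conditional, then symmetrized)
p_{j \mid i} = \frac{\exp\!\left(-\lVert x_i - x_j \rVert^2 / 2\sigma_i^2\right)}
                    {\sum_{k \neq i} \exp\!\left(-\lVert x_i - x_k \rVert^2 / 2\sigma_i^2\right)},
\qquad
p_{ij} = \frac{p_{j \mid i} + p_{i \mid j}}{2n}

% Student t (one degree of freedom) similarities in the embedding
q_{ij} = \frac{\left(1 + \lVert y_i - y_j \rVert^2\right)^{-1}}
              {\sum_{k \neq l} \left(1 + \lVert y_k - y_l \rVert^2\right)^{-1}}

% Cost minimized over the embedding coordinates y_1, \dots, y_n
C = \mathrm{KL}(P \,\Vert\, Q) = \sum_{i \neq j} p_{ij} \log \frac{p_{ij}}{q_{ij}}
```

The heavy-tailed Student t kernel in the embedding lets moderately distant points sit far apart in the map, which is why t-SNE plots tend to show well-separated clusters.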


“…Marker Enrichment Modeling from the MEM package (https://github.com/cytolab/mem) was used to characterize feature enrichment in KNN region around each cell. MEM normally requires a comparison of a population against a reference control, such as a common reference sample (Diggins et al, 2017), all other cells (Diggins et al, 2018; Leelatian et al, 2020), or induced pluripotent stem cells (Greenplate et al, 2019). Here, a statistical reference point intended as a statistical null hypothesis was used as the MEM reference.…”

Section: MEM Analysis of Enriched Features (mentioning)

confidence: 99%

“…Analysis algorithms typically rely on aggregate statistics for groups of cells, but the process of grouping the cells works best with larger, established populations (Diggins et al, 2015; Irish et al, 2006; Saeys et al, 2016) or may include pre-filtering of cells by human experts (Greenplate et al, 2016a; Greenplate et al, 2019). Cytometry tools like SPADE (Bendall et al, 2011; Qiu et al, 2011), FlowSOM (Van Gassen et al, 2015), Phenograph (Levine et al, 2015), Citrus (Bruggner et al, 2014), and RAPID (Leelatian et al, 2020) generally work best to characterize cell subsets representing >1% of the sample and are less capable of capturing extremely rare cells or subsets distinguished by only a fraction of measured features. Tools like t-SNE (Amir el et al, 2013; Krijthe et al, 2015), opt-SNE (Belkina et al, 2019), and UMAP (Becht et al, 2018; McInnes et al, 2018) embed cells or learn a manifold and represent these transformations as algorithmically-generated axes.…”

Section: Introduction (mentioning)

confidence: 99%

“…Tools like t-SNE (Amir el et al, 2013; Krijthe et al, 2015), opt-SNE (Belkina et al, 2019), and UMAP (Becht et al, 2018; McInnes et al, 2018) embed cells or learn a manifold and represent these transformations as algorithmically-generated axes. In addition to assisting with data visualization, these tools frequently reveal unexpected cells and facilitate their identification through manual or automated clustering (Amir el et al, 2013; Becher et al, 2014; Diggins et al, 2015; Diggins et al, 2017; Gandelman et al, 2019; Leelatian et al, 2020). Sconify (Burns et al, 2018) is one such tool that applies k-nearest neighbors (KNN) to calculate aggregate statistics for the immediate phenotypic neighborhood around a given cell on a t-SNE plot that combines data from multiple cytometry samples.…”

Section: Introduction (mentioning)

confidence: 99%
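The KNN neighborhood statistic described in the statement above can be illustrated with a minimal sketch. This is not the Sconify implementation or the citing authors' code. The function name `knn_fraction_from_sample`, its parameters, and the choice of statistic (the fraction of a cell's neighbors that come from one sample) are all assumptions made for illustration. It uses a brute-force distance matrix, so it only suits small inputs.

```python
import numpy as np


def knn_fraction_from_sample(embedding, sample_labels, k, target_label):
    """For each cell, return the fraction of its k nearest neighbors
    (excluding the cell itself) that belong to `target_label`.

    Hypothetical helper for illustration only: `embedding` is an
    (n_cells, n_dims) array such as 2D t-SNE coordinates, and
    `sample_labels` gives the sample of origin for each cell.
    """
    embedding = np.asarray(embedding, dtype=float)
    labels = np.asarray(sample_labels)
    n_cells = embedding.shape[0]
    if not 0 < k < n_cells:
        raise ValueError("k must satisfy 0 < k < number of cells")

    # Pairwise Euclidean distances between all cells (O(n^2) memory).
    diff = embedding[:, None, :] - embedding[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))

    # Exclude each cell from its own neighborhood.
    np.fill_diagonal(dist, np.inf)

    # Indices of the k closest cells for every cell.
    neighbor_idx = np.argsort(dist, axis=1)[:, :k]

    # Aggregate statistic: share of neighbors drawn from the target sample.
    is_target = labels == target_label
    return is_target[neighbor_idx].mean(axis=1)
```

Calling the function on coordinates pooled from two samples gives one value per cell. Values near 0 or 1 mark neighborhoods dominated by a single sample, and values near 0.5 mark neighborhoods shared evenly between the two.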

“…For a biologist, these tools provide a way to organize cells according to phenotypic relationships that span multiple measured features, such as the proteins quantified on each of millions of cells in the datasets here. In addition to assisting with data visualization, these tools frequently reveal unexpected cells and facilitate their identification through manual or automated clustering [6, 15, 16, 21–23]. Sconify [24] is one such tool that applies k-nearest neighbors (KNN) to calculate aggregate statistics for the immediate phenotypic neighborhood around a given cell on a t-SNE plot representing data from multiple cytometry samples.…”

Section: Introduction (mentioning)

confidence: 99%


Unsupervised machine learning reveals key immune cell subsets in COVID-19, rhinovirus infection, and cancer therapy

Barone, Paul, Muehling, et al. 2020 (Preprint, self-citation)


For an emerging disease like COVID-19, systems immunology tools may quickly identify and quantitatively characterize cells associated with disease progression or clinical response. With repeated sampling, immune monitoring creates a real-time portrait of the cells reacting to a novel virus before disease specific knowledge and tools are established. However, single cell analysis tools can struggle to reveal rare cells that are under 0.1% of the population. Here, the machine learning workflow Tracking Responders Expanding (T-REX) was created to identify changes in both very rare and common cells in diverse human immune monitoring settings. T-REX identified cells that were highly similar in phenotype and localized to hotspots of significant change during rhinovirus and SARS-CoV-2 infections. MHC tetramers were not used during unsupervised analysis and instead "left out" to serve as a test of whether T-REX identifies biologically significant cells. In the rhinovirus challenge study, T-REX identified virus-specific CD4+ T cells based on these cells being a distinct phenotype that expanded by ≥95% following infection. T-REX successfully identified hotspots with virus-specific T cells using pairs of samples comparing Day 7 of infection to samples taken either after clearing the infection (Day 28) or samples taken prior to infection (Day 0). Mapping pairwise comparisons in samples according to both the direction and degree of change provided a framework to compare systems level immune changes during infectious disease or therapy response. This revealed that the magnitude and direction of systemic immune change in some COVID-19 patients was comparable to that of blast crisis acute myeloid leukemia patients undergoing induction chemotherapy and characterized the identity of the immune cells that changed the most. Other COVID-19 patients instead matched an immune trajectory like that of individuals with rhinovirus infection or melanoma patients receiving checkpoint inhibitor therapy. T-REX analysis of paired blood samples provides an approach to rapidly identify and characterize mechanistically significant cells and to place emerging diseases into a systems immunology context.


Single‐cell protein profiling defines cell populations associated with triple‐negative breast cancer aggressiveness

et al. 2023


Triple‐negative breast cancer (TNBC) is an aggressive and complex subtype of breast cancer that lacks targeted therapy. TNBC manifests characteristic, extensive intratumoral heterogeneity that promotes disease progression and influences drug response. Single‐cell techniques in combination with next‐generation computation provide an unprecedented opportunity to identify molecular events with therapeutic potential. Here, we describe the generation of a comprehensive mass cytometry panel for multiparametric detection of 23 phenotypic markers and 13 signaling molecules. This single‐cell proteomic approach allowed us to explore the landscape of TNBC heterogeneity, with particular emphasis on the tumor microenvironment. We prospectively profiled freshly resected tumors from 26 TNBC patients. These tumors contained phenotypically distinct subpopulations of cancer and stromal cells that were associated with the patient's clinical status at the time of surgery. We further classified the epithelial‐mesenchymal plasticity of tumor cells, and molecularly defined phenotypically diverse populations of tumor‐associated stroma. Furthermore, in a retrospective tissue‐microarray TNBC cohort, we showed that the level of CD97 at the time of surgery has prognostic potential.

