High-throughput single-cell sequencing technologies hold tremendous potential for defining cell types in an unbiased fashion using gene expression and epigenomic state. A key challenge in realizing this potential is integrating single-cell datasets from multiple protocols, biological contexts, and data modalities into a joint definition of cellular identity. We previously developed an approach called Linked Inference of Genomic Experimental Relationships (LIGER) that uses integrative nonnegative matrix factorization to address this challenge. Here, we provide a step-by-step protocol for using LIGER to jointly define cell types from multiple single-cell datasets. The main steps of the protocol include data preprocessing and normalization, joint factorization, quantile normalization and joint clustering, and visualization. We describe how to jointly define cell types from single-cell RNA-seq and single-nucleus ATAC-seq data, but similar steps apply across a wide range of other settings and data types, including cross-species analysis, single-nucleus DNA methylation, and spatial transcriptomics. Our protocol contains examples of expected results, describes common pitfalls, and relies only on our freely available, open-source R implementation of LIGER. We also provide Rmarkdown tutorials showing the outputs from each individual code segment. The analysis process can be performed in 1 -4 h depending on dataset size and assumes no specialized bioinformatics training..
Integrating large single-cell gene expression, chromatin accessibility and DNA methylation datasets requires general and scalable computational approaches. Here we describe online integrative nonnegative matrix factorization (iNMF), an algorithm for integrating large, diverse, and continually arriving single-cell datasets. Our approach scales to arbitrarily large numbers of cells using fixed memory, iteratively incorporates new datasets as they are generated, and allows many users to simultaneously analyze a single copy of a large dataset by streaming it over the internet. Iterative data addition can also be used to map new data to a reference dataset. Comparisons with previous methods indicate that the improvements in efficiency do not sacrifice dataset alignment and cluster preservation performance. We demonstrate the effectiveness of online iNMF by integrating more than a million cells on a standard laptop, integrating large single-cell RNA-seq and spatial transcriptomic datasets, and iteratively constructing a single-cell multi-omic atlas of the mouse motor cortex.
Mutant isocitrate-dehydrogenase 1 (mIDH1) synthesizes the oncometabolite 2-hydroxyglutarate (2HG), which elicits epigenetic reprogramming of the glioma cells' transcriptome by inhibiting DNA and histone demethylases. We show that the efficacy of immune-stimulatory gene therapy (TK/Flt3L) is enhanced in mIDH1 gliomas, due to the reprogramming of the myeloid cells' compartment infiltrating the tumor microenvironment (TME). We uncovered that the immature myeloid cells infiltrating the mIDH1 TME are mainly nonsuppressive neutrophils and preneutrophils. Myeloid cell reprogramming was triggered by granulocyte colony-stimulating factor (G-CSF) secreted by mIDH1 glioma stem/progenitor-like cells. Blocking G-CSF in mIDH1 glioma-bearing mice restores the inhibitory potential of the tumor-infiltrating myeloid cells, accelerating tumor progression. We demonstrate that G-CSF reprograms bone marrow granulopoiesis, resulting in noninhibitory myeloid cells within mIDH1 glioma TME and enhancing the efficacy of immune-stimulatory gene therapy.
The bone marrow contains various populations of skeletal stem cells (SSCs) in the stromal compartment, which are important regulators of bone formation. It is well-described that leptin receptor (LepR)+ perivascular stromal cells provide a major source of bone-forming osteoblasts in adult and aged bone marrow. However, the identity of SSCs in young bone marrow and how they coordinate active bone formation remains unclear. Here we show that bone marrow endosteal SSCs are defined by fibroblast growth factor receptor 3 (Fgfr3) and osteoblast-chondrocyte transitional (OCT) identities with some characteristics of bone osteoblasts and chondrocytes. These Fgfr3-creER-marked endosteal stromal cells contribute to a stem cell fraction in young stages, which is later replaced by Lepr-cre-marked stromal cells in adult stages. Further, Fgfr3+ endosteal stromal cells give rise to aggressive osteosarcoma-like lesions upon loss of p53 tumor suppressor through unregulated self-renewal and aberrant osteogenic fates. Therefore, Fgfr3+ endosteal SSCs are abundant in young bone marrow and provide a robust source of osteoblasts, contributing to both normal and aberrant osteogenesis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.