2022
DOI: 10.1038/s41467-022-28431-4
|View full text |Cite
|
Sign up to set email alerts
|

UINMF performs mosaic integration of single-cell multi-omic datasets using nonnegative matrix factorization

Abstract: Single-cell genomic technologies provide an unprecedented opportunity to define molecular cell types in a data-driven fashion, but present unique data integration challenges. Many analyses require “mosaic integration”, including both features shared across datasets and features exclusive to a single experiment. Previous computational integration approaches require that the input matrices share the same number of either genes or cells, and thus can use only shared features. To address this limitation, we derive… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
55
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(67 citation statements)
references
References 49 publications
1
55
0
Order By: Relevance
“…To compare scMoMaT with baseline methods, we ran two recently published methods which can also work with this integration scenario: MultiMap 21 and UINMF 16 (Methods, visualization of latent embedding in Figs. S1a,b), and quantitatively measured the overall performance of all four methods with Graph connectivity (GC) score, NMI score, and ARI score (cite, Methods) using the label in the original data paper as ground truth.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To compare scMoMaT with baseline methods, we ran two recently published methods which can also work with this integration scenario: MultiMap 21 and UINMF 16 (Methods, visualization of latent embedding in Figs. S1a,b), and quantitatively measured the overall performance of all four methods with Graph connectivity (GC) score, NMI score, and ARI score (cite, Methods) using the label in the original data paper as ground truth.…”
Section: Resultsmentioning
confidence: 99%
“…It does not discuss how to utilize the information from all modalities for the batches that are measured with more than one modality. UINMF 16 uses a matrix bi-factorization framework to integrate data matrices with shared and unshared features. UINMF works with most mosaic integration cases, but it focuses on learning cell embedding and does not simultaneously learn a feature embedding along with the marker features of cell identities.…”
Section: Introductionmentioning
confidence: 99%
“…Both approaches explore efficient algorithms, but do not explicitly provide models associating molecular features to the ‘latent space’. MOFA ( Argelaguet et al , 2020 ), scAI ( Jin et al , 2020 ), totalVI ( Gayoso et al , 2021 ) and LIGER ( Kriebel and Welch, 2022 ) explore distinct methods for matrix factorization and estimation of shared latent spaces between modalities. Moreover, estimated matrices can be used for model interpretation, i.e.…”
Section: Introductionmentioning
confidence: 99%
“…To achieve such goals, it is necessary to (identify and) align cells in comparable states across related datasets. Yet another important application is the transfer and integration of complementary biological information across datasets: for example, if one dataset contains individual cells' spatial information within a tissue, matching it with a non-spatial single-cell dataset bears the potential of transferring spatial information to a different measurement modality (e.g., [43,23]).…”
Section: Introductionmentioning
confidence: 99%