2022
DOI: 10.1101/2022.02.24.481684
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Dictionary learning for integrative, multimodal, and scalable single-cell analysis

Abstract: Mapping single-cell sequencing profiles to comprehensive reference datasets represents a powerful alternative to unsupervised analysis. Reference datasets, however, are predominantly constructed from single-cell RNA-seq data, and cannot be used to annotate datasets that do not measure gene expression. Here we introduce 'bridge integration', a method to harmonize single-cell datasets across modalities by leveraging a multi-omic dataset as a molecular bridge. Each cell in the multi-omic dataset comprises an elem… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
163
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 138 publications
(171 citation statements)
references
References 80 publications
0
163
0
Order By: Relevance
“…In contrast, sparse matrices have been used for high-performance computing for a long time (Orchard-Eays 1956; Markowitz 1957), and can drastically reduce the memory overhead required to perform memory-intensive computations. While recently developed “sketching” procedures (Hao et al 2022) that subsample matrix operations for scalable computation may provide workarounds for dense matrices, we believe that sparsity will remain an important consideration for normalization transformations for the foreseeable future.…”
Section: Resultsmentioning
confidence: 99%
“…In contrast, sparse matrices have been used for high-performance computing for a long time (Orchard-Eays 1956; Markowitz 1957), and can drastically reduce the memory overhead required to perform memory-intensive computations. While recently developed “sketching” procedures (Hao et al 2022) that subsample matrix operations for scalable computation may provide workarounds for dense matrices, we believe that sparsity will remain an important consideration for normalization transformations for the foreseeable future.…”
Section: Resultsmentioning
confidence: 99%
“…3h ). Cluster assignments were further supported via orthogonal annotation utilizing cell mapping through a novel multiomic bridge integration approach 54 ( Extended Data Fig. 3i; see materials and methods ).…”
Section: Resultsmentioning
confidence: 99%
“…h , Gene accessibility scores for key stem cell ( HLF, AVP, CD34, CD38 ) marker genes across clusters HSPC1 and HSPC2 (Wilcoxon rank sum test), consistent with HSPC1 representing earlier HSPCs (higher HLF, AVP and CD34 ) and HSPC2 representing later HSPCs (higher CD38 and lower HLF, AVP and CD34 ). i , Confusion matrix between manually annotated cluster labels and predicted labels based on scRNA-seq reference via bridge integration mapping 54 (see materials and methods).…”
Section: Extended Datamentioning
confidence: 99%
“…Computational strategies encounter several considerations as how to define anchors, scalability and handling missing data ( 43 ). Several of these challenges are being addressed by recently developed tools including MOFA+ ( 24 ), multiVI ( 44 ), COBOLT ( 45 ), StabMap ( 46 ) scMVP ( 47 ), and Bridge Integration ( 48 ). So-far there was no illustration of integrating mRNA datasets with a comprehensive intracellular phospho-protein and transcription factor dataset using a common set of surface proteins.…”
Section: Discussionmentioning
confidence: 99%