2022
DOI: 10.3389/fmolb.2022.962644
|View full text |Cite
|
Sign up to set email alerts
|

The performance of deep generative models for learning joint embeddings of single-cell multi-omics data

Abstract: Recent extensions of single-cell studies to multiple data modalities raise new questions regarding experimental design. For example, the challenge of sparsity in single-omics data might be partly resolved by compensating for missing information across modalities. In particular, deep learning approaches, such as deep generative models (DGMs), can potentially uncover complex patterns via a joint embedding. Yet, this also raises the question of sample size requirements for identifying such patterns from single-ce… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
8
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 75 publications
0
8
0
Order By: Relevance
“…This reconstruction of “highly probable signaling networks” is often based on scRNA-seq data, without exact pairing transcriptomic quantification with probability-based protein identification ( 30 ). Models capable of indexing both transcriptomes and epitopes by sequencing, such as CITE-seq, already exist ( 31 ) and will help to combine RNA-data and protein abundance in a contextual manner. However, as Alexander F. Schier has already asked: … “Is “landscapes” even the proper analogy for multidimensional phenotypic complexity?…”
Section: Decrypting Cellular Communications In Situ ...mentioning
confidence: 99%
“…This reconstruction of “highly probable signaling networks” is often based on scRNA-seq data, without exact pairing transcriptomic quantification with probability-based protein identification ( 30 ). Models capable of indexing both transcriptomes and epitopes by sequencing, such as CITE-seq, already exist ( 31 ) and will help to combine RNA-data and protein abundance in a contextual manner. However, as Alexander F. Schier has already asked: … “Is “landscapes” even the proper analogy for multidimensional phenotypic complexity?…”
Section: Decrypting Cellular Communications In Situ ...mentioning
confidence: 99%
“…For a dataset to be embeddable, it must have a column containing information that uniquely identifies that product, such as product ID [29]. The identifier ensures that the embeddings can be accurately linked with the corresponding products during training, querying, or making predictions thereby allowing for the preservation of the integrity of the embeddings and their association back to the original products in the dataset [30].…”
Section: What Are Embeddings and How Are They Generated?mentioning
confidence: 99%
“…In another study, Brombacher et al . compared the performance of several deep-learning-based joint representation learners as a function of the number of cells in the datasets [ 27 ]. This study was focused on single-cell data and did not compare to any well-established linear methods.…”
Section: Introductionmentioning
confidence: 99%