2020
DOI: 10.1093/bioinformatics/btaa843
|View full text |Cite
|
Sign up to set email alerts
|

SCIM: universal single-cell matching with unpaired feature sets

Abstract: Motivation Recent technological advances have led to an increase in the production and availability of single-cell data. The ability to integrate a set of multi-technology measurements would allow the identification of biologically or clinically meaningful observations through the unification of the perspectives afforded by each technology. In most cases, however, profiling technologies consume the used cells and thus pairwise correspondences between datasets are lost. Due to the sheer size s… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
49
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 50 publications
(54 citation statements)
references
References 30 publications
0
49
0
Order By: Relevance
“…Several ML approaches have been developed for that purpose, for instance by characterizing cells across measurements, projecting multiple measurements into a common latent space or learning the missing modalities. Transcriptomics is typically one of the modalities that is integrated, together with chromatin accessibility [69, 52, 70], DNA [71], DNA methylation [72, 52], proteomic data [73, 74, 69, 75, 76] or CRISPR perturbations [77 • , 78].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Several ML approaches have been developed for that purpose, for instance by characterizing cells across measurements, projecting multiple measurements into a common latent space or learning the missing modalities. Transcriptomics is typically one of the modalities that is integrated, together with chromatin accessibility [69, 52, 70], DNA [71], DNA methylation [72, 52], proteomic data [73, 74, 69, 75, 76] or CRISPR perturbations [77 • , 78].…”
Section: Introductionmentioning
confidence: 99%
“…While these methods can in theory be applied to any bi-modal omics dataset, hyperparameter selection is difficult when no co-assay data is available for MMD-MA, SCOT and UnionCOM. Among models that do not require co-assay data, some use weak supervision such as SCIM [72], an adversarial AE model that assumes that the cell types are known for a fraction of the cells and Seurat v3 [52], a canonical correlation analysis (CCA)-based model that relies on building anchor cells using mutual nearest neighbours. Applied to single-cell CRISPR screenings, scMAGeCK [78] relies on statistical analyses and MUSIC [77 • ] on topic modeling in order to link gene perturbations to cell phenotype.…”
Section: Introductionmentioning
confidence: 99%
“…At the omics feature level, presumed feature interactions have been used via feature conversion 15,16 or coupled matrix factorization 18,19 . At the cell level, known cell types have also been used via (semi)supervised learning 46,47 , but this approach incurs substantial limitations in terms of applicability since such supervision is typically unavailable and in many cases serves as the purpose of multi-omics integration per se 25 . Notably, one of these methods was proposed with a similar autoencoder architecture and adversarial alignment 47 , but it relied on matched cell types or clusters to orient the alignment.…”
Section: Discussionmentioning
confidence: 99%
“…Single-Cell Data Integration via Matching (SCIM) (Stark, Ficek et al 2020) is another deep generative approach that constructs a technology-invariant latent space to recover cell correspondences among datasets, even with unpaired feature sets. The architecture is a modified auto-encoder with an integrated discriminator network, similar to the one in GANs, allowing the network to be trained in an adversarial manner.…”
Section: Deep Learning In the Integration Of Single-cell Multimodal Omics Datamentioning
confidence: 99%