2022
DOI: 10.1038/s41587-021-01206-w
|View full text |Cite
|
Sign up to set email alerts
|

A Python library for probabilistic analysis of single-cell omics data

Abstract: To the Editor -Methods for analyzing single-cell data 1-4 perform a core set of computational tasks. These tasks include dimensionality reduction, cell clustering, cell-state annotation, removal of unwanted variation, analysis of differential expression, identification of spatial patterns of gene expression, and joint analysis of multi-modal omics data. Many of these methods rely on likelihood-based models to represent variation in the data; we refer to these as 'probabilistic

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
204
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
3
2

Relationship

2
7

Authors

Journals

citations
Cited by 253 publications
(205 citation statements)
references
References 17 publications
1
204
0
Order By: Relevance
“…scAR can precisely infer the native signals for protein data in CITE-seq and mRNA data in scRNAseq. Recent approaches 11,12,20,27,33,37 introduce deep learning technologies (such as AE and VAE) for these tasks and show great promise. These approaches generally design noise models based on data patterns (e.g., zero-inflation) and learn the model parameters through neural networks 11,13,33 .…”
Section: Discussionmentioning
confidence: 99%
“…scAR can precisely infer the native signals for protein data in CITE-seq and mRNA data in scRNAseq. Recent approaches 11,12,20,27,33,37 introduce deep learning technologies (such as AE and VAE) for these tasks and show great promise. These approaches generally design noise models based on data patterns (e.g., zero-inflation) and learn the model parameters through neural networks 11,13,33 .…”
Section: Discussionmentioning
confidence: 99%
“…The functions and the function f w are encoder and decoder functions, respectively. To be as comparable as possible to PeakVI as implemented in scvi-tools 8,18 (v.0.15.0), we use the same architecture. Specifically, these functions consist of two repeated blocks of fully connected neural networks with a fixed number of hidden dimensions set to the square root of the number of input dimensions, a dropout layer, a layer-norm layer, and leakyReLU activation.…”
Section: Methodsmentioning
confidence: 99%
“…Amongst other data that can be optionally passed into PhyloVision are numerical and categorical metadata, as well as a two-dimensional projections of the cells for visualization purposes (e.g. from t-distributed stochastic neighbor embedding [tSNE] ( van der Maaten, 2008 ) of the main principal components or of an embedding learned by methods such as scVI ( Lopez et al., 2018 ; Gayoso et al, 2022 ). In the original VISION pipeline, cell-level clustering and consistency evaluation was performed on a user-specified “latent space” (a low-dimensional embedding such as the top principal components or an embedding inferred with tools like scVI).…”
Section: Methodsmentioning
confidence: 99%