2018
DOI: 10.1038/s41467-017-02554-5
|View full text |Cite|
|
Sign up to set email alerts
|

A general and flexible method for signal extraction from single-cell RNA-seq data

Abstract: Single-cell RNA-sequencing (scRNA-seq) is a powerful high-throughput technique that enables researchers to measure genome-wide transcription levels at the resolution of single cells. Because of the low amount of RNA present in a single cell, some genes may fail to be detected even though they are expressed; these genes are usually referred to as dropouts. Here, we present a general and flexible zero-inflated negative binomial model (ZINB-WaVE), which leads to low-dimensional representations of the data that ac… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

7
633
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 612 publications
(665 citation statements)
references
References 54 publications
7
633
0
Order By: Relevance
“…pCMF models both dropout events and the mean-variance dependence resulting from the count nature of scRNAseq data [31]. ZINB-WaVE incorporates additional gene-level and sample-level covariates for more accurate DR [32]. Finally, several deep learning-based DR methods have recently been developed to enable scalable and effective computation in large-scale scRNAseq data, including data that are collected by 10X Genomics techniques [33] and/or from large consortium studies such as Human Cell Atlas (HCA) [34,35].…”
Section: Introductionmentioning
confidence: 99%
“…pCMF models both dropout events and the mean-variance dependence resulting from the count nature of scRNAseq data [31]. ZINB-WaVE incorporates additional gene-level and sample-level covariates for more accurate DR [32]. Finally, several deep learning-based DR methods have recently been developed to enable scalable and effective computation in large-scale scRNAseq data, including data that are collected by 10X Genomics techniques [33] and/or from large consortium studies such as Human Cell Atlas (HCA) [34,35].…”
Section: Introductionmentioning
confidence: 99%
“…However, choosing genes solely based on log‐normalized single‐cell variance fails to account for the mean‐variance relationship that is inherent to single‐cell RNA‐seq . Variance‐stabilizing transformation can be applied to transform both lowly and highly expressed genes onto a common scale, thus eliminating the effect of sequencing depth variation …”
Section: Computational Analysis Of Scrna‐seq Datamentioning
confidence: 99%
“…23 Variance-stabilizing transformation can be applied to transform both lowly and highly expressed genes onto a common scale, thus eliminating the effect of sequencing depth variation. 24 Dimension reduction is widely used in machine learning that project the high dimensional data to lower dimensional space, and preserve important features of the original data. Principal component analysis (PCA), T-distributed stochastic neighbor embedding (tSNE), 25 Diffusion maps (DM) 26 and Uniform Manifold Approximation and Projection (UMAP) 27 are widely used dimension reduction methods.…”
Section: Feature Selection and Dimension Reductionmentioning
confidence: 99%
“…As scRNA-seq data can be approximated by negative binomial (NB) or zeroinflated NB (ZINB) distribution 16,17 , we considered the use of the statistic, (expression entropydifferential entropy of expression distribution, as defined in Methods), to capture the degree of disorder or randomness of gene expression. Notably, we observed a strong relationship between and the mean expression level ( ) of genes, thus forming the basis for our expression entropy model (model, Fig.…”
Section: Expression Entropy Model Enables Sensitive and Accurate Idenmentioning
confidence: 99%