2020
DOI: 10.1101/gr.251603.119
|View full text |Cite
|
Sign up to set email alerts
|

netNMF-sc: leveraging gene–gene interactions for imputation and dimensionality reduction in single-cell expression analysis

Abstract: Single-cell RNA-sequencing (scRNA-seq) enables high-throughput measurement of RNA expression in single cells. However, because of technical limitations, scRNA-seq data often contain zero counts for many transcripts in individual cells. These zero counts, or dropout events, complicate the analysis of scRNA-seq data using standard methods developed for bulk RNA-seq data. Current scRNA-seq analysis methods typically overcome dropout by combining information across cells in a lower-dimensional space, leveraging th… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
56
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 83 publications
(57 citation statements)
references
References 49 publications
1
56
0
Order By: Relevance
“…3 ). Median L1 distance, cosine similarity, and root-mean-square-deviation (RMSE) scores between the original data set and the imputed values for these synthetic entries were calculated to compare scGNN with MAGIC 4 , SAUCIE 10 , SAVER 19 , scImpute 33 , scVI 32 , DCA 11 , DeepImpute 34 , scIGANs 35 , and netNMF-sc 36 (see “Methods” section). scGNN achieves the best results in recovering gene expressions in terms of median L1 distance, and RMSE at the 10 and 30% synthetic dropout rate, respectively.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…3 ). Median L1 distance, cosine similarity, and root-mean-square-deviation (RMSE) scores between the original data set and the imputed values for these synthetic entries were calculated to compare scGNN with MAGIC 4 , SAUCIE 10 , SAVER 19 , scImpute 33 , scVI 32 , DCA 11 , DeepImpute 34 , scIGANs 35 , and netNMF-sc 36 (see “Methods” section). scGNN achieves the best results in recovering gene expressions in terms of median L1 distance, and RMSE at the 10 and 30% synthetic dropout rate, respectively.…”
Section: Resultsmentioning
confidence: 99%
“…The process is repeated three times, and the mean and standard deviation were selected as a comparison. The scores are compared between scGNN and nine imputation tools (i.e., MAGIC 4 , SAUCIE 10 , SAVER 19 , scImpute 33 , scVI 32 , DCA 11 , DeepImpute 34 , scIGANs 35 , and netNMF-sc 36 ), using the default parameters.…”
Section: Methodsmentioning
confidence: 99%
“…Such excess zeros would bias the estimation of gene expression correlations [42] and hinder the capture of gene expression dynamics [43] from scRNAseq data. In early scRNA-seq data analyses, the high data sparsity provoked the use of zeroinflated models [36,38,44] and the development of imputation methods for reducing zeros [20,42,43,[45][46][47][48][49][50][51][52][53][54][55][56][57][58][59][60]. More recently, however, there were voices against the use of zero-inflated models for scRNA-seq data generated by UMI protocols [61].…”
Section: Introductionmentioning
confidence: 99%
“…While much of the motivation for this work is the difficulty of acquiring scRNA for primary tumor samples when examining metastases years later, we do anticipate that this problem will lessen over time. It is thus worth considering for the future whether our methods might be adapted for working on limited and noisy scRNA with matched bulk data ( Elyanow et al , 2020 ).…”
Section: Discussionmentioning
confidence: 99%