2020
DOI: 10.3390/genes11040377
|View full text |Cite
|
Sign up to set email alerts
|

Network-Based Single-Cell RNA-Seq Data Imputation Enhances Cell Type Identification

Abstract: Single-cell RNA sequencing is a powerful technology for obtaining transcriptomes at single-cell resolutions. However, it suffers from dropout events (i.e., excess zero counts) since only a small fraction of transcripts get sequenced in each cell during the sequencing process. This inherent sparsity of expression profiles hinders further characterizations at cell/gene-level such as cell type identification and downstream analysis. To alleviate this dropout issue we introduce a network-based method, netImpute, b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 14 publications
(13 citation statements)
references
References 35 publications
(45 reference statements)
0
13
0
Order By: Relevance
“…Nevertheless, the use of "dropouts" in later papers became inconsistent and confusing: most papers meant non-biological zeros [20,36,40,50,53,93,94]; some meant non-biological zeros and low expression measurements [43,95]; some meant all zeros [44,45,96]. In addition,…”
Section: Clarification Of Zero-related Terminologymentioning
confidence: 99%
See 1 more Smart Citation
“…Nevertheless, the use of "dropouts" in later papers became inconsistent and confusing: most papers meant non-biological zeros [20,36,40,50,53,93,94]; some meant non-biological zeros and low expression measurements [43,95]; some meant all zeros [44,45,96]. In addition,…”
Section: Clarification Of Zero-related Terminologymentioning
confidence: 99%
“…” Hence, dropouts, as a data-driven concept, are not equivalent to either biological or non-biological zeros. Nevertheless, the use of “dropouts” in later papers became inconsistent and confusing: most papers meant non-biological zeros [20, 36, 40, 52, 55, 93, 94]; some meant non-biological zeros and low expression measurements [45, 95]; some meant all zeros [46, 47, 96]. In addition, “dropout” was often used as an adjective to mean the existence of many zeros [97].…”
Section: Introductionmentioning
confidence: 99%
“…However, for the sake of completeness we also examined the possible role of "imputation" and using raw data instead of the binarized scRNA-seq data. We tried to impute the dropouts in scRNA-seq data using SAVER 19 and netImpute 20 , but no significant improvement was gained in terms of enhancing our metric scores. On the other hand, although our analysis indicated that using raw data instead of the binarized data can potentially increase the consistency between gene expression pattern similarity and cell proximity in this challenge (according to M 1 and M 2 metrics), we are limited by the fact that the true locations of the cells to be predicted are unknown, and prediction accuracy is at least partially defined by comparing to the “gold standard” location obtained from binarized data.…”
Section: Resultsmentioning
confidence: 99%
“…Thus, a particular strength of network propagation is the fact that prior knowledge is utilized for the analysis of new data, which potentially helps increasing the signal-to-noise ratio and which aids the mechanistic interpretation of results. Within the realm of molecular biology, network propagation has a wide range of applications such as imputation of missing values (van Dijk et al 2017; Ronen and Akalin 2018; Zand and Ruan 2020), protein function prediction (Warde-Farley et al 2010), inferring condition-specifically altered sub-networks (Hofree et al 2013), and prioritization of disease genes (Cowen et al 2017; Hadas, Kupiec, and Sharan 2019).…”
Section: Introductionmentioning
confidence: 99%