2020
DOI: 10.1101/2020.11.17.387779
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PseudotimeDE: inference of differential gene expression along cell pseudotime with well-calibrated p-values from single-cell RNA sequencing data

Abstract: In the investigation of molecular mechanisms underlying cell state changes, a crucial analysis is to identify differentially expressed (DE) genes along a continuous cell trajectory, which can be estimated by pseudotime inference from single-cell RNA-sequencing (scRNA-seq) data. However, existing methods that identify DE genes based on inferred pseudotime do not account for the uncertainty in pseudotime inference. Also, they either have ill-posed p-values that hinder the control of false discovery rate (FDR) or… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
3
3

Relationship

4
2

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 69 publications
0
5
0
Order By: Relevance
“…In addition to selecting experimental protocols before conducting scRNA-seq experiments, a common challenge after collecting scRNA-seq data is to choose among the many available data analysis methods in an unbiased manner. For example, many algorithms have been developed for missing gene expression imputation [36,37], dimensionality reduction [38][39][40], cell clustering [41][42][43][44], rare cell type detection [45][46][47], differentially expressed gene identification [48][49][50][51][52], and trajectory inference [53][54][55][56][57]. Even though several benchmark and comparative studies have been carried out for common analysis tasks [58][59][60][61][62][63], most of them have only evaluated a subset of available computational methods using data from limited experimental protocols.…”
Section: Introductionmentioning
confidence: 99%
“…In addition to selecting experimental protocols before conducting scRNA-seq experiments, a common challenge after collecting scRNA-seq data is to choose among the many available data analysis methods in an unbiased manner. For example, many algorithms have been developed for missing gene expression imputation [36,37], dimensionality reduction [38][39][40], cell clustering [41][42][43][44], rare cell type detection [45][46][47], differentially expressed gene identification [48][49][50][51][52], and trajectory inference [53][54][55][56][57]. Even though several benchmark and comparative studies have been carried out for common analysis tasks [58][59][60][61][62][63], most of them have only evaluated a subset of available computational methods using data from limited experimental protocols.…”
Section: Introductionmentioning
confidence: 99%
“…This issue is evidenced by serious concerns about the widespread miscalculation and misuse of p-values in the scientific community [30]. As a result, bioinformatics tools using questionable p-values either cannot reliably control the FDR to a target level [23] or lack power to make discoveries [31]; see Results. Therefore, p-value-free control of FDR is desirable, as it would make data analysis more transparent and thus improve the reproducibility of scientific research.…”
Section: Resultsmentioning
confidence: 99%
“…However, there is a growing need to generalize our framework to identify features across more than two conditions. For example, temporal analysis of scRNA-seq data aims to identify genes whose expression levels change along cell pseudotime [31]. To tailor Clipper for such analysis, we could define a new contrast score that differentiates the genes with stationary expression (uninteresting features) from the other genes with varying expression (interesting features).…”
Section: Discussionmentioning
confidence: 99%
“…This issue is evidenced by serious concerns about the widespread miscalculation and misuse of p-values in the scientific community [30]. As a result, bioinformatics tools using questionable p-values either cannot reliably control the FDR to a target level [23] or lack power to make discoveries [31]; see the "Results" section. Therefore, p-value-free control of FDR is desirable, as it would make data analysis more transparent and thus improve the reproducibility of scientific research.…”
Section: Introductionmentioning
confidence: 99%