2019
DOI: 10.1101/684340
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A quantitative framework for evaluating single-cell data structure preservation by dimensionality reduction techniques

Abstract: SummaryHigh-dimensional data, such as those generated using single-cell RNA sequencing, present challenges in interpretation and visualization. Numerical and computational methods for dimensionality reduction allow for low-dimensional representation of genome-scale expression data for downstream clustering, trajectory reconstruction, and biological interpretation. However, a comprehensive and quantitative evaluation of the performance of these techniques has not been establishe… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 40 publications
0
4
0
Order By: Relevance
“…A number of good comparison and benchmark studies have already been performed on various steps related to scRNAseq processing and analysis and can guide the choice of methodology [3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][19][20][21]. However, these recommendations need constant updating and often leave open many details of an analysis.…”
Section: Introductionmentioning
confidence: 99%
“…A number of good comparison and benchmark studies have already been performed on various steps related to scRNAseq processing and analysis and can guide the choice of methodology [3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][19][20][21]. However, these recommendations need constant updating and often leave open many details of an analysis.…”
Section: Introductionmentioning
confidence: 99%
“…The performances of the DR methods on the real and simulated CyTOF data are overall comparable, indicating the validity of our evaluation results as well as the simulation approach. Previous comparative works in the field of scRNAseq have supported the notions that tSNE and UMAP are the top performers in general and even linear methods are well-suited for certain workflows 20,37,38 . Few attempts at tackling this issue for CyTOF data have been made and the field seems to think in general that the best methodologies for scRNA-seq data can be directly applied on CyTOF data.…”
Section: Discussionmentioning
confidence: 91%
“…Here, we treat PCDs (of the original space data and DR embedding space data, respectively) as empirical one-dimensional distributions: in practice, we flatten the 𝑁 × 𝐶 matrices into vectors to represent observations from distributions that encapsulate the underlying global structure. Following the procedure of a previous usage of EMD as a global metric 20 , we perform min-max normalization of each vector to account for the difference in scale (original space PCDs can be on a different scale than embedding space PCDs). Subsequently, EMD is calculated after normalization.…”
Section: Accuracy -Global Structure Preservationmentioning
confidence: 99%
See 1 more Smart Citation