2019
DOI: 10.1101/836650
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Exploring High-Dimensional Biological Data with Sparse Contrastive Principal Component Analysis

Abstract: Motivation: Statistical analyses of high-throughput sequencing data have re-shaped the biological sciences. In spite of myriad advances, recovering interpretable biological signal from data corrupted by technical noise remains a prevalent open problem. Several classes of procedures, among them classical dimensionality reduction techniques, and others incorporating subject-matter knowledge, have provided e ective advances; however, no procedure currently satisfies the dual objectives of recovering stable and re… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(9 citation statements)
references
References 39 publications
(51 reference statements)
0
9
0
Order By: Relevance
“…Although SPCA and cPCA have proven useful in resolving individual shortcomings of PCA, neither is capable of tackling the issues of stability and relevance simultaneously. The scPCA R package implements sparse constrastive PCA (scPCA) (Boileau, Hejazi, & Dudoit, 2019), a combination of these methods, drawing on cPCA to remove unwanted effects and on SPCA to sparsify the principal component loadings. In both simulation studies and data analysis, Boileau et al (2019) provided practical demonstrations of scPCA's ability to extract stable, interpretable, and uncontaminated signal from high-dimensional biological data.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Although SPCA and cPCA have proven useful in resolving individual shortcomings of PCA, neither is capable of tackling the issues of stability and relevance simultaneously. The scPCA R package implements sparse constrastive PCA (scPCA) (Boileau, Hejazi, & Dudoit, 2019), a combination of these methods, drawing on cPCA to remove unwanted effects and on SPCA to sparsify the principal component loadings. In both simulation studies and data analysis, Boileau et al (2019) provided practical demonstrations of scPCA's ability to extract stable, interpretable, and uncontaminated signal from high-dimensional biological data.…”
Section: Discussionmentioning
confidence: 99%
“…The scPCA R package implements sparse constrastive PCA (scPCA) (Boileau, Hejazi, & Dudoit, 2019), a combination of these methods, drawing on cPCA to remove unwanted effects and on SPCA to sparsify the principal component loadings. In both simulation studies and data analysis, Boileau et al (2019) provided practical demonstrations of scPCA's ability to extract stable, interpretable, and uncontaminated signal from high-dimensional biological data. Indeed, scPCA was found to produce more informative and interpretable embeddings than linear (e.g.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Specifically, in any of these algorithms, one can consider replacing the distance by the "constrastive distance." For example, sparse CPCA has been developed following this philosophy (Boileau et al, 2020), although without this justification. However, it is important to note that the contrastive distance is not a well-defined distance, which may violate the assumptions of traditional distance-based algorithms, and so cannot be used to replace distance metrics in existing algorithms without some luck.…”
Section: Geometric Interpretation Of Cpcamentioning
confidence: 99%
“…Specifically, genes can be selected based on their contributions to the projected low dimensions found by PCA or NMF [1719]. Although many variants of PCA and NMF algorithms have been developed for scRNA-seq data analysis, they are not designed for gene selection [2026].…”
Section: Introductionmentioning
confidence: 99%