2020
DOI: 10.48550/arxiv.2012.07977
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Probabilistic Contrastive Principal Component Analysis

Didong Li,
Andrew Jones,
Barbara Engelhardt

Abstract: Dimension reduction is useful for exploratory data analysis. In many applications, it is of interest to discover variation that is enriched in a "foreground" dataset relative to a "background" dataset. Recently, contrastive principal component analysis (CPCA) was proposed for this setting. However, the lack of a formal probabilistic model makes it difficult to reason about CPCA and to tune its hyperparameter. In this work, we propose probabilistic contrastive principal component analysis (PCPCA), a model-based… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 22 publications
0
5
0
Order By: Relevance
“…Isolating salient variations present only in a target dataset is the subject of contrastive analysis (CA) [56,3,22,29,40,2,48]. While many recent studies have modeled scRNAseq data by fitting probabilistic models and representing the data in a lower dimension [30,38,18,32,31], few of these models are designed for CA.…”
Section: Mainmentioning
confidence: 99%
See 2 more Smart Citations
“…Isolating salient variations present only in a target dataset is the subject of contrastive analysis (CA) [56,3,22,29,40,2,48]. While many recent studies have modeled scRNAseq data by fitting probabilistic models and representing the data in a lower dimension [30,38,18,32,31], few of these models are designed for CA.…”
Section: Mainmentioning
confidence: 99%
“…Isolating the variations enriched in a target dataset is the subject of contrastive analysis (CA) [5, 6, 7, 8, 9, 10]. Many recent studies proposed probabilistic latent variable models for analyzing single-cell data [11, 12, 13].…”
Section: Mainmentioning
confidence: 99%
See 1 more Smart Citation
“…The idea behind leak detection water distribution systems is to maintain disinfection levels, pressure and reduce water loss are equally important. To address the leak detection purposes, several approaches have been developed including principal component analysis (PCA) [1], [2], nonlinear PCA (NPCA) [3], Multi-Regional PCA (MRPCA) [4], probabilistic PCA (PPCA) [5], [6] and attribute PCA (APCA) [7]. However, most industrial systems are nonlinear.…”
Section: Introductionmentioning
confidence: 99%
“…To address the fault detection purposes, several approaches have been developed, including principal component analysis (PCA) [1][2][3], nonlinear PCA (NPCA) [4], Multi-Regional PCA (MRPCA) [5], probabilistic PCA (PPCA) [6], attribute PCA (APCA) [7] and interval PCA (IPCA) [8]. Other detection indices, including the Hotelling T 2 statistic [9], the sum of squared residuals SPE and the generalized likelihood ratio test (GLRT) have been used.…”
Section: Introductionmentioning
confidence: 99%