2020
DOI: 10.1093/bioinformatics/btaa176
|View full text |Cite
|
Sign up to set email alerts
|

Exploring high-dimensional biological data with sparse contrastive principal component analysis

Abstract: Algorithm 1: scPCA Result: Produces a sparse low-dimensional representation of the target data, X n×p , by contrasting the variation of X n×p and some background data, Y m×p , while applying an 1 penalty to the loadings generated by cPCA. Input : target dataset: X background dataset: Y binary variable indicating whether to column-scale the data: scale vector of possible contrastive parameters: γ = (γ 1 , . . . , γ s ) vector of possible 1 penalty parameters: λ 1 = (λ 1,1 , . . . , λ 1,d ) number of sparse cont… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 31 publications
(14 citation statements)
references
References 41 publications
(26 reference statements)
0
13
0
Order By: Relevance
“…When applying ULCA to such a dataset, many of the attributes (e.g., 100 attributes) could significantly contribute to embedding axes. To avoid using many attributes when constructing the embedding axes, various sparse DR methods, such as sparse PCA [98], sparse LDA [15], and space cPCA [7], have been developed. These methods produce a sparse projection matrix by penalizing a case where an embedding uses many attributes.…”
Section: Discussionmentioning
confidence: 99%
“…When applying ULCA to such a dataset, many of the attributes (e.g., 100 attributes) could significantly contribute to embedding axes. To avoid using many attributes when constructing the embedding axes, various sparse DR methods, such as sparse PCA [98], sparse LDA [15], and space cPCA [7], have been developed. These methods produce a sparse projection matrix by penalizing a case where an embedding uses many attributes.…”
Section: Discussionmentioning
confidence: 99%
“…A sparse version of CPCA was recently developed, which allows for greater interpretability of the estimated components, especially in high-dimensional settings (Boileau et al, 2020). Building off of sparse PCA (Zou et al, 2006), which uses element-wise 1 regularization to encourage zeros in the loadings matrix, the authors propose an estimation procedure that alternates between estimating the principal components and the sparse loadings matrix.…”
Section: Contrastive Dimension Reduction Methodsmentioning
confidence: 99%
“…We have data on the BMMCs before stemcell transplant and the BMMCs after stem-cell transplant. We preprocess the data sets as described by the authors in [4] 2 keeping the 1000 most variable genes measured across all 16856 cells (patient-035: 4501 cells and two healthy individuals; one of 1985 cells and the other of 2472 cells). The data from the two healthy patients are combined to create a background data matrix of dimension 4457 × 1000 and we use patient-035 data set as the target data matrix of dimension 4501 × 1000 .…”
Section: Numerical Experimentsmentioning
confidence: 99%