2019
DOI: 10.1002/sta4.231
|View full text |Cite
|
Sign up to set email alerts
|

Estimating the number of signals using principal component analysis

Abstract: In this work, we develop inferential tools for determining the correct number of principal components under a general noisy latent variable model, which includes as a special case, for example, the noisy independent component model. The problem is approached using hypothesis testing, and we provide both a large‐sample test and several resampling‐based alternatives. Simulations and an application to sound data reveal that both types of approaches keep the desired levels and have good power.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3

Relationship

3
4

Authors

Journals

citations
Cited by 9 publications
(16 citation statements)
references
References 24 publications
0
15
0
Order By: Relevance
“…Again, as it is risky to base the decision here on one lag τ only, that is, on AMUSE alone, pooling information is again a safer choice. The pooled "SOBI" eigenvalues are defined as Matilainen et al (2018); Virta and Nordhausen (2021) for this purpose is…”
Section: Estimation Of Q In Inmmentioning
confidence: 99%
See 1 more Smart Citation
“…Again, as it is risky to base the decision here on one lag τ only, that is, on AMUSE alone, pooling information is again a safer choice. The pooled "SOBI" eigenvalues are defined as Matilainen et al (2018); Virta and Nordhausen (2021) for this purpose is…”
Section: Estimation Of Q In Inmmentioning
confidence: 99%
“…where Matilainen et al (2018) suggested bootstrapping strategies to get the distribution of t k under H 0k , and Virta and Nordhausen (2021) showed that under quite broad assumptions for a time series of length T, T j T j p −q ð Þ 2 t q converges to a χ 2 Virta and Nordhausen (2021) also showed that successive testing can be used to get a consistent estimate for q.…”
Section: Estimation Of Q In Inmmentioning
confidence: 99%
“…If r < r, then we may fail to capture all relevant information in L. On the other hand, if r > r, then the estimate L will overfit the training data, potentially leading to downstream overfitting in the ensemble. In general, estimating r is a challenging task [Wold, 1978, Peres-Neto et al, 2005, Choi et al, 2017, Virta and Nordhausen, 2019.…”
Section: Crc-l: Training a Classifier On The Latent Signalsmentioning
confidence: 99%
“…This signal subspace can be straightforwardly estimated with PCA as long as one knows its dimension d which is, however, usually unknown in practice. Numerous procedures for determining the dimension have been proposed, see Jolliffe (2002) for a review and, e.g., Schott (2006); Nordhausen et al (2016); Virta and Nordhausen (2019) for asymptotic tests and Beran and Srivastava (1985); Dray (2008); Luo and Li (2016) for bootstrap-and permutation-based techniques. Simplest of these methods is perhaps the test of sub-sphericity based on the test statistics, Tn,j = m2,p−j(Sn) m1,p−j(Sn) 2 − 1, j = 0, .…”
Section: Introductionmentioning
confidence: 99%