2005
DOI: 10.1002/cem.915
|View full text |Cite
|
Sign up to set email alerts
|

The prediction error in CLS and PLS: the importance of feature selection prior to multivariate calibration

Abstract: Classical least squares (CLS) and partial least squares (PLS) are two common multivariate regression algorithms in chemometrics. This paper presents an asymptotically exact mathematical analysis of the mean squared error of prediction of CLS and PLS under the linear mixture model commonly assumed in spectroscopy. For CLS regression with a very large calibration set the root mean squared error is approximately equal to the noise per wavelength divided by the length of the net analyte signal vector. It is shown,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
60
0

Year Published

2005
2005
2017
2017

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 109 publications
(64 citation statements)
references
References 47 publications
(79 reference statements)
1
60
0
Order By: Relevance
“…Our analysis is limited to the cases of either a finite error-free setting or a noisy but infinite population setting. While many simulations have studied the effects of various parameters on PLS and other competing algorithms in the presence of a finite and noisy training set, a theoretical statistical analysis in this case is still an open research problem [14].…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…Our analysis is limited to the cases of either a finite error-free setting or a noisy but infinite population setting. While many simulations have studied the effects of various parameters on PLS and other competing algorithms in the presence of a finite and noisy training set, a theoretical statistical analysis in this case is still an open research problem [14].…”
Section: Discussionmentioning
confidence: 99%
“…1. The analysis of PLS with a finite and noisy calibration set is considered in Reference [14]. We first consider a system with a single component, for which we assume input data of the form…”
Section: Pls In the Presence Of Noisementioning
confidence: 99%
See 2 more Smart Citations
“…One standard solution is to rely on full-spectrum methods for linear dimension reduction coupled with linear regression: the basic formulations of principal components regression (PCR) and partial least-squares regression (PLSR) are reference models. The natural refinement of such an approach benefits from a preliminary selection of relevant wavelength ranges [2] as performed by one of the many available techniques (e.g. see References [3][4][5][6][7][8][9][10][11]).…”
Section: Introductionmentioning
confidence: 99%