The prediction error in CLS and PLS: the importance of feature selection prior to multivariate calibration

Nadler, Boaz; Coifman, Ronald R.

doi:10.1002/cem.915

Cited by 109 publications

(64 citation statements)

References 47 publications

(79 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Our analysis is limited to the cases of either a finite error-free setting or a noisy but infinite population setting. While many simulations have studied the effects of various parameters on PLS and other competing algorithms in the presence of a finite and noisy training set, a theoretical statistical analysis in this case is still an open research problem [14].…”

Section: Discussionmentioning

confidence: 99%

“…1. The analysis of PLS with a finite and noisy calibration set is considered in Reference [14]. We first consider a system with a single component, for which we assume input data of the form…”

Section: Pls In the Presence Of Noisementioning

confidence: 99%

“…The regression vector computed by PLS is simply the result of an ordinary least squares regression on these projections. This is equivalent to minimizing (14) under the restriction that r PLS 2 Spanfw w j g k j¼0 .…”

Section: Proofmentioning

confidence: 99%

“…1. The more complicated (and more interesting) theoretical analysis of PLS predictions based on a finite and noisy training set will be published separately [14].…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Partial least squares, Beer's law and the net analyte signal: statistical modeling and analysis

Nadler

Coifman

2005

Journal of Chemometrics

Self Cite

View full text Add to dashboard Cite

Partial least squares (PLS) is one of the most common regression algorithms in chemistry, relating input-output samples (x i , y i ) by a linear multivariate model. In this paper we analyze the PLS algorithm under a specific probabilistic model for the relation between x and y. Following Beer's law, we assume a linear mixture model in which each data sample (x, y) is a random realization from a joint probability distribution where x is the sum of k components multiplied by their respective characteristic responses, and each of these components is a random variable. We analyze PLS on this model under two idealized settings: one is the ideal case of noise-free samples and the other is the case of an infinite number of noisy training samples. In the noise-free case we prove that, as expected, the regression vector computed by PLS is, up to normalization, the net analyte signal. We prove that PLS computes this vector after at most k iterations, where k is the total number of components. In the case of an infinite training set corrupted by unstructured noise, we show that PLS computes a final regression vector which is not in general purely proportional to the net analyte signal vector, but has the important property of being optimal under a mean squared error of prediction criterion. This result can be viewed as an asymptotic optimality of PLS in the limit of a very large but finite training set.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Pls In the Presence Of Noisementioning

confidence: 99%

Section: Proofmentioning

confidence: 99%

“…1. The more complicated (and more interesting) theoretical analysis of PLS predictions based on a finite and noisy training set will be published separately [14].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Partial least squares, Beer's law and the net analyte signal: statistical modeling and analysis

Nadler

Coifman

2005

Journal of Chemometrics

Self Cite

View full text Add to dashboard Cite

show abstract

“…One standard solution is to rely on full-spectrum methods for linear dimension reduction coupled with linear regression: the basic formulations of principal components regression (PCR) and partial least-squares regression (PLSR) are reference models. The natural refinement of such an approach benefits from a preliminary selection of relevant wavelength ranges [2] as performed by one of the many available techniques (e.g. see References [3][4][5][6][7][8][9][10][11]).…”

Section: Introductionmentioning

confidence: 99%

Wavelength selection using the measure of topological relevance on the self‐organizing map

Corona¹,

Рейникайнен

Aaljoki

et al. 2008

Journal of Chemometrics

View full text Add to dashboard Cite

In this work, we investigated the possibility to perform wavelength selection by exploiting the metric structure of the spectrophotoscopic measurements. The topologically preserving representation of the data is performed using the self-organizing map (SOM) where the inputs' significance to the output is computed with the measure of topological relevance (MTR) on SOM. The MTR on SOM is a metric measuring the similarity between local distance matrices and we found that spectral inputs with a topology, which is, close to the output's are also associated to the wavelengths that chemically explain the influence of the spectra to the property of interest. As a result, we suggest a wavelength selection strategy based on the MTR on SOM, that is, interpretable to the domain experts and independent on the regression technique subsequently used for estimation. To support the presentation, a full-scale application from the oil refining industry is illustrated on the problem of estimating standard properties in a complex hydrocarbon product starting from spectrophotoscopic measurements. The method is further validated on the problem of octane number estimation in finished gasolines, under small sample conditions. The application led to accurate, parsimonious and understandable models. Copyright (C) 2008 John Wiley & Sons, Ltd

show abstract

Chemometrics

Brown

2005

Encyclopedia of Statistical Sciences

View full text Add to dashboard Cite

The prediction error in CLS and PLS: the importance of feature selection prior to multivariate calibration

Cited by 109 publications

References 47 publications

Partial least squares, Beer's law and the net analyte signal: statistical modeling and analysis

Partial least squares, Beer's law and the net analyte signal: statistical modeling and analysis

Wavelength selection using the measure of topological relevance on the self‐organizing map

Chemometrics

Contact Info

Product

Resources

About