2010
DOI: 10.1002/cem.1310
|View full text |Cite
|
Sign up to set email alerts
|

Principles of Proper Validation: use and abuse of re‐sampling for validation

Abstract: Validation in chemometrics is presented using the exemplar context of multivariate calibration/prediction. A phenomenological analysis of common validation practices in data analysis and chemometrics leads to formulation of a set of generic Principles of Proper Validation (PPV), which is based on a set of characterizing distinctions: (i) Validation cannot be understood by focusing on the methods of validation only; validation must be based on full knowledge of the underlying definitions, objectives, methods, e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
176
0
2

Year Published

2012
2012
2019
2019

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 239 publications
(192 citation statements)
references
References 62 publications
1
176
0
2
Order By: Relevance
“…When it comes to timeseries, the seasonal effect can introduce a bias when splitting the dataset in equivalent sets, required by the k-fold crossvalidation method [12]. Moreover, in our evaluation we want to demonstrate the accuracy of modelling with micro learning units rather than evaluating the efficiency of the ML algorithm itself.…”
Section: A Setupmentioning
confidence: 99%
“…When it comes to timeseries, the seasonal effect can introduce a bias when splitting the dataset in equivalent sets, required by the k-fold crossvalidation method [12]. Moreover, in our evaluation we want to demonstrate the accuracy of modelling with micro learning units rather than evaluating the efficiency of the ML algorithm itself.…”
Section: A Setupmentioning
confidence: 99%
“…In general, the driving force behind application of multivariate calibration methods is to minimize the time-consuming effort (and cost) of performing actual y-measurements on a process (manual inspection of high speed video recordings in this case). The calibration and validation data were obtained from independent sets of experiments in accordance with the requirements stipulated by Esbensen and Geladi [34] regarding the necessary realism and validity of independent test set validation. Generally, visualization plots and statistical results are used for describing the prediction performance of PLS regression models.…”
Section: Partial Least Squares Regression (Pls-r)mentioning
confidence: 99%
“…There are several validation techniques available [20][21][22]. However, test-set validation has been the recommended validation method because it provides realistic prediction errors and optimal number of PLS-R components [11]. In this regard, over-fitting or under-fitting of the prediction model is avoided.…”
Section: Partial Least Squares Regressionmentioning
confidence: 99%
“…The present work is an attempt to adapt acoustic chemometrics for on-line fluidised bed drying progress monitoring and end-point determination using dedicated test material and PLS-R regression models validated with independent test data [11].…”
Section: Introductionmentioning
confidence: 99%