Principles of Proper Validation: use and abuse of re‐sampling for validation

Esbensen, Kim H.; Geladi, Paul

doi:10.1002/cem.1310

Cited by 239 publications

(192 citation statements)

References 62 publications

Supporting

Mentioning

176

Contrasting

Unclassified

Order By: Relevance

“…When it comes to timeseries, the seasonal effect can introduce a bias when splitting the dataset in equivalent sets, required by the k-fold crossvalidation method [12]. Moreover, in our evaluation we want to demonstrate the accuracy of modelling with micro learning units rather than evaluating the efficiency of the ML algorithm itself.…”

Section: A Setupmentioning

confidence: 99%

The next evolution of MDE: a seamless integration of machine learning into domain modeling

Hartmann

Moawad²,

Fouquet

et al. 2017

Softw Syst Model

View full text Add to dashboard Cite

Abstract-Machine learning algorithms are designed to resolve unknown behaviours by extracting commonalities over massive datasets. Unfortunately, learning such global behaviours can be inaccurate and slow for systems composed of heterogeneous elements, which behave very differently, for instance as it is the case for cyber-physical systems and Internet of Things applications. Instead, to make smart decisions, such systems have to continuously refine the behaviour on a per-element basis and compose these small learning units together. However, combining and composing learned behaviours from different elements is challenging and requires domain knowledge. Therefore, there is a need to structure and combine the learned behaviours and domain knowledge together in a flexible way. In this paper we propose to weave machine learning into domain modeling. More specifically, we suggest to decompose machine learning into reusable, chainable, and independently computable small learning units, which we refer to as micro learning units. These micro learning units are modeled together with and at the same level as the domain data. We show, based on a smart grid case study, that our approach can be significantly more accurate than learning a global behaviour while the performance is fast enough to be used for live learning.

show abstract

Section: A Setupmentioning

confidence: 99%

The next evolution of MDE: a seamless integration of machine learning into domain modeling

Hartmann

Moawad²,

Fouquet

et al. 2017

Softw Syst Model

View full text Add to dashboard Cite

show abstract

“…In general, the driving force behind application of multivariate calibration methods is to minimize the time-consuming effort (and cost) of performing actual y-measurements on a process (manual inspection of high speed video recordings in this case). The calibration and validation data were obtained from independent sets of experiments in accordance with the requirements stipulated by Esbensen and Geladi [34] regarding the necessary realism and validity of independent test set validation. Generally, visualization plots and statistical results are used for describing the prediction performance of PLS regression models.…”

Section: Partial Least Squares Regression (Pls-r)mentioning

confidence: 99%

Online acoustic chemometric monitoring of fish feed pellet velocity in a pneumatic conveying system

et al. 2014

View full text Add to dashboard Cite

Fish farmers consider the cost of fish feed pellets as one of the most expensive factors in fish cultivation. Proper control of the handling and conveying systems is necessary to avoid damage and disintegration of the cylindrically shaped fish feed pellets. Pneumatic conveying is widely used to transport large quantities of fish feed. Proneness of crushing the fish feed pellets caused by pellets interaction with the inner wall of the pipeline is a major concern to the manufacturer due to the associated economic loss; pellet damage increases exponentially with the conveying air velocity. On the other hand, too low conveying rates would lead to pipeline blockages and severe pipe vibration. In order to address the foregoing issues, it is necessary to optimize the conveying velocity of fish feed pellets during pneumatic transport. Application of an on-line monitoring technique based on non-invasive passive acoustic measurements and multivariate regression modeling (acoustic chemometrics) was investigated. A partial least squares regression (PLS-R) model was calibrated to predict pellet velocity from 19 m/s to 36 m/s in a pilot scale pneumatic conveying system. The PLS-R prediction model was validated based on independent experimental data (test set validation). The root mean square error of prediction (RMSEP), slope and r 2 of the prediction results were 0.64 m/s, 1.02 and 0.97 respectively. The prediction results obtained shows the applicability of acoustic chemometrics for real-time prediction of the velocities of fish feed pellets during pneumatic conveying.

show abstract

“…There are several validation techniques available [20][21][22]. However, test-set validation has been the recommended validation method because it provides realistic prediction errors and optimal number of PLS-R components [11]. In this regard, over-fitting or under-fitting of the prediction model is avoided.…”

Section: Partial Least Squares Regressionmentioning

confidence: 99%

“…The present work is an attempt to adapt acoustic chemometrics for on-line fluidised bed drying progress monitoring and end-point determination using dedicated test material and PLS-R regression models validated with independent test data [11].…”

Section: Introductionmentioning

confidence: 99%

Acoustic chemometrics for on-line monitoring and end-point determination of fluidised bed drying

2013

View full text Add to dashboard Cite

Principles of Proper Validation: use and abuse of re‐sampling for validation

Cited by 239 publications

References 62 publications

The next evolution of MDE: a seamless integration of machine learning into domain modeling

The next evolution of MDE: a seamless integration of machine learning into domain modeling

Online acoustic chemometric monitoring of fish feed pellet velocity in a pneumatic conveying system

Acoustic chemometrics for on-line monitoring and end-point determination of fluidised bed drying

Contact Info

Product

Resources

About