Interval selection: A case‐study‐based approach

Arboretti, Rosa; Ceccato, Riccardo; Pegoraro, Luca; Salmaso, Luigi

doi:10.1002/asmb.2611

Cited by 2 publications

(2 citation statements)

References 26 publications

(64 reference statements)

“…Additionally, the considered nonparametric ranking procedure also proved its efficacy from the point of view of the reliability of results as it has been extensively validated by means of simulation studies 53 and practical applications considering both observational and experimental data in several fields, 54 including medicine, new product development and marketing studies. Recently it has also been included as a core component in a methodology that performs variable selection in near‐infrared spectroscopy using ML models 55 …”

Section: Design and Model Choice (mentioning)

confidence: 99%

“…Recently it has also been included as a core component in a methodology that performs variable selection in near-infrared spectroscopy using ML models. 55 In this paper the permutation tests are applied using the difference in means as test statistics and assuming independent or paired data, depending on the specific situation. Considering $G_i$ and $G_j$, with $i, j = 1, \dots, C$, $i \neq j$, two different groups of data to be compared (e.g., the different experimental designs), the permutation testing framework is employed to test the directional alternative hypothesis $\mathrm{RMSE}_{G_i} > \mathrm{RMSE}_{G_j}$, where RMSE is the prediction error calculated on the test data, i.e.…”

Section: Ranking Procedures (mentioning)

confidence: 99%
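
The excerpt above describes one-sided permutation tests on the difference in means, for independent or paired data. A minimal Python sketch of that idea follows. It is not the citing paper's code: the function names, the number of permutations, the seed, and the assumption that each group is a list of RMSE values from repeated runs are all illustrative choices.

```python
import random
from statistics import mean

TOL = 1e-12  # tolerance so floating-point ties count as "at least as extreme"


def perm_test_independent(rmse_i, rmse_j, n_perm=5000, seed=0):
    """One-sided Monte Carlo permutation p-value for H1: mean(rmse_i) > mean(rmse_j).

    Assumes the two groups are independent samples (hypothetical setup).
    """
    rng = random.Random(seed)
    observed = mean(rmse_i) - mean(rmse_j)
    pooled = list(rmse_i) + list(rmse_j)
    n_i = len(rmse_i)
    count = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # random reassignment of group labels
        stat = mean(pooled[:n_i]) - mean(pooled[n_i:])
        if stat >= observed - TOL:
            count += 1
    # add-one correction keeps the estimated p-value strictly positive
    return (count + 1) / (n_perm + 1)


def perm_test_paired(rmse_i, rmse_j, n_perm=5000, seed=0):
    """One-sided Monte Carlo permutation p-value for paired data.

    Under the null, the sign of each within-pair difference is exchangeable,
    so each difference is flipped with probability 1/2.
    """
    rng = random.Random(seed)
    diffs = [a - b for a, b in zip(rmse_i, rmse_j)]
    observed = mean(diffs)
    count = 0
    for _ in range(n_perm):
        stat = mean([d if rng.random() < 0.5 else -d for d in diffs])
        if stat >= observed - TOL:
            count += 1
    return (count + 1) / (n_perm + 1)


# Made-up RMSE values for two hypothetical groups G_i and G_j
g_i = [2.0, 2.1, 1.9, 2.2, 2.05, 1.95]
g_j = [1.0, 1.1, 0.9, 1.05, 0.95, 1.0]
p_independent = perm_test_independent(g_i, g_j)
p_paired = perm_test_paired(g_i, g_j)
```

A small p-value supports the directional alternative that the first group has the larger mean RMSE. Swapping the arguments tests the opposite direction.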

Design choice and machine learning model performances

Arboretti, Ceccato, Pegoraro et al. 2022

Quality & Reliability Eng

Self Cite

An increasing number of publications present the joint application of design of experiments (DOE) and machine learning (ML) as a methodology to collect and analyze data on a specific industrial phenomenon. However, the literature shows that the choice of the design for data collection and model for data analysis is often not driven by statistical or algorithmic advantages, thus there is a lack of studies which provide guidelines on what designs and ML models to jointly use for data collection and analysis. This article discusses the choice of design in relation to the ML model performances. A study is conducted that considers 12 experimental designs, seven families of predictive models, seven test functions that emulate physical processes, and eight noise settings, both homoscedastic and heteroscedastic. The results of the research can have an immediate impact on the work of practitioners, providing guidelines for practical applications of DOE and ML.

Design choice and machine learning model performances

Arboretti, Ceccato, Pegoraro et al. 2022

Preprint

Self Cite

An increasing number of publications present the joint application of Design of Experiments (DOE) and machine learning (ML) as a methodology to collect and analyze data on a specific industrial phenomenon. However, the literature shows that the choice of the design for data collection and model for data analysis is often driven by incidental factors, rather than by statistical or algorithmic advantages, thus there is a lack of studies which provide guidelines on what designs and ML models to jointly use for data collection and analysis. This is the first time in the literature that a paper discusses the choice of design in relation to the ML model performances. An extensive study is conducted that considers 12 experimental designs, 7 families of predictive models, 7 test functions that emulate physical processes, and 8 noise settings, both homoscedastic and heteroscedastic. The results of the research can have an immediate impact on the work of practitioners, providing guidelines for practical applications of DOE and ML.
