“…Machine learning (ML) methods have gradually been applied to the field of chemometrics and have been shown to sometimes outperform linear methods on various regression and classification tasks, employing artificial neural networks (ANNs) ( Long et al, 1990 ; Santos et al, 2005 ), Gaussian process regression (GPR) ( Cui and Fearn, 2017 ; Malek et al, 2018 ), support vector machines (SVMs) ( Cui and Fearn, 2017 ), k-nearest neighbor (kNN) ( Wang et al, 2023 ) or convolutional neural networks (CNNs) ( Acquarelli et al, 2017 ; Bjerrum et al, 2017 ; Cui and Fearn, 2018 ; Blazhko et al, 2021 ; Passos and Mishra, 2021 ; Rolinger et al, 2021 ; Wang et al, 2023 ). Next to the increased accuracy, ML models were found to reduce the amount of preprocessing needed prior to spectral modeling ( Cui and Fearn, 2018 ; Rolinger et al, 2021 ; Tulsyan et al, 2021 ; Schiemer et al, 2023 ) and increase robustness against variability in the data ( Cui and Fearn, 2018 ; Yuanyuan and Zhibin, 2018 ). Major obstacles to successfully deploy these models for process monitoring in biopharmaceutical operations are the required amount of data for model calibration ( Tulsyan et al, 2019 ; Banner et al, 2021 ), the high number of hyperparameters ( Passos and Mishra, 2022 ) as well as the necessity for universally applicable diagnostic tools to reduce the black-box character of these models ( Burkart and Huber, 2021 ).…”