2021
DOI: 10.1007/s11081-021-09608-0
|View full text |Cite
|
Sign up to set email alerts
|

Obey validity limits of data-driven models through topological data analysis and one-class classification

Abstract: Data-driven models are becoming increasingly popular in engineering, on their own or in combination with mechanistic models. Commonly, the trained models are subsequently used in model-based optimization of design and/or operation of processes. Thus, it is critical to ensure that data-driven models are not evaluated outside their validity domain during process optimization. We propose a method to learn this validity domain and encode it as constraints in process optimization. We first perform a topological dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 33 publications
(26 citation statements)
references
References 73 publications
0
26
0
Order By: Relevance
“…Thus, the development of models describing the validity domain of data-driven models are desired [77,99,118]. However, when training datadriven models on industry data, defining and modeling the validity domain is a major issue [99]. Similar issues can arise when applying GNNs to molecular property prediction [119,120] or when applying RL to control processes [44,121].…”
Section: Safety and Trust In Machine Learning Applicationsmentioning
confidence: 99%
See 2 more Smart Citations
“…Thus, the development of models describing the validity domain of data-driven models are desired [77,99,118]. However, when training datadriven models on industry data, defining and modeling the validity domain is a major issue [99]. Similar issues can arise when applying GNNs to molecular property prediction [119,120] or when applying RL to control processes [44,121].…”
Section: Safety and Trust In Machine Learning Applicationsmentioning
confidence: 99%
“…Considering increasingly high-dimensional data sets, manifold learning has become ever more important. Among others, Aimin et al [99,100] use manifold learning as the basis for soft-sensor developments for a fermentation process and a debutanizer column. To tackle highly different data frequencies or data, which is erroneous or incomplete, [20] suggest using semi-supervised learning.…”
Section: Heterogeneity Of Datamentioning
confidence: 99%
See 1 more Smart Citation
“…As, in the general case, the limit functions ρ (γ) min , ρ (γ) max are multivariate functions, multivariate regression methods, e.g., hinging hyperplanes (Breiman, 1993;Adeniran and Ferik, 2017;Kämper et al, 2021), convex region surrogates Schweidtmann et al, 2021), or artificial neural networks with ReLU activation functions (Grimstad and Andersson, 2019;Lueg et al, 2021), can be used to find piece-wise affine approximations.…”
Section: Pwa Approximation Of Ramping Limitsmentioning
confidence: 99%
“…The QuantityTypes allow to compare different quantities and they are defined in the definition 3.2 of the ISO 80000-1 standard [36]. Furthermore, well-structured data lays the ground for better results in optimization of data, which can be also shown in the recent work of Schweidtmann et al [37].…”
Section: Roadmapmentioning
confidence: 99%