“…These models rely on the assumption that the relationship between surface data and subsurface data can be represented by known function forms, with unknown coefficients determined through regression methods. While these models are straightforward and flexible, their simplicity may not be adequate in handling complex, nonlinear problems (L. Meng et al., 2021; Tian et al., 2022). Methods based on machine learning have recently become the most widely used class of data‐driven models, such as artificial neural network (Ali et al., 2004; Bao et al., 2019; Su et al., 2020), self‐organizing maps (Wu et al., 2012; C. Chen et al., 2018), support vector machine (Su et al., 2015, 2018), clustering neural network (W. Lu et al., 2019), and random forest (Su et al., 2018).…”