“…In recent decades, there has been an increasing use of this approach to derive DSLF, namely based on multiple regression models (e.g., Trigo et al, 2010;Zeng et al, 2020) and, more recently, machine learning models (e.g., Cao et al, 2022;Feng, Ye, & Zou, 2020;Feng, Zhang, et al, 2020;Feng et al, 2021;Jiang et al, 2022;Lopes et al, 2022;Shao et al, 2023;Wang et al, 2012Wang et al, , 2020Zhou et al, 2018Zhou et al, , 2019Jung et al, 2019), which are able to account for complex nonlinear interactions that occur between DSLF and respective predictors. The latter can be separated into different branches, including artificial neural network (e.g., Feng, Ye, & Zou, 2020;Feng, Zhang, et al, 2020;Jiang et al, 2022;Wang et al, 2012), random forest (e.g., Shao et al, 2023;Wang et al, 2020), extremely randomized trees (e.g., Cao et al, 2022), gradient boosting regression tree (e.g., Feng et al, 2021), and multivariate adaptive regression splines (MARS, e.g., Jung et al, 2019;Lopes et al, 2022;Zhou et al, 2018Zhou et al, , 2019. For instance, Jiang et al (2022) used a generic algorithmartificial neural network to create a model that combines DSLF data simulated by the Moderate Resolution Atmospheric Transmittance and Radiance Code (MODTRAN-5) with satellite information (radiance at top-ofatmosphere, cloud mask, and water vapor content) from the Moderate-resolution Imaging Spectroradiometer (MODIS) and ERA5 reanalysis (cloud base height and temperature, total cloud cover, 2-m air and dew point temperatures) to produce estimates of DSLF under cloudy-sky conditions.…”