Differentiable modeling to unify machine learning and physical models and advance Geosciences

Appling, Alison P.; Gentine, Pierre; Bandai, Toshiyuki; Gupta, Hoshin; Tartakovsky, Alexandre M.; Baity‐Jesi, Marco; Fenicia, Fabrizio; Kifer, Daniel; Liu, Xiaofeng; Li, Li; Feng, Dan; Ren, Wei; Zheng, Yi; Harman, Ciaran J.; Clark, Martyn P.; Farthing, Matthew W.

doi:10.5194/egusphere-egu23-15968

Cited by 21 publications

(26 citation statements)

References 0 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Even though that model does not simulate the physical quantity of soil moisture, it could be modified to have a module that does. However, to obtain suitable parameters on the global scale and improve the physical processes, we think adding differentiable programming to the model will give it the adaptive capability to learn from big data (Feng et al, 2022;Shen et al, 2023;Aboelyazeed et al, 2022;Bindas et al, 2022). It is possible that such a model may generalize better than LSTM over long distances due to the imposed physical constraints.…”

Section: Further Discussionmentioning

confidence: 99%

“…Typically, for many hydrologic applications (Fang et al, 2022;Liu et al, 2022a;Rahmani et al, 2021a), a spatial test is a tougher test than a temporal test for fully data-driven models, showing the strong impacts of spatial heterogeneity. This could either mean the inputs of the model do not completely describe the problem or that there are not enough sites in space with different combinations of input attributes for the model to fully resolve their impacts.…”

Section: Further Discussionmentioning

confidence: 99%

“…Component (iv) appears as a large difference between training and testing metrics. It is worthwhile to note that LSTM models typically (although not always) perform better for each site when given data from more numerous or more diverse sites due to a "data synergy" effect (Fang et al, 2022).…”

Section: Error Types and Temporal Testsmentioning

confidence: 99%

“…Due to LSTM's strong ability to fit to data, it can serve as a probe for process complexity (Liu et al, 2022a;Feng et al, 2022Feng et al, , 2020Tsai et al, 2021): those sites that LSTM cannot adequately capture may contain complicated processes that are not well described by the inputs. The factorial importance analysis indicates that slope aspect, average soil moisture, and surface solar radiation downwards are the top three factors that influence the multitask LSTM model's R in the temporal test (Fig.…”

Section: Factorial Influences On Model Performancementioning

confidence: 99%

See 3 more Smart Citations

Evaluating a global soil moisture dataset from a multitask model (GSM3 v1.0) with potential applications for crop threats

et al. 2023

Self Cite

View full text Add to dashboard Cite

Abstract. Climate change threatens our ability to grow food for an ever-increasing population. There is a need for high-quality soil moisture predictions in under-monitored regions like Africa. However, it is unclear if soil moisture processes are globally similar enough to allow our models trained on available in situ data to maintain accuracy in unmonitored regions. We present a multitask long short-term memory (LSTM) model that learns simultaneously from global satellite-based data and in situ soil moisture data. This model is evaluated in both random spatial holdout mode and continental holdout mode (trained on some continents, tested on a different one). The model compared favorably to current land surface models, satellite products, and a candidate machine learning model, reaching a global median correlation of 0.792 for the random spatial holdout test. It behaved surprisingly well in Africa and Australia, showing high correlation even when we excluded their sites from the training set, but it performed relatively poorly in Alaska where rapid changes are occurring. In all but one continent (Asia), the multitask model in the worst-case scenario test performed better than the soil moisture active passive (SMAP) 9 km product. Factorial analysis has shown that the LSTM model's accuracy varies with terrain aspect, resulting in lower performance for dry and south-facing slopes or wet and north-facing slopes. This knowledge helps us apply the model while understanding its limitations. This model is being integrated into an operational agricultural assistance application which currently provides information to 13 million African farmers.

show abstract

Section: Further Discussionmentioning

confidence: 99%

Section: Further Discussionmentioning

confidence: 99%

Section: Error Types and Temporal Testsmentioning

confidence: 99%

Section: Factorial Influences On Model Performancementioning

confidence: 99%

See 2 more Smart Citations

Evaluating a global soil moisture dataset from a multitask model (GSM3 v1.0) with potential applications for crop threats

et al. 2023

Self Cite

View full text Add to dashboard Cite

show abstract

“…Additional custom loss terms would be possible and could make use of other data, such as observed groundwater temperatures and levels, if a differentiable model were used that represented those intermediate variables within process‐based equations (Shen et al., 2023). In the DRB, there were only two sites with groundwater wells with daily water temperature observations (all occurring within the test partition) and only 24 wells with more than 20 discrete groundwater temperature observations.…”

Section: Discussionmentioning

confidence: 99%

Train, Inform, Borrow, or Combine? Approaches to Process‐Guided Deep Learning for Groundwater‐Influenced Stream Temperature Prediction

Barclay,

Topp,

Koenig

et al. 2023

Water Resources Research

View full text Add to dashboard Cite

Although groundwater discharge is a critical stream temperature control process, it is not explicitly represented in many stream temperature models, an omission that may reduce predictive accuracy, hinder management of aquatic habitat, and decrease user confidence. We assessed the performance of a previously‐described process‐guided deep learning model of stream temperature in the Delaware River Basin (USA). We found lower accuracy (root mean square error [RMSE] of 1.71 versus 1.35°C) and stronger seasonal bias (absolute mean monthly bias of 1.06 vs. 0.68°C) for reaches primarily influenced by deep groundwater as compared to atmospheric conditions. We then tested four approaches for improving groundwater process representation: (a) a custom loss function leveraging the unique patterns of air and water temperature coupling characteristic of different temperature drivers, (b) inclusion of additional groundwater‐relevant catchment attributes, (c) incorporation of additional process model outputs, and (d) a composite model. The custom loss function and the additional attributes significantly improved the predictive accuracy in groundwater‐dominated reaches (RMSE of 1.37 and 1.26°C) and reduced the seasonal bias (absolute mean monthly bias of 0.44 and 0.48°C), but neither approach could identify holdout groundwater reaches. Variable importance analysis indicates the custom loss function nudges the model to use the existing inputs more efficiently, whereas with the added features the model relies on a broader suite of inputs. This analysis is a substantial step toward more accurately representing groundwater discharge processes in stream temperature models and will improve predictive accuracy and inform habitat management.

show abstract

Predicting the growth trajectory and yield of greenhouse strawberries based on knowledge-guided computer vision

Yang,

Liu,

Zhou

et al. 2024

Computers and Electronics in Agriculture

View full text Add to dashboard Cite

Differentiable modeling to unify machine learning and physical models and advance Geosciences

Cited by 21 publications

References 0 publications

Evaluating a global soil moisture dataset from a multitask model (GSM3 v1.0) with potential applications for crop threats

Evaluating a global soil moisture dataset from a multitask model (GSM3 v1.0) with potential applications for crop threats

Train, Inform, Borrow, or Combine? Approaches to Process‐Guided Deep Learning for Groundwater‐Influenced Stream Temperature Prediction

Predicting the growth trajectory and yield of greenhouse strawberries based on knowledge-guided computer vision

Contact Info

Product

Resources

About