2023
DOI: 10.1016/j.fcr.2023.109063
|View full text |Cite
|
Sign up to set email alerts
|

Big data, small explanatory and predictive power: Lessons from random forest modeling of on-farm yield variability and implications for data-driven agronomy

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 47 publications
0
4
0
Order By: Relevance
“…However, these data are generally aggregated at large administrative scales, such counties or countries. Experimental or on-farm data are also a valuable source of information [46], because they are more precisely located than the gridded-cell data and provide information regarding farm management However, these data cover restricted spatial area and time period, and are thus not suitable for modeling yields at the continental or global scale. The reanalysis data used in this study combines the advantages to span large scale in space and time while covering a large diversity of yield and climate conditions, as shown in table 1.…”
Section: Discussionmentioning
confidence: 99%
“…However, these data are generally aggregated at large administrative scales, such counties or countries. Experimental or on-farm data are also a valuable source of information [46], because they are more precisely located than the gridded-cell data and provide information regarding farm management However, these data cover restricted spatial area and time period, and are thus not suitable for modeling yields at the continental or global scale. The reanalysis data used in this study combines the advantages to span large scale in space and time while covering a large diversity of yield and climate conditions, as shown in table 1.…”
Section: Discussionmentioning
confidence: 99%
“…RF is a supervised ensemble-learning algorithm that improves regression combining multiple decision trees to enhance the accuracy of the model and its generalization. Due to its accuracy and ability in finding non-parametric relationships, RF is used in the fields of remote sensing and agronomy for prediction and modeling ( Belgiu and Drăgu, 2016 ; Nayak et al., 2022 ; Silva et al., 2023 ). In this work, RF model implemented in the “ranger” package in RStudio was used ( Wright and Ziegler, 2017 ); to avoid the overfitting of the model, the 10-fold-cross-validation was applied, by using the trainControl function of the package “caret” ( Kuhn, 2008 ).…”
Section: Methodsmentioning
confidence: 99%
“…Temporal variability was another important constituent of yield variability explaining 10 -55% of the yield variation in all datasets, highlighting the importance of weather conditions on yield at all scales (Fig. 2.7; see also Ray et al, 2015;Silva et al, 2023). Our method allowed us to compare temporal yield variability with spatial yield variability revealing that temporal yield variability was particularly important for the two large commercial farms, for which most fields were distributed across a relatively small area.…”
Section: Figure 22 Long-term Median Scaled Yield Per Farm (In T Ha -1...mentioning
confidence: 83%
“…Finally, we compared yield variability across spatial scales and years with a random effects model fitted with year and spatial scale as random effects (see also Silva et al, 2023). Random effects models are a particular type of linear mixed models that consider only random effects.…”
Section: Introductionmentioning
confidence: 99%