Abstract:The capability of machine learning/ML algorithms to analyze the effect of human and environmental factors and variables in controlling soil nutrients has been profoundly studied over the last decades. Unfortunately, ML utilization to estimate macronutrients and their governing factors in cultivated tropical peat soil are extremely scarce. In this study, we trained regression tree/RT, ML-based pedotransfer models to predict total N, P, and K in peat soils based on oil palm/OP and OP+bush datasets. Our results i… Show more
“…& Chambers (2021), the RT algorithm is easy to implement and captures non-linear relationships among covariates. Unfortunately, the RT-based PfF model's low predictive performance may arise from smooth, gradual changes in the continuous covariates, as can be observed in Pulunggono et al (2022a) results. Furthermore, this present study hypothesized that using more advanced ML algorithms such as ensemble trees (e.g., random forest/RF, gradient boosting machine/GBM, and extreme gradient boosting/XGB), which are considered more complex than RT algorithm, thereby can handle previous constraints and may improve N predictive performance.…”
Section: Introductionmentioning
confidence: 93%
“…A single block (around 30 hectares) is usually established as the smallest sampling unit for OPP management. Nevertheless, previous research highlighted the spatiotemporal variability of peat nutrients at sub-block scales, along with the gradients of the distance from the oil palm tree, canal and mineral soil border, peat thickness, sampling depth, oil palm age, season, and land use (Pulunggono et al 2016;Pulunggono 2019;Pulunggono 2020;Pulunggono 2021;Pulunggono 2022a;Pulunggono 2022b). Soil samples must be collected enormously to satisfy the sampling design for this detailed approach, especially for large-scale OPP.…”
Section: Introductionmentioning
confidence: 99%
“…Currently, ML-based PtFs are widely used in predicting soil properties, such as soil texture , soil temperature (Abimbola et al 2021), aggregate stability (Boushilim et al 2021, soil organic C/SOC (Sakhaee et al 2022), cation exchange capacity (Ma et al 2021), soil reaction (Baltensweiler et al 2021), available P (Kaya et al 2022) and other soil macro-and micronutrients (Shi et al 2022;Sihag et al 2019). Eventhough the research for cultivated tropical peatlands was scarce (e.g., Pulunggono et al 2022a), PtFs employed with ML algorithms have been successfully implemented in estimating N in a maize field (Mashaba-Munghemezulu et al 2021), rangelands (Parsaie et al 2021), paddy soils (Xu et al 2021) and other various land uses (Forkuor et al 2017;Hounkpatin et al 2022;Liu et al 2022) which is entirely located in mineral soils.…”
Section: Introductionmentioning
confidence: 99%
“…In a similar location to this study, Pulunggono et al (2022a) initially tried regression tree/RTbased PtF models for OP-cultivated tropical peat total N, P, and K contents. However, their report only described model performances and interpretation of a single type of tree-based ML algorithm and its comparison to multiple linear regression/MLR.…”
Section: Introductionmentioning
confidence: 99%
“…Selain itu, mengekstraksi informasi yang berarti dari model 'kotak hitam' ini sangat penting, terutama terkait kompleksitas algoritmik dan sifat hubungan kovariat tanah yang tidak linier. Penelitian ini menggunakan dataset Pulunggono (2022a) dan metode bootstrapping, untuk (1) mengembangkan dan mengevaluasi tujuh model PtF, termasuk model linear umum (GLM) dan regresi machine learning (ML) untuk mengestimasi total nitrogen (N) pada gambut tropis yang telah didrainase dan dibudidayakan untuk kelapa sawit (OP) di Riau, Indonesia, serta (2) menjelaskan fungsi model dengan menggabungkan Shapley Additive Explanation (SHAP), sebuah perangkat yang berasal dari teori permainan koalisi. Studi ini menunjukkan kinerja prediksi yang unggul dari PtF berbasis ML dalam mengestimasi total N dibandingkan dengan algoritma GLM.…”
Currently, there is a growing interest among research communities in the development of statistical learning-based pedotransfer functions/PtFs to predict mineral soil nutrients; however, similar studies in peatlands are relatively rare. Moreover, extracting meaningful information from these ‘black-box’ models is crucial, particularly concerning their algorithmic complexity and the non-linear nature of the soil covariate interrelationships. This study employed the Pulunggono (2022a) dataset and the bootstrapping method, to (1) develop and evaluate seven PtF models, including both general linear models (GLM) and machine learning (ML) regressors for estimating total nitrogen (N) in tropical peat that has been drained and cultivated for oil palm (OP) in Riau, Indonesia and (2) explaining model functioning by incorporating Shapley Additive Explanation (SHAP), a tool derived from coalitional game theory. This study demonstrated the superior predictive performance of ML-based PtFs in estimating total N compared to GLM algorithms. The top-performing algorithms for PtF models were identified as GBM, XGB, and Cubist. The SHAP method revealed that sampling depth and organic C were consistently identified as the most important covariates across all models, irrespective of their algorithmic capabilities. Additionally, ML algorithms identified the total Fe, pH, and bulk density (BD) as significant covariates. Local explanations based on Shapley values indicated that the behavior of PtF-based algorithms diverged from their global explanations. This study emphasized the critical role of ML algorithms and game theory in accurately predicting total N in peatlands subjected to drainage and cultivation for OP and explaining their model behavior in relation to soil biogeochemical processes.
“…& Chambers (2021), the RT algorithm is easy to implement and captures non-linear relationships among covariates. Unfortunately, the RT-based PfF model's low predictive performance may arise from smooth, gradual changes in the continuous covariates, as can be observed in Pulunggono et al (2022a) results. Furthermore, this present study hypothesized that using more advanced ML algorithms such as ensemble trees (e.g., random forest/RF, gradient boosting machine/GBM, and extreme gradient boosting/XGB), which are considered more complex than RT algorithm, thereby can handle previous constraints and may improve N predictive performance.…”
Section: Introductionmentioning
confidence: 93%
“…A single block (around 30 hectares) is usually established as the smallest sampling unit for OPP management. Nevertheless, previous research highlighted the spatiotemporal variability of peat nutrients at sub-block scales, along with the gradients of the distance from the oil palm tree, canal and mineral soil border, peat thickness, sampling depth, oil palm age, season, and land use (Pulunggono et al 2016;Pulunggono 2019;Pulunggono 2020;Pulunggono 2021;Pulunggono 2022a;Pulunggono 2022b). Soil samples must be collected enormously to satisfy the sampling design for this detailed approach, especially for large-scale OPP.…”
Section: Introductionmentioning
confidence: 99%
“…Currently, ML-based PtFs are widely used in predicting soil properties, such as soil texture , soil temperature (Abimbola et al 2021), aggregate stability (Boushilim et al 2021, soil organic C/SOC (Sakhaee et al 2022), cation exchange capacity (Ma et al 2021), soil reaction (Baltensweiler et al 2021), available P (Kaya et al 2022) and other soil macro-and micronutrients (Shi et al 2022;Sihag et al 2019). Eventhough the research for cultivated tropical peatlands was scarce (e.g., Pulunggono et al 2022a), PtFs employed with ML algorithms have been successfully implemented in estimating N in a maize field (Mashaba-Munghemezulu et al 2021), rangelands (Parsaie et al 2021), paddy soils (Xu et al 2021) and other various land uses (Forkuor et al 2017;Hounkpatin et al 2022;Liu et al 2022) which is entirely located in mineral soils.…”
Section: Introductionmentioning
confidence: 99%
“…In a similar location to this study, Pulunggono et al (2022a) initially tried regression tree/RTbased PtF models for OP-cultivated tropical peat total N, P, and K contents. However, their report only described model performances and interpretation of a single type of tree-based ML algorithm and its comparison to multiple linear regression/MLR.…”
Section: Introductionmentioning
confidence: 99%
“…Selain itu, mengekstraksi informasi yang berarti dari model 'kotak hitam' ini sangat penting, terutama terkait kompleksitas algoritmik dan sifat hubungan kovariat tanah yang tidak linier. Penelitian ini menggunakan dataset Pulunggono (2022a) dan metode bootstrapping, untuk (1) mengembangkan dan mengevaluasi tujuh model PtF, termasuk model linear umum (GLM) dan regresi machine learning (ML) untuk mengestimasi total nitrogen (N) pada gambut tropis yang telah didrainase dan dibudidayakan untuk kelapa sawit (OP) di Riau, Indonesia, serta (2) menjelaskan fungsi model dengan menggabungkan Shapley Additive Explanation (SHAP), sebuah perangkat yang berasal dari teori permainan koalisi. Studi ini menunjukkan kinerja prediksi yang unggul dari PtF berbasis ML dalam mengestimasi total N dibandingkan dengan algoritma GLM.…”
Currently, there is a growing interest among research communities in the development of statistical learning-based pedotransfer functions/PtFs to predict mineral soil nutrients; however, similar studies in peatlands are relatively rare. Moreover, extracting meaningful information from these ‘black-box’ models is crucial, particularly concerning their algorithmic complexity and the non-linear nature of the soil covariate interrelationships. This study employed the Pulunggono (2022a) dataset and the bootstrapping method, to (1) develop and evaluate seven PtF models, including both general linear models (GLM) and machine learning (ML) regressors for estimating total nitrogen (N) in tropical peat that has been drained and cultivated for oil palm (OP) in Riau, Indonesia and (2) explaining model functioning by incorporating Shapley Additive Explanation (SHAP), a tool derived from coalitional game theory. This study demonstrated the superior predictive performance of ML-based PtFs in estimating total N compared to GLM algorithms. The top-performing algorithms for PtF models were identified as GBM, XGB, and Cubist. The SHAP method revealed that sampling depth and organic C were consistently identified as the most important covariates across all models, irrespective of their algorithmic capabilities. Additionally, ML algorithms identified the total Fe, pH, and bulk density (BD) as significant covariates. Local explanations based on Shapley values indicated that the behavior of PtF-based algorithms diverged from their global explanations. This study emphasized the critical role of ML algorithms and game theory in accurately predicting total N in peatlands subjected to drainage and cultivation for OP and explaining their model behavior in relation to soil biogeochemical processes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.