Soil salinization is one of the main causes of global desertification and soil degradation. Although previous studies have investigated the hyperspectral inversion of soil salinity using machine learning, only a few have been based on soil types. Moreover, agricultural fields can be improved based on the accurate estimation of the soil salinity, according to the soil type. We collected field data relating to six salinized soils, Haplic Solonchaks (HSK), Stagnic Solonchaks (SSK), Calcic Sonlonchaks (CSK), Fluvic Solonchaks (FSK), Haplic Sonlontzs (HSN), and Takyr Solonetzs (TSN), in the Hetao Plain of the upper reaches of the Yellow River, and measured the in situ hyperspectral, pH, and electrical conductivity (EC) values of a total of 231 soil samples. The two-dimensional spectral index, topographic factors, climate factors, and soil texture were considered. Several models were used for the inversion of the saline soil types: partial least squares regression (PLSR), random forest (RF), extremely randomized trees (ERT), and ridge regression (RR). The spectral curves of the six salinized soil types were similar, but their reflectance sizes were different. The degree of salinization did not change according to the spectral reflectance of the soil types, and the related properties were inconsistent. The Pearson’s correlation coefficient (PCC) between the two-dimensional spectral index and the EC was much greater than that between the reflectance and EC in the original band. In the two-dimensional index, the PCC of the HSK-NDI was the largest (0.97), whereas in the original band, the PCC of the SSK400 nm was the largest (0.70). The two-dimensional spectral index (NDI, RI, and DI) and the characteristic bands were the most selected variables in the six salinized soil types, based on the variable projection importance analysis (VIP). The best inversion model for the HSK and FSK was the RF, whereas the best inversion model for the CSK, SSK, HSN, and TSN was the ERT, and the CSK-ERT had the best performance (R2 = 0.99, RMSE = 0.18, and RPIQ = 6.38). This study provides a reference for distinguishing various salinization types using hyperspectral reflectance and provides a foundation for the accurate monitoring of salinized soil via multispectral remote sensing.
An accurate estimation of soil electrical conductivity (EC) using hyperspectral techniques is of great significance for understanding the spatial distribution of solutes and soil salinization. Although spectral transformation has been widely used in data pre-processing, the performance of different pre-processing techniques (or combination methods) on different models of the same data set is still ambiguous. Moreover, extremely randomized trees (ERT) and light gradient boosting machine (LightGBM) models are new learning algorithms with good generalization performance (soil moisture and above-ground biomass), but are less studied in estimating soil salinity in the visible and near-infrared spectra. In this study, 130 soil EC data, soil measured hyperspectral data, topographic factors, conventional salinity indices such as Salinity Index 1, and two-band (2D) salinity indices such as ratio indices, were introduced. The five spectral pre-processing methods of standard normal variate (SNV), standard normal variate and detrend (SNV-DT), inverse (1/OR) (OR is original spectrum), inverse-log (Log(1/OR) and fractional order derivative (FOD) (range 0–2, with intervals of 0.25) were performed. A gradient boosting machine (GBM) was used to select sensitive spectral parameters. Models (extreme gradient boosting (XGBoost), LightGBM, random forest (RF), ERT, classification and regression tree (CART), and ridge regression (RR)) were used for inversion soil EC and model validation. The results reveal that the two-dimensional correlation coefficient highlighted EC more effectively than the one-dimensional. Under SNV and the second order derivative, the two-dimensional correlation coefficient increased by 0.286 and 0.258 compared to the one-dimension, respectively. The 13 characteristic factors of slope, NDI, SI-T, RI, profile curvature, DOA, plane curvature, SI (conventional), elevation, Int2, aspect, S1 and TWI provided 90% of the cumulative importance for EC using GBM. Among the six machine models, the ERT model performed the best for simulation (R2 = 0.98) and validation (R2 = 0.96). The ERT model showed the best performance among the EC estimation models from the reference data. The kriging map based on the ERT simulation showed a close relationship with the measured data. Our study selected the effective pre-processing methods (SNV and the 2 order derivative) using one- and two-dimensional correlation, 13 important factors and the ERT model for EC hyperspectral inversion. This provides a theoretical support for the quantitative monitoring of soil salinization on a larger scale using remote sensing techniques.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.