Machine learning models have attracted much research attention for groundwater potential mapping. However, the accuracy of models for groundwater potential mapping is significantly influenced by sample size and this is still a challenge. This study evaluates the influence of sample size on the accuracy of different individual and hybrid models, adaptive neuro-fuzzy inference system (ANFIS), ANFISimperial competitive algorithm (ANFIS-ICA), alternating decision tree (ADT), and random forest (RF) to model groundwater potential, considering the number of springs from 177 to 714. A well-documented inventory of springs, as a natural representative of groundwater potential, was used to designate four sample data sets: 100% (D1), 75% (D2), 50% (D3), and 25% (D4) of the entire springs inventory. Each data set was randomly split into two groups of 30% (for training) and 70% (for validation). Fifteen diverse geo-environmental factors were employed as independent variables. The area under the operating 2 receiver characteristic curve (AUROC) and the true skill statistic (TSS) as two cutoff-independent and cutoff-dependent performance metrics were used to assess the performance of models. Results showed that the sample size influenced the performance of four machine learning algorithms, but RF had a lower sensitivity to the reduction of sample size. In addition, validation results revealed that RF (AUROC=90.74-96.32%, TSS=0.79-0.85) had the best performance based on all four sample data sets, followed by ANFIS-ICA (AUROC=81.23-91.55%, TSS=0.74-0.81), ADT (AUROC=79.29-88.46%, TSS=0.59-0.74), and ANFIS (AUROC=73.11-88.43%, TSS=0.59-0.74). Further, the relative slope position, lithology, and distance from faults were the main spring-affecting factors contributing to groundwater potential modelling. This study can provide useful guidelines and valuable reference for selecting machine learning models when a complete spring inventory in a watershed is unavailable.
Check dams are widely used watershed management measures for reducing flood peak discharge and sediment transport, and increasing lag time and groundwater recharge throughout the world. However, identifying the best suitable sites for check dams within the stream networks of various watersheds remains challenging. This study aimed to develop an open-source software with user-friendly interface for screening the stream network possibilities and identifying and guiding the selection of suitable sites for check dams within watersheds. In this developed site selection software (SSS), multi-criteria decision analysis (MCDA) was integrated into geographic information systems (GIS), which allowed for numerous spatial data of the multiple criteria to be relatively simply and visually processed. Different geomorphometric and topo-hydrological factors were considered and accounted for to enhance the SSS identification of the best locations for check dams. The factors included topographic wetness index (TWI), terrain ruggedness index (TRI), topographic position index (TPI), sediment transport index (STI), stream power index (SPI), slope, drainage density (DD), and stream order (SO). The site identification performance of the SSS was assessed using the receiver operating characteristic (ROC) curve method, with results for the case study example of the Poldokhtar watershed in Iran showing excellent performance and identifying 327 potential sites for efficient check dam construction in this watershed. The SSS tool is not site-specific but is rather general, adaptive, and comprehensive, such that it can and should be further applied and tested across different watersheds and parts of the world.
Gully erosion has become one of the major environmental issues, due to the severity of its impact in many parts of the world. Gully erosion directly and indirectly affects agriculture and infrastructural development. The Golestan Dam basin, where soil erosion and degradation are very severe problems, was selected as the study area. This research maps gully erosion susceptibility (GES) by integrating four models: maximum entropy (MaxEnt), artificial neural network (ANN), support vector machine (SVM), and general linear model (GLM). Of 1042 gully locations, 729 (70%) and 313 (30%) gully locations were used for modeling and validation purposes, respectively. Fourteen effective gully erosion conditioning factors (GECFs) were selected for spatial gully erosion modeling. Tolerance and variance inflation factors (VIFs) were used to examine the collinearity among the GECFs. The random forest (RF) model was used to assess factors’ effectiveness and significance in gully erosion modeling. An ensemble of techniques can provide more accurate results than can single, standalone models. Therefore, we compared two-, three-, and four-model ensembles (ANN-SVM, GLM-ANN, GLM-MaxEnt, GLM-SVM, MaxEnt-ANN, MaxEnt-SVM, ANN-SVM-GLM, GLM-MaxEnt-ANN, GLM-MaxEnt-SVM, MaxEnt-ANN-SVM and GLM-ANN-SVM-MaxEnt) for GES modeling. The susceptibility zones of the GESMs were classified as very-low, low, medium, high, and very-high using Jenks’ natural break classification method (NBM). Subsequently, the receiver operating characteristics (ROC) curve and the seed cell area index (SCAI) methods measured the reliability of the models. The success rate curve (SRC) and predication rate curve (PRC) and their area under the curve (AUC) values were obtained from the GES maps. The results show that the ANN model combined with two and three models are more accurate than the other combinations, but the ANN-SVM model had the highest accuracy. The rank of the others from best to worst accuracy is GLM, MaxEnt, SVM, GLM-ANN, GLM-MaxEnt, GLM-SVM, MaxEnt-ANN, MaxEnt-SVM, GLM-ANN-SVM-MaxEnt, GLM-MaxEnt-ANN, GLM-MaxEnt-SVM and MaxEnt-ANN-SVM. The resulting gully erosion susceptibility models (GESMs) are efficient and powerful and could be used to improve soil and water conservation and management.
Floods are the most common natural disaster globally and lead to severe damage, especially in urban environments. This study evaluated the efficiency of a self-organizing map neural network (SOMN) algorithm for urban flood hazard mapping in the case of Amol city, Iran. First, a flood inventory database was prepared using field survey data covering 118 flooded points. A 70:30 data ratio was applied for training and validation purposes. Six factors (elevation, slope percent, distance from river, distance from channel, curve number, and precipitation) were selected as predictor variables. After building the model, the odds ratio skill score (ORSS), efficiency (E), true skill statistic (TSS), and the area under the receiver operating characteristic curve (AUC-ROC) were used as evaluation metrics to scrutinize the goodness-of-fit and predictive performance of the model. The results indicated that the SOMN model performed excellently in modeling flood hazard in both the training (AUC = 0.946, E = 0.849, TSS = 0.716, ORSS = 0.954) and validation (AUC = 0.924, E = 0.857, TSS = 0.714, ORSS = 0.945) steps. The model identified around 23% of the Amol city area as being in high or very high flood risk classes that need to be carefully managed. Overall, the results demonstrate that the SOMN model can be used for flood hazard mapping in urban environments and can provide valuable insights about flood risk management.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.