Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting

Tao, Hai; Al-Sulttani, Ali Omran; Ameen, Ameen Mohammed Salih; Ali, Zainab Hasan; Al‐Ansari, Nadhir; Salih, Sinan Q.; Mostafa, Reham R.

doi:10.1155/2020/8844367

Cited by 31 publications

(16 citation statements)

References 75 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Although success has been attained in the monthly evaporation using the GBM model during the training phase, it is very essential to evaluate the proposed model with testing dataset. As is well known, the training results may provide misleading assessment because the model is trained using known input and third corresponding targets [65]. Besides, the testing phase is very crucial in assessing the quality of the predictive models and, hence, the models' abilities would be assessed very well in terms of generalization and avoiding overfitting [66].…”

Section: Resultsmentioning

confidence: 99%

Evaporation Rate Prediction Using Advanced Machine Learning Models: A Comparative Study

Sudani

Salem

2022

Advances in Meteorology

View full text Add to dashboard Cite

Accurately estimating the amount of evaporation loss is necessary for scheduling and calculating irrigation water requirements. In this study, four machine learning (ML) modeling approaches, extreme learning machine (ELM), gradient boosting machine (GBM), quantile random forest (QRF), and Gaussian process regression (GPR), have been developed to estimate the monthly evaporation loss over two stations located in Iraq. Monthly climatical parameters have been used as an input variable for simulating the evaporation rate. Several statistical measures (e.g., mean absolute error (MAE), correlation coefficient (R), mean absolute percentage error (MAPE), and modified index of agreement (Md)), as well as graphical inspection, were used to compare the performances of the applied models. The results showed that the GBM model has much better performance in predicting monthly evaporation over two stations compared to other applied models. For the first case study which was in Diyala, the results showed a prediction enhancement in terms of MAE and RMSE by 7.17%, 21.01%; 16.51%, 15.74%; and 23.14%, 26.64%; using GBM compared to ELM, GPR, and QRF, respectively. However, for the second case study (in Erbil), the prediction enhancement was improved in terms of reduction of MAE and RMSE by 10.88%, 9.24%; 15.24%, 5%; and 16.06%, 15.76%; respectively, compared to ELM, GPR, and QRF models. The results of the proposed GMBM model can therefore assist local stakeholders in the management of water resources.

show abstract

Section: Resultsmentioning

confidence: 99%

Evaporation Rate Prediction Using Advanced Machine Learning Models: A Comparative Study

Sudani

Salem

2022

Advances in Meteorology

View full text Add to dashboard Cite

show abstract

“…So, in this study, the initial point of view to select of the decomposition level was taken from L but since many seasonal characteristics may be embedded in hydrological signals, 2-8 resolution levels (L ± x) for the daily and 2-5 resolution levels (L ± x) for the monthly modeling were examined via the proposed WANN and WES models which, respectively, denote to the 2 2 -day mode and 2 3 -day mode (which is nearly weekly mode), 2 4 -day mode (which is nearly semimonthly mode), 2 5 -day mode (which is nearly monthly mode), 2 6 -day mode, 2 7 -day mode (which is nearly semiyearly mode), and 2 8 -day mode (which is nearly yearly mode) in the daily scale and 2 2 -month mode, 2 3 -month, 2 4 month, and 2 5 -month mode in the monthly scale. Besides, the Daubechies 4 wavelet (db4) that has been frequently assessed in hydrological modeling was considered as the mother wavelet in this study.…”

Section: Resultsmentioning

confidence: 99%

“…Forecasting streamflow has been investigated by several researchers [1][2][3][4][5] as it is a fundamental subject in hydrological modeling. As a result, many researchers are developing new models to improve streamflow modeling.…”

Section: Introductionmentioning

confidence: 99%

Using Hybrid Wavelet-Exponential Smoothing Approach for Streamflow Modeling

et al. 2021

View full text Add to dashboard Cite

Considering the three intrinsic components (of autoregressive, seasonality, and error) of streamflow time series, the overall performance of the streamflow modeling tool is associated with the correct estimation of these components. In this study, a new hybrid method based on the wavelet transform (WT) as a multiresolution forecasting tool and exponential smoothing (ES) method, with two presented scenarios (WES1 and WES2), was introduced. To this end, the performance of the proposed method was investigated versus four conventional methods of the autoregressive integrated moving average (ARIMA), ES ad-hoc, artificial neural network (ANN), and wavelet-ANN (WANN) for daily and monthly streamflow modeling of West Nishnabotna and Trinity River watersheds with different hydro-geomorphological conditions. In the presented WES technique, firstly, WT is employed for decomposing the observed signal to one approximation (deterministic trend) and more diverse components of subseries (each at a specific frequency). Then, for the first scenario (WES1), only two subseries are introduced to the model as input parameters; however, for the second scenario (WES2), decomposed subseries are separately used as the inputs of ES models. The obtained results indicated that combining WT with the ES method and ANN led to more accurate modeling. The proposed methodology (WES2) that used all decomposed subseries separately improved the efficiency of models up to 30% and 10% for the daily dataset and up to 88% and 57% for the monthly dataset, respectively, for the West Nishnabotna and Trinity Rivers.

show abstract

“…The models will be trained on the training set, and the fitted models will be used to estimate the predicted value in the test set, which can provide an evaluation of the models. The different splitting rate of the data set is selected in respect to the object of characteristics of the studied subjects (Tao et al 2020, Nguyen et al 2021) and the sample size (Tai et al 2019). In this study, considering that the lumber price does not fluctuate abnormally until the second half of 2020 and there are thousands of entries of samples, the splitting rate of the data set is determined to be 95 percent.…”

Section: Sample Splittingmentioning

confidence: 99%

Nowcasting of Lumber Futures Price with Google Trends Index Using Machine Learning and Deep Learning Models

He¹,

Li²,

Via³

et al. 2021

Forest Products Journal

View full text Add to dashboard Cite

Firms engaged in producing, processing, marketing, or using lumber and lumber products always invest in futures markets to reduce the risk of lumber price volatility. The accurate prediction of real-time prices can help companies and investors hedge risks and make correct market decisions. This paper explores whether Internet browsing habits can accurately nowcast the lumber futures price. The predictors are Google Trends index data related to lumber prices. This study offers a fresh perspective on nowcasting the lumber price accurately. The novel outlook of employing both machine learning and deep learning methods shows that despite the high predictive power of both the methods, on average, deep learning models can better capture trends and provide more accurate predictions than machine learning models. The artificial neural network model is the most competitive, followed by the recurrent neural network model.

show abstract

Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting

Cited by 31 publications

References 75 publications

Evaporation Rate Prediction Using Advanced Machine Learning Models: A Comparative Study

Evaporation Rate Prediction Using Advanced Machine Learning Models: A Comparative Study

Using Hybrid Wavelet-Exponential Smoothing Approach for Streamflow Modeling

Nowcasting of Lumber Futures Price with Google Trends Index Using Machine Learning and Deep Learning Models

Contact Info

Product

Resources

About