Electricity demand forecast is necessary for power systems' operation scheduling and management. However, power consumption is uncertain and depends on several factors. Moreover, since the onset of covid-19, the electricity consumption pattern went through significant changes across the globe, which made the forecasting demand more challenging. This is mainly due to the fact that pandemic-driven restrictions changed people's lifestyles and work activities. This calls for new forecasting algorithms to more effectively handle these conditions. In this paper, ensemble-based machine learning models are utilized for this task. The lockdown temporal policies are added to the feature set in order to make the model capable of correcting itself in pandemic situations and enhance data quality for the forecasting task. Several ensemble-based machine learning models are examined for the short-term country-level demand prediction model. Besides, the quantile random forest regression is implemented for a probabilistic point of view. For case studies, the models are trained for predicting Germany's country-level demand. The results indicate that ensemble models, especially boosting and bagging-boosting models, are capable of accurate country-level demand forecast. Besides, the majority of these models are robust against missing the pandemic policy data. However, utilizing the pandemic policy data as features increases the forecasting accuracy during the pandemic situation significantly. Furthermore, the probabilistic quantile regression demonstrated high accuracy for the aforementioned case study.INDEX TERMS COVID-19 pandemic, demand forecasting, machine learning, decision tree ensembles, probabilistic.
Companies always seek ways to make their professional employees stay with them to reduce extra recruiting and training costs. Predicting whether a particular employee may leave or not will help the company to make preventive decisions. Unlike physical systems, human resource problems cannot be described by a scientific-analytical formula. Therefore, machine learning approaches are the best tools for this aim. This paper presents a three-stage (pre-processing, processing, post-processing) framework for attrition prediction. An IBM HR dataset is chosen as the case study. Since there are several features in the dataset, the “max-out” feature selection method is proposed for dimension reduction in the pre-processing stage. This method is implemented for the IBM HR dataset. The coefficient of each feature in the logistic regression model shows the importance of the feature in attrition prediction. The results show improvement in the F1-score performance measure due to the “max-out” feature selection method. Finally, the validity of parameters is checked by training the model for multiple bootstrap datasets. Then, the average and standard deviation of parameters are analyzed to check the confidence value of the model’s parameters and their stability. The small standard deviation of parameters indicates that the model is stable and is more likely to generalize well.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.