Process mining is a novel alternative that uses event logs to discover, monitor, and improve real business processes through knowledge extraction. Event logs are a prerequisite for any process mining technique. The extraction of event data and event log building is a complex and time-intensive process, with human participation at several stages of the procedure. In this paper, we propose a framework to semi-automatically build an event log based on the XES standard from relational databases. The framework comprises the stages of requirements identification, event log construction, and event log evaluation. In the first stage, the data is interpreted to identify the relationship between the columns and business process activities, then the business process entities are defined. In the second stage, the hierarchical structure of the event log is specified. Likewise, a formal rule set is defined to allow mapping the database columns with the attributes specified in the event log structure, enabling the extraction of attributes. This task is implemented through a correlation method at the case, event, and activity levels, to automatic event log generation. We validate the event log through quality metrics, statistical analysis, and business process discovery. The former allows for determining the quality of the event log built using the metrics of accuracy, completeness, consistency, and uniqueness. The latter evaluates the business process models discovered through precision, coverage, and generalization metrics. The proposed approach was evaluated using the autonomous Internet of Things (IoT) air quality monitoring system’s database and the patient admission and healthcare service delivery database, reaching acceptable values both in the event log quality and in the quality of the business process models discovered.
Air pollution is associated with respiratory diseases and the transmission of infectious diseases. In this context, the association between meteorological factors and poor air quality possibly contributes to the transmission of COVID-19. Therefore, analyzing historical data of particulate matter (PM2.5, and PM10) and meteorological factors in indoor and outdoor environments to discover patterns that allow predicting future confirmed cases of COVID-19 is a challenge within a long pandemic. In this study, a hybrid approach based on machine learning and deep learning is proposed to predict confirmed cases of COVID-19. On the one hand, a clustering algorithm based on K-means allows the discovery of behavior patterns by forming groups with high cohesion. On the other hand, multivariate linear regression is implemented through a long short-term memory (LSTM) neural network, building a reliable predictive model in the training stage. The LSTM prediction model is evaluated through error metrics, achieving the highest performance and accuracy in predicting confirmed cases of COVID-19, using data of PM2.5 and PM10 concentrations and meteorological factors of the outdoor environment. The predictive model obtains a root-mean-square error (RMSE) of 0.0897, mean absolute error (MAE) of 0.0837, and mean absolute percentage error (MAPE) of 0.4229 in the testing stage. When using a dataset of PM2.5, PM10, and meteorological parameters collected inside 20 households from 27 May to 13 October 2021, the highest performance is obtained with an RMSE of 0.0892, MAE of 0.0592, and MAPE of 0.2061 in the testing stage. Moreover, in the validation stage, the predictive model obtains a very acceptable performance with values between 0.4152 and 3.9084 for RMSE, and a MAPE of less than 4.1%, using three different datasets with indoor environment values.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.