Assessing the different factors that contribute to accidents in the workplace is essential to ensure the safety and well-being of employees. Given the importance of risk identification in hazard prediction, this work proposes a comparative study between different feature selection techniques (χ 2 test and Forward Feature Selection) combined with learning algorithms (Support Vector Machine, Random Forest, and Naive Bayes), both applied to a database of a leading company in the retail sector, in Portugal. The goal is to conclude which factors of each database have the most significant impact on the occurrence of accidents. Initial databases include accident records, ergonomic workplace analysis, hazard intervention and risk assessment, climate databases, and holiday records. Each method was evaluated based on its accuracy in the forecast of the occurrence of the accident. The results showed that the Forward Feature Selection-Random Forest pair performed better among the assessed combinations, considering the case study database. In addition, data from accident records and ergonomic workplace analysis have the largest number of features with the most significant predictive impact on accident prediction. Future studies will be carried out to evaluate factors from other databases that may have meaningful information for predicting accidents.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.