2022
DOI: 10.1177/1748006x221140196
|View full text |Cite
|
Sign up to set email alerts
|

A framework based on Natural Language Processing and Machine Learning for the classification of the severity of road accidents from reports

Abstract: Road safety analysis is typically performed by domain experts on the basis of the information contained in accident reports. The main challenges are the difficulty of considering a large number of reports in textual form and the subjectivity of the expert judgments contained in reports. This work develops a framework based on the combination of Natural Language Processing (NLP) and Machine Learning (ML) for the automatic classification of accidents with the final aim of assisting experts in performing road saf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 51 publications
0
0
0
Order By: Relevance
“…Thus, 1AT contains only seven different classes to be predicted, which are significantly fewer than 3AT, consisting of 297 different accident types to be predicted [10]. Furthermore, other research on building classifiers based on police accident data often focuses on predicting or modelling injury-related parameters, for example, the accident severity or the number of injured people [19], [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], or investigating risk factors [30], [31], [32], [33], [34], [35], [36]. Additionally, models regarding crash type, typically describing how road users hit each other, 1 seem to be part of the research focus [37], [38], [39], [40], [41].…”
Section: Introductionmentioning
confidence: 99%
“…Thus, 1AT contains only seven different classes to be predicted, which are significantly fewer than 3AT, consisting of 297 different accident types to be predicted [10]. Furthermore, other research on building classifiers based on police accident data often focuses on predicting or modelling injury-related parameters, for example, the accident severity or the number of injured people [19], [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], or investigating risk factors [30], [31], [32], [33], [34], [35], [36]. Additionally, models regarding crash type, typically describing how road users hit each other, 1 seem to be part of the research focus [37], [38], [39], [40], [41].…”
Section: Introductionmentioning
confidence: 99%
“…It is important to recognize that work zone characteristics and the environment exert a significant influence on work zone accidents, injuries, and fatalities [7]. Additionally, human factors, such as worker behavior and ergonomics, play a crucial role in accidents in highway construction zones [1,6].…”
Section: Introductionmentioning
confidence: 99%
“…Various approaches have been explored, including combining Term Frequency-Inverse Document Frequency (TFIDF) with machine learning classifiers, utilizing the K-means clustering algorithm for data mining and employing feature analysis through descriptive statistics [9]. TFIDF, a traditional method in text analytics, quantifies words' importance in a document, but it has limitations in capturing word similarity and accurately reflecting their importance [7].…”
Section: Introductionmentioning
confidence: 99%