A great number of computer vision publications have focused on distinguishing between human action recognition and classification rather than the intensity of actions performed. Indexing the intensity which determines the performance of human actions is a challenging task due to the uncertainty and information deficiency that exists in the video inputs. To remedy this uncertainty, in this paper we coupled fuzzy logic rules with the neural-based action recognition model to rate the intensity of a human action as intense or mild. In our approach, we used a Spatio-Temporal LSTM to generate the weights of the fuzzy-logic model, and then demonstrate through experiments that indexing of the action intensity is possible. We analyzed the integrated model by applying it to videos of human actions with different action intensities and were able to achieve an accuracy of 89.16% on our intensity indexing generated dataset. The integrated model demonstrates the ability of a neuro-fuzzy inference module to effectively estimate the intensity index of human actions.
Aviation is a complicated transportation system, and safety is of paramount importance because aircraft failure often involves casualties. Prevention is clearly the best strategy for aviation transportation safety. Learning from past incident data to prevent potential accidents from happening has proved to be a successful approach. To prevent potential safety hazards and make effective prevention plans, aviation safety experts identify primary and contributing factors from incident reports. However, safety experts’ review processes have become prohibitively expensive nowadays. The number of incident reports is increasing rapidly due to the acceleration of advances in information technologies and the growth of the commercial and private aviation transportation industries. Consequently, advanced text mining algorithms should be applied to help aviation safety experts facilitate the process of incident data extraction. This paper focuses on constructing deep-learning-based models to identify causal factors from incident reports. First, we prepare the data sets used for training, validation, and testing with approximately 200,000 qualified incident reports from the Aviation Safety Reporting System (ASRS). Then, we take an open-source natural language model, which is well trained with a large corpus of Wikipedia texts, as the baseline and fine-tune it with the texts in incident reports to make it more suited to our specific research task. Finally, we build and train an attention-based long short-term memory (LSTM) model to identify primary and contributing factors in each incident report. The solution we propose has multilabel capability and is automated and customizable, and it is more accurate and adaptable than traditional machine learning methods in extant research. This novel application of deep learning algorithms to the incident reporting system can efficiently improve aviation safety.
Customer-agent conversations (i.e. call transcripts) are invaluable source for companies as they convey direct information from their customers implicit and explicit behaviour. Identifying customerrelated events is an important task in customer services which is possible from the call transcripts. However, call centers produces a vast amount of transcripts which makes the manual or semi-manual processing of such raw datasets quite challenging. Furthermore, customer-agent call transcripts tend not to explicitly denote events that might be beneficial to customer services. Albeit being highly researched across multiple domains in the literature, event detection, especially implicit life event detection have not been well examined from call transcripts due to a lack of proper large-scale dataset. In this research, we propose a novel deep learning approach based on latent topic modeling and deep recurrent neural networks with memory units to automatically detect implicit events from a customer's history of call transcripts. These implicit events are detected prior to the report date of that event thereby not containing any explicit topic/feature. We provide a case study on a real-life, large-scale data of more than 800K call transcripts from a large financial services company in the U.S. to examine the practical features and challenges of this problem. The evaluation results demonstrate the potential applicability of our implicit life event detection as it achieves a macro-recall score of 53 (macro-f1 of 47.5) on a highly imbalanced test set, negative samples are 95% of the data. Our model beats the the state-of-the-art text classification benchmarks by macro-f1 score of 5.6 and macro-recall of 8.8 on average, and performs better than the ensemble of all single-document and sequential classification benchmarks albeit being significantly less complex. The comparison results show the importance as well as our model's capability of capturing the mutual information of a sequence of call transcripts in detecting the implicit life events. INDEX TERMS Implicit event discovery, call transcripts, deep learning, recurrent neural network, machine learning, natural language processing, text classification, topic modeling, event detection.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.