In this research paper, we presented a novel approach to detect impulsive sounds in real-time using a combination of Deep CNN and RNN architectures. The proposed approach was evaluated using our collected dataset of impulsive sounds, and the results showed that it outperformed traditional audio signal processing methods in terms of accuracy and F1score. The proposed approach has several advantages over traditional methods, including the ability to handle complex audio patterns, detect impulsive sounds in real-time, and improve its performance with a large dataset of labeled impulsive sounds. However, there are some limitations to the proposed approach, including the requirement for a large amount of labeled data to train effectively, environmental factors that may impact the accuracy of the detection, and high computational requirements. Overall, the proposed approach demonstrates the effectiveness of using a combination of Deep CNN and RNN architectures for impulsive sound detection, with potential applications in various fields such as public safety, industrial settings, and home security systems. The proposed approach is a significant step towards developing automated systems for detecting dangerous events and improving public safety.
Video analytics is an integral part of surveillance cameras. Compared to video analytics, audio analytics offers several benefits, including less expensive equipment and upkeep expenses. Additionally, the volume of the audio datastream is substantially lower than the video camera datastream, especially concerning real-time operating systems, which makes it less demanding of the data channel's bandwidth needs. For instance, automatic live video streaming from the site of an explosion and gunshot to the police console using audio analytics technologies would be exceedingly helpful for urban surveillance. Technologies for audio analytics may also be used to analyze video recordings and identify occurrences. This research proposed a deep learning model based on the combination of convolutional neural network (CNN) and recurrent neural network (RNN) known as the CNN-RNN approach. The proposed model focused on automatically identifying pulse sounds that indicate critical situations in audio sources. The algorithm's accuracy ranged from 95% to 81% when classifying noises from incidents, including gunshots, explosions, shattered glass, sirens, cries, and dog barking. The proposed approach can be applied to provide security for citizens in open and closed locations, like stadiums, underground areas, shopping malls, and other places.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.