Environmental Audio Scene and Sound Event Recognition for Autonomous Surveillance

Chandrakala, S.; Jayalakshmi, S.

doi:10.1145/3322240

Cited by 60 publications

(29 citation statements)

References 96 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…EASR refers to recognition of indoor or outdoor acoustic scenes (e.g., cafes/restaurants, home, vehicle or metro stations, supermarkets, versus crowded or silent streets, forest landscape, countryside, beaches, gym halls, swimming pools). SER is intended to the investigation of specific acoustic events in the audio environments, like dog barking, gunshots, sudden brake sounds, or human nonspeech events, like coughing, whistling, screaming, child crying, snoring, sneezing [13].…”

Section: Introductionmentioning

confidence: 99%

Environmental Acoustics Modelling Techniques for Forest Monitoring

Segarceanu¹,

Suciu²,

Gavăt³

2021

Adv. sci. technol. eng. syst. j.

View full text Add to dashboard Cite

Environmental sounds detection plays an increasing role in computer science and robotics as it simulates the human faculty of hearing. It is applied in environment research, monitoring and protection, by allowing investigation of natural reserves, and showing potential risks of damage that can be deduced from the environmental acoustic. The research presented in this paper is related to the development of an intelligent forest environment monitoring solution, which applies signal analysis algorithm to detect endangering sounds. Environmental sounds are processed using some modelling algorithms based on which the acoustic forest events can be classified into one of the categories: chainsaw, vehicle, genuine forest background noise. The article will explore and compare several methodologies for environmental sound classification, among which the dominant Deep Neural Networks, the Long Short-Term Memory, and the classical Gaussian Mixtures Modelling and Dynamic Time Warping.

show abstract

Section: Introductionmentioning

confidence: 99%

Environmental Acoustics Modelling Techniques for Forest Monitoring

Segarceanu¹,

Suciu²,

Gavăt³

2021

Adv. sci. technol. eng. syst. j.

View full text Add to dashboard Cite

show abstract

“…Rex [30] provided software recommendations for SED. Although a survey by Chandrakala and Jayalakshmi [31] included discussion of several SED systems, the survey was focused on the audio scene and event classification. The survey by Purwins et al [32] was mainly on the general overview of audio signal processing.…”

Section: Introductionmentioning

confidence: 99%

A Comprehensive Review of Polyphonic Sound Event Detection

Chan

Chin

2020

IEEE Access

View full text Add to dashboard Cite

One of the most amazing functions of the human auditory system is the ability to detect all kinds of sound events in the environment. With the technologies and hardware advances, polyphonic Sound Event Detection (SED) can be developed to mimic the ability of the human auditory system. However, the development of a SED system is no trivial task, and several different factors often hinder accuracy. Although there are several overview papers available, most of them only provide a theoretical overview of algorithms used with little discussion. Thus, to the best of the authors' knowledge, there is no comprehensive review that covers this particular domain. Therefore, this paper aims to provide an in-depth discussion of different methodologies proposed by various authors that include the features used, detection algorithms, and their corresponding accuracy and limitations. Additional information on possible trends is also discussed that can be useful for future development works.

show abstract

“…Sound event detection (SED) is a task that identifies types of sound and detects their onset and offset [1]. Recently, many works have addressed SED because SED has a large potential for many applications such as monitoring elderly people or infants [2], [3], automatic surveillance [4]- [6], automatic anomaly detection [7], [8], and media retrieval [9]. SED is typically categorized into two types: monophonic and polyphonic SED.…”

Section: Introductionmentioning

confidence: 99%

Sound Event Detection Utilizing Graph Laplacian Regularization with Event Co-Occurrence

Imoto

Kyochi

2020

IEICE Trans. Inf. & Syst.

View full text Add to dashboard Cite

A limited number of types of sound event occur in an acoustic scene and some sound events tend to co-occur in the scene; for example, the sound events "dishes" and "glass jingling" are likely to cooccur in the acoustic scene "cooking." In this paper, we propose a method of sound event detection using graph Laplacian regularization with sound event co-occurrence taken into account. In the proposed method, the occurrences of sound events are expressed as a graph whose nodes indicate the frequencies of event occurrence and whose edges indicate the sound event co-occurrences. This graph representation is then utilized for the model training of sound event detection, which is optimized under an objective function with a regularization term considering the graph structure of sound event occurrence and co-occurrence. Evaluation experiments using the TUT Sound Events 2016 and 2017 detasets, and the TUT Acoustic Scenes 2016 dataset show that the proposed method improves the performance of sound event detection by 7.9 percentage points compared with the conventional CNN-BiGRU-based detection method in terms of the segment-based F1 score. In particular, the experimental results indicate that the proposed method enables the detection of co-occurring sound events more accurately than the conventional method.

show abstract

Environmental Audio Scene and Sound Event Recognition for Autonomous Surveillance

Cited by 60 publications

References 96 publications

Environmental Acoustics Modelling Techniques for Forest Monitoring

Environmental Acoustics Modelling Techniques for Forest Monitoring

A Comprehensive Review of Polyphonic Sound Event Detection

Sound Event Detection Utilizing Graph Laplacian Regularization with Event Co-Occurrence

Contact Info

Product

Resources

About