2019
DOI: 10.1145/3322240
|View full text |Cite
|
Sign up to set email alerts
|

Environmental Audio Scene and Sound Event Recognition for Autonomous Surveillance

Abstract: Monitoring of human and social activities is becoming increasingly pervasive in our living environment for public security and safety applications. The recognition of suspicious events is important in both indoor and outdoor environments, such as child-care centers, smart-homes, old-age homes, residential areas, office environments, elevators, and smart cities. Environmental audio scene and sound event recognition are the fundamental tasks involved in many audio surveillance applications. Although numerous app… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
29
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 60 publications
(29 citation statements)
references
References 96 publications
0
29
0
Order By: Relevance
“…EASR refers to recognition of indoor or outdoor acoustic scenes (e.g., cafes/restaurants, home, vehicle or metro stations, supermarkets, versus crowded or silent streets, forest landscape, countryside, beaches, gym halls, swimming pools). SER is intended to the investigation of specific acoustic events in the audio environments, like dog barking, gunshots, sudden brake sounds, or human nonspeech events, like coughing, whistling, screaming, child crying, snoring, sneezing [13].…”
Section: Introductionmentioning
confidence: 99%
“…EASR refers to recognition of indoor or outdoor acoustic scenes (e.g., cafes/restaurants, home, vehicle or metro stations, supermarkets, versus crowded or silent streets, forest landscape, countryside, beaches, gym halls, swimming pools). SER is intended to the investigation of specific acoustic events in the audio environments, like dog barking, gunshots, sudden brake sounds, or human nonspeech events, like coughing, whistling, screaming, child crying, snoring, sneezing [13].…”
Section: Introductionmentioning
confidence: 99%
“…Rex [30] provided software recommendations for SED. Although a survey by Chandrakala and Jayalakshmi [31] included discussion of several SED systems, the survey was focused on the audio scene and event classification. The survey by Purwins et al [32] was mainly on the general overview of audio signal processing.…”
Section: Introductionmentioning
confidence: 99%
“…Sound event detection (SED) is a task that identifies types of sound and detects their onset and offset [1]. Recently, many works have addressed SED because SED has a large potential for many applications such as monitoring elderly people or infants [2], [3], automatic surveillance [4]- [6], automatic anomaly detection [7], [8], and media retrieval [9]. SED is typically categorized into two types: monophonic and polyphonic SED.…”
Section: Introductionmentioning
confidence: 99%