Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1821
|View full text |Cite
|
Sign up to set email alerts
|

Early Detection of Continuous and Partial Audio Events Using CNN

Abstract: Sound event detection is an extension of the static auditory classification task into continuous environments, where performance depends jointly upon the detection of overlapping events and their correct classification. Several approaches have been published to date which either develop novel classifiers or employ well-trained static classifiers with a detection front-end. This paper takes the latter approach, by combining a proven CNN classifier acting on spectrogram image features, with time-frequency shaped… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2
1

Relationship

4
3

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 17 publications
(32 reference statements)
0
9
0
Order By: Relevance
“…Finally, while the ability to perform early detection (i.e. low latency classification) is very useful for real-world applications such as human-robot interaction, noise reduction during calls or in acoustic security systems, very few publications explore or even mention this [26,27,28]. These several factors motivate this research to develop an ASC system that targets the three most important issues mentioned above while providing a comprehensive analysis on the ability of early detection and exploration of the main factors involved in ensuring good performance.…”
Section: State-of-the-art Approachesmentioning
confidence: 99%
See 2 more Smart Citations
“…Finally, while the ability to perform early detection (i.e. low latency classification) is very useful for real-world applications such as human-robot interaction, noise reduction during calls or in acoustic security systems, very few publications explore or even mention this [26,27,28]. These several factors motivate this research to develop an ASC system that targets the three most important issues mentioned above while providing a comprehensive analysis on the ability of early detection and exploration of the main factors involved in ensuring good performance.…”
Section: State-of-the-art Approachesmentioning
confidence: 99%
“…The ability of early detecting recording environments. Although early detection of recording environments promisingly opens a wide range of applications as introduced in Section 1.1.1, a few of research [28,27] mentioned and not many experiments have been conducted. This contribution shows an analysis of early detecting recording environments by using the novel Decoder-Encoder framework recently mentioned.…”
Section: Contributionmentioning
confidence: 99%
See 1 more Smart Citation
“…Deep learning techniques have achieved strong and robust detection performance for general sound classification [17], [18]. Feature extraction in state-of-the-art deep learning based systems typically involves generating twodimensional time-frequency spectrograms that are able to capture both fine grained temporal and spectral information as well as present a much wider time context than single frame analysis.…”
Section: Introductionmentioning
confidence: 99%
“…Isolated and overlapping AED literature often appear to derive from two separate methodological streams. For the former, there is a large body of work covering different perspectives: noise robustness [10,7,11], multichannel and multimodal fusion [12,13,14], weak labelling [15,16], early event detection [17,18,19], event detection under scarcity scenarios [20,21], as well as false positive…”
Section: Introductionmentioning
confidence: 99%