2020
DOI: 10.1109/access.2020.2999388
|View full text |Cite
|
Sign up to set email alerts
|

A Comprehensive Review of Polyphonic Sound Event Detection

Abstract: One of the most amazing functions of the human auditory system is the ability to detect all kinds of sound events in the environment. With the technologies and hardware advances, polyphonic Sound Event Detection (SED) can be developed to mimic the ability of the human auditory system. However, the development of a SED system is no trivial task, and several different factors often hinder accuracy. Although there are several overview papers available, most of them only provide a theoretical overview of algorithm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
14
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 38 publications
(20 citation statements)
references
References 108 publications
(249 reference statements)
0
14
0
1
Order By: Relevance
“…8: Affine mapping (3) based on sliding source along table edge (points 1-2-3-4) using point 1 as reference and (b) long-table edge (points 5-6) using point 6 as reference. The estimated B matrix is based on table (point 1-6), chair (point 7-11), and chair+table (point [1][2][3][4][5][6][7][8][9][10][11].…”
Section: Mappingmentioning
confidence: 99%
See 1 more Smart Citation
“…8: Affine mapping (3) based on sliding source along table edge (points 1-2-3-4) using point 1 as reference and (b) long-table edge (points 5-6) using point 6 as reference. The estimated B matrix is based on table (point 1-6), chair (point 7-11), and chair+table (point [1][2][3][4][5][6][7][8][9][10][11].…”
Section: Mappingmentioning
confidence: 99%
“…This could be used for fall monitoring [4] and daily events classification [5]. This system could also be combined to sound scene classification, e.g., [6], [7].…”
Section: Introductionmentioning
confidence: 99%
“…inside a house or in a crowded street) or to different actions or sources which produced the obtained acoustic signals. In the former case, we talk about acoustic scene classification [8], while in the latter the problem at hand is sound event classification or detection [9].…”
Section: Introductionmentioning
confidence: 99%
“…Such a problem is more likely to be solved when one had access to a large corpus of strongly labeled data where the event tags and corresponding onsets and offsets are known with certainty. However, the lack of strongly labeled data is a recurring problem faced during model development as it is difficult and time-consuming to collect [4].…”
Section: Introductionmentioning
confidence: 99%