2022
DOI: 10.3390/app12105075
|View full text |Cite
|
Sign up to set email alerts
|

A Method Based on Dual Cross-Modal Attention and Parameter Sharing for Polyphonic Sound Event Localization and Detection

Abstract: Sound event localization and detection (SELD) is a joint task that unifies sound event detection (SED) and direction-of-arrival estimation (DOAE). The task has become such a popular topic that it was introduced into the challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) Task3 in 2019. In this paper, we propose a method based on dual cross-modal attention (DCMA) and parameter sharing to simultaneously detect and localize sound events. In particular, the DCMA-based decoder commonly u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 36 publications
0
0
0
Order By: Relevance
“…Transformers are able to capture time-dependent features in data, without the use of recurrent networks as GRU or LSTM. This approach was adapted to the SELD task in (Lee et al, 2021;Park et al, 2021a).…”
Section: Discussionmentioning
confidence: 99%
“…Transformers are able to capture time-dependent features in data, without the use of recurrent networks as GRU or LSTM. This approach was adapted to the SELD task in (Lee et al, 2021;Park et al, 2021a).…”
Section: Discussionmentioning
confidence: 99%