ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053912
|View full text |Cite
|
Sign up to set email alerts
|

Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels

Abstract: Sound event detection (SED) and acoustic scene classification (ASC) are major tasks in environmental sound analysis. Considering that sound events and scenes are closely related to each other, some works have addressed joint analyses of sound events and acoustic scenes based on multitask learning (MTL), in which the knowledge of sound events and scenes can help in estimating them mutually. The conventional MTL-based methods utilize one-hot scene labels to train the relationship between sound events and scenes;… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
32
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 39 publications
(33 citation statements)
references
References 18 publications
(32 reference statements)
0
32
0
Order By: Relevance
“…Scarce research on the combination of related sound classification tasks has been conducted [10,11,13,14]. Imoto et al [13] assumed that ASC and SED are related and performed them simultaneously using a multitask learning framework.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Scarce research on the combination of related sound classification tasks has been conducted [10,11,13,14]. Imoto et al [13] assumed that ASC and SED are related and performed them simultaneously using a multitask learning framework.…”
Section: Related Workmentioning
confidence: 99%
“…For instance, perceiving car horns and traffic sounds can be helpful for knowing that he/she is standing in a street. Imoto et al [13,14] explored the relation between ASC and SED, proposing DNNs to perform the two tasks simultaneously through a multi-task learning framework [15]. However, the integration of DNNs for related tasks remains in a preliminary stage because only pairs of two related tasks are investigated and motivations on the relationship of these tasks has not been explored.…”
Section: Introductionmentioning
confidence: 99%
“…Cross-task Transfer. Transferring the learned knowledge from one task to another related task has been approved as an effective way for better data modeling and messages correlating [6,2,14]. Aytar et al [2] proposed a teacher-student framework that transfers the discriminative knowledge of visual recognition to the representation learning task of sound modality via minimizing the differences in the distribution of categories.…”
Section: Related Workmentioning
confidence: 99%
“…Aytar et al [2] proposed a teacher-student framework that transfers the discriminative knowledge of visual recognition to the representation learning task of sound modality via minimizing the differences in the distribution of categories. Imoto et al [14] proposed a method for sound event detection by transferring the knowledge of scenes with soft labels. Gan et al [8] transferred the visual object location knowledge for auditory localization learning.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation