2020
DOI: 10.48550/arxiv.2002.05848
Preprint

Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels

Cited by 3 publications
(4 citation statements)
References 16 publications
“…Cross-task Transfer. Transferring the learned knowledge from one task to another related task has been approved as an effective way for better data modeling and messages correlating [7,2,14]. Aytar et al [2] proposed a teacher-student framework that transfers the discriminative knowledge of visual recognition to the representation learning task of sound modality via minimizing the differences in the distribution of categories.…”
Section: Related Work (mentioning)
confidence: 99%
“…Chaplot et al [5] utilized a dual-attention unit to align textual and visual representations with the transferred knowledge of words and objects. Due to the partial correlation of scenes and sound events, Imoto et al [14] proposed a method for sound event detection by transferring the knowledge of scenes with soft labels. Salem et al [25] proposed to transfer the sound clustering knowledge to the image recognition task by predicting the distribution of sound clusters from an overhead image, similarly work can be found in [22].…”
Section: Related Work (mentioning)
confidence: 99%
“…Recently there have been few studies to jointly conduct the ASC and the audio tagging task [10][11][12]. Bear et.…”
Section: Introduction (mentioning)
confidence: 99%