2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2019
DOI: 10.1109/waspaa.2019.8937196

Joint Analysis of Acoustic Events and Scenes Based on Multitask Learning

Abstract: Acoustic event detection and scene classification are major research tasks in environmental sound analysis, and many methods based on neural networks have been proposed. Conventional methods have addressed these tasks separately; however, acoustic events and scenes are closely related to each other. For example, in the acoustic scene "office", the acoustic events "mouse clicking" and "keyboard typing" are likely to occur. In this paper, we propose multitask learning for joint analysis of acoustic events and sc…

Citations: cited by 22 publications (25 citation statements)
References: 26 publications
“…Scarce research on the combination of related sound classification tasks has been conducted [10,11,13,14]. Imoto et al [13] assumed that ASC and SED are related and performed them simultaneously using a multitask learning framework.…”
Section: Related Work (mentioning)
confidence: 99%
“…For instance, perceiving car horns and traffic sounds can be helpful for knowing that he/she is standing in a street. Imoto et al [13,14] explored the relation between ASC and SED, proposing DNNs to perform the two tasks simultaneously through a multi-task learning framework [15]. However, the integration of DNNs for related tasks remains in a preliminary stage because only pairs of two related tasks are investigated and motivations on the relationship of these tasks has not been explored.…”
Section: Introduction (mentioning)
confidence: 99%
“…However, some sound events and scenes are closely related, and the knowledge of sound events and scenes can help in estimating them mutually. Considering this idea, joint analysis of sound events and acoustic scenes based on MTL of SED and ASC has been proposed [18,19]. As shown in Fig.…”
Section: Joint Analysis Of Sound Events and Scenes Based On Multitask (mentioning)
confidence: 99%
“…where α indicates the weight of the scene loss. In particular, the previous work [19] showed that an MTL-based method achieves a better performance in detecting sound events than CRNN-based SED [12].…”
Section: Joint Analysis Of Sound Events and Scenes Based On Multitask (mentioning)
confidence: 99%
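
The statement above refers to a weight α on the scene loss, but the equation itself is not quoted. As a rough illustration only, the sketch below assumes the common multitask form L = L_event + α · L_scene, with a binary cross-entropy for multi-label event detection and a categorical cross-entropy for scene classification. The function names, the choice of loss terms, and the example values are assumptions for illustration and are not taken from the cited papers.

```python
import math


def binary_cross_entropy(targets, probs, eps=1e-7):
    """Mean binary cross-entropy over event classes (multi-label SED)."""
    total = 0.0
    for t, p in zip(targets, probs):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(targets)


def categorical_cross_entropy(target_index, probs, eps=1e-7):
    """Cross-entropy for a single-label scene prediction (ASC)."""
    return -math.log(max(probs[target_index], eps))


def multitask_loss(event_targets, event_probs, scene_index, scene_probs, alpha=1.0):
    """Assumed form: event loss plus alpha times scene loss."""
    event_loss = binary_cross_entropy(event_targets, event_probs)
    scene_loss = categorical_cross_entropy(scene_index, scene_probs)
    return event_loss + alpha * scene_loss


# Hypothetical example: two event classes, two scene classes.
loss = multitask_loss([1, 0], [0.9, 0.2], 0, [0.7, 0.3], alpha=0.5)
```

Under this assumed form, setting `alpha=0` reduces the objective to event detection alone, while larger values make the shared layers favor scene classification.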
“…As an example of the relationship, "mouse clicking" occurs indoors such as "office," whereas, "car" tends to occurs outdoor such as "city center." On the basis of this idea, SED using the information on the acoustic scene [9][10][11] and the model combining SED and acoustic scene classification (ASC) [12][13][14][15][16] have been proposed. Heittola et al [10] have proposed the SED model using the results of the ASC, where the ASC model is trained in the first stage and then the SED model is trained in the second stage with the ASC results.…”
Section: Introduction (mentioning)
confidence: 99%