ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9054708
|View full text |Cite
|
Sign up to set email alerts
|

Few-Shot Sound Event Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
32
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 53 publications
(32 citation statements)
references
References 15 publications
0
32
0
Order By: Relevance
“…However, in a few approaches, acoustic data have been utilized to generate few-shot learning models. In the work of Wang [40], a metric-based few-shot learning method has been proposed for AER due to high cost of listening to a mixed sound to label each location of an event. Another few-shot learning approach based on the acoustic data has used an Attentional Graph Neural Network [41].…”
Section: Related Workmentioning
confidence: 99%
“…However, in a few approaches, acoustic data have been utilized to generate few-shot learning models. In the work of Wang [40], a metric-based few-shot learning method has been proposed for AER due to high cost of listening to a mixed sound to label each location of an event. Another few-shot learning approach based on the acoustic data has used an Attentional Graph Neural Network [41].…”
Section: Related Workmentioning
confidence: 99%
“…As an alternative, few-shot learning [9][10][11][12][13][14] has been applied to audio classification [15][16][17] and sound event detection [18,19], where a classifier must learn to recognize a novel class from very few examples. Among different few-shot learning methods, metricbased prototypical networks [12] have been shown to yield excellent performance for audio [15,18,19]. However, few-shot methods do not maintain the training data class vocabulary, requiring manual labeling of all novel classes for deployment, which can be overwhelming for large vocabulary problems.…”
Section: Introductionmentioning
confidence: 99%
“…There are some conventional methods of SED in the case of imbalanced data [16,17,18]. For example, Chen and Jin have proposed a method of detecting rare sound events using data augmentation [16].…”
Section: Introductionmentioning
confidence: 99%
“…For example, Chen and Jin have proposed a method of detecting rare sound events using data augmentation [16]. Wang et al have proposed a method of few-shot SED based on metric learning [17]. Dinkel and Yu have proposed a method of SED using a temporal subsampling method within a CRNN [18].…”
Section: Introductionmentioning
confidence: 99%