2021
DOI: 10.3390/app11031040
|View full text |Cite
|
Sign up to set email alerts
|

Data-Dependent Feature Extraction Method Based on Non-Negative Matrix Factorization for Weakly Supervised Domestic Sound Event Detection

Abstract: In this paper, feature extraction methods are developed based on the non-negative matrix factorization (NMF) algorithm to be applied in weakly supervised sound event detection. Recently, the development of various features and systems have been attempted to tackle the problems of acoustic scene classification and sound event detection. However, most of these systems use data-independent spectral features, e.g., Mel-spectrogram, log-Mel-spectrum, and gammatone filterbank. Some data-dependent feature extraction … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 29 publications
(46 reference statements)
0
4
0
Order By: Relevance
“…As found in many audio enhancement applications the vanilla architecture usually does not generate an impressive audio quality (although comparable or slightly better than the machine learning algorithms e.g. [66], [67], [68], [69], [70], and [71]) in the output. Therefore either additional layers are added, skip connections are modified, hyper parameters are adjusted or completely new architectures are proposed in many recent AE models to improve the performance.…”
Section: Modified Architecturesmentioning
confidence: 91%
See 2 more Smart Citations
“…As found in many audio enhancement applications the vanilla architecture usually does not generate an impressive audio quality (although comparable or slightly better than the machine learning algorithms e.g. [66], [67], [68], [69], [70], and [71]) in the output. Therefore either additional layers are added, skip connections are modified, hyper parameters are adjusted or completely new architectures are proposed in many recent AE models to improve the performance.…”
Section: Modified Architecturesmentioning
confidence: 91%
“…However, the event detection tasks do not require the frequency axis restoration as the required information lies on the time axis. So in LUU-Net only time scale is restored by limited upscaling of the frequency axis by using asymmetric stride for decoder convolutional layers [70]. This results in an immense reduction in learnable parameters.…”
Section: Limited Upscale U-net (Luu-net)mentioning
confidence: 99%
See 1 more Smart Citation
“…Although NMF is quite an old algorithm, it is considered an effective and widespread methodology as it is still employed in multiple areas of interest, such as sound event detection [29], speech recognition [30], text mining [31][32][33], image analysis [34,35], security and privacy [36] and community detection [37]. In many of these applications, NMF is used as a tool for classification, filtering, dimensionality reduction as well as data clustering, which is the core concept of this study.…”
Section: Methodsmentioning
confidence: 99%
“…This paper illustrates the ability of these decomposition models to impute missing features, denoising and to artificially generate additional data samples (data augmentation) with examples to the brain-computer interface (BCI) and epileptic EEG analysis, among others. In [9], Lee et al (South Korea) developed feature extraction methods based on the non-negative matrix factorization (NMF) algorithm and it is applied in weakly supervised sound event detection.…”
Section: Methodological Articlesmentioning
confidence: 99%