2018
DOI: 10.48550/arxiv.1812.04748
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

An efficient supervised dictionary learning method for audio signal recognition

Imad Rida,
Romain Hérault,
Gilles Gasso

Abstract: Machine hearing or listening represents an emerging area. Conventional approaches rely on the design of handcrafted features specialized to a specific audio task and that can hardly generalized to other audio fields. For example, Mel-Frequency Cepstral Coefficients (MFCCs) and its variants were successfully applied to computational auditory scene recognition while Chroma vectors are good at music chord recognition. Unfortunately, these predefined features may be of variable discrimination power while extended … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 54 publications
0
0
0
Order By: Relevance
“…Its fundamental principle involves the incorporation of update and reset gates to efficiently regulate the information flow. The main purpose of the GRU is to model sequence data, and it is especially good at capturing temporal dependencies in long sequences (Rida et al, 2018). In the field of emergency classification, the GRU performs well in processing this type of data.…”
Section: The Gru Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…Its fundamental principle involves the incorporation of update and reset gates to efficiently regulate the information flow. The main purpose of the GRU is to model sequence data, and it is especially good at capturing temporal dependencies in long sequences (Rida et al, 2018). In the field of emergency classification, the GRU performs well in processing this type of data.…”
Section: The Gru Modelmentioning
confidence: 99%
“…The basic principle is to extract the spatial level features of the input data, layer-by-layer, through convolution operations and pooling operations, thereby achieving effective processing and classification of two-dimensional data such as images. In the field of emergency classification, CNNs have been widely used to process image data and text data (Rida, 2018). Through convolution operations, CNNs are able to capture local features in event data, thereby achieving effective classification of event text and images .…”
mentioning
confidence: 99%