2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) 2016
DOI: 10.1109/iscslp.2016.7918369
|View full text |Cite
|
Sign up to set email alerts
|

Long short-term memory recurrent neural network based segment features for music genre classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
17
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 34 publications
(18 citation statements)
references
References 14 publications
0
17
0
1
Order By: Relevance
“…The CNN-based approaches obtain notable results in MGC tasks; however, they neglect spectrogram temporal information, which may be useful. Based on this reasoning, the long short-term memory recurrent neural network (RNN) has been used [9] to extract features from scatter spectrograms [1] of audio segments and fuse them with those obtained using CNNs. In addition, to take advantage of both CNNs and RNNs, a convolutional RNN has been designed for music tagging [7] .…”
Section: Mgc Methodsmentioning
confidence: 99%
“…The CNN-based approaches obtain notable results in MGC tasks; however, they neglect spectrogram temporal information, which may be useful. Based on this reasoning, the long short-term memory recurrent neural network (RNN) has been used [9] to extract features from scatter spectrograms [1] of audio segments and fuse them with those obtained using CNNs. In addition, to take advantage of both CNNs and RNNs, a convolutional RNN has been designed for music tagging [7] .…”
Section: Mgc Methodsmentioning
confidence: 99%
“…Author has a tendency to conjointly demonstrate the capability of the options to capture relevant data from audio information by applying them to genre classification on the ISMIR 2004 dataset. [4] 5. Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification In the typical frame feature primarily based expressive style classification strategies, the audio information is depicted by freelance frames and therefore the serial nature of audio is completely unheeded.…”
Section: A Deep Learning Approach For Mapping Music Genresmentioning
confidence: 99%
“…LSTM) can grasp the prominent long-term dependency based properties, such as recurrent harmonics and music structure contained in the music. These are the possible reasons why deep learning architecture based schemes have achieved tremendous success in various MIR tasks, such as onset detection [6], emotion recognition [7], chord estimation [8], rhythm stimuli recognition [9], source separation [10], music recommendation [11] and auto-tagging [4], [12], [14], [15]. For music classification tasks, CNN and RNN are the two most adopted deep learning architectures.…”
Section: Introductionmentioning
confidence: 99%
“…To take full advantage of the complementarity between CNN and RNN in representing different aspects of music sound, some researchers proposed to construct hybrid architectures of CNN and RNN for music classification [2], [4], [12], [13]. In [13], a hybrid architecture consisting of the paralleling CNN and Bi-RNN blocks was proposed.…”
Section: Introductionmentioning
confidence: 99%