Music genre recognition using convolutional recurrent neural network architecture

Bisharad, Dipjyoti; Laskar, Rabul Hussain

doi:10.1111/exsy.12429

Cited by 29 publications

(18 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…They suggested convolutional recurrent neural network (CRNN) models for the music auto-tagging and genre classification. The hybrid nature of neural networks has also been used in [3][5] for music recognition tasks. They used batch normalization on the input feature map, but fail to perform well in the situation where statistics change for various time steps, as mentioned in [12].…”

Section: Results and Analysismentioning

confidence: 99%

“…Each of the datasets has some common and distinct features with associated labels. The first dataset is GTZAN which consists of 1000 samples with 10 different genres, each having 100 songs as "blues, classical, country, disco, hip-hop, jazz, metal, pop, reggae, and rock" with a duration of 30 sec each as in [3] [5]. All files are mp3 and encoded with the sample rate of 22050 Hz with a size of 16 bit with the mono channel.…”

Section: A Dataset Descriptionmentioning

confidence: 99%

“…These systems are capable enough for automatic feature extraction, free from feature biases, and beneficial in comparison with the traditional methods. Neural Networks such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs) are among the influential methods for music classification [3][4] [5]. CNNs are favorable to record the spatial dependencies concerning feature domain [6] [7], whereas RNNs reasonably handle temporal dependencies in sequential data [8].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

A Globally Regularized Joint Neural Architecture for Music Classification

et al. 2020

View full text Add to dashboard Cite

Section: Results and Analysismentioning

confidence: 99%

Section: A Dataset Descriptionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A Globally Regularized Joint Neural Architecture for Music Classification

et al. 2020

View full text Add to dashboard Cite

“…For methods based on representation learning, a bidirectional recurrent neural network with serial attention and parallelized attention is proposed to focus on details of the target area [15]. A CNN-RNN-cascaded deep learning model that uses almost no handcrafted features is proposed in [14]. In [12], a CNN-based architecture with multi-level and multiscale features [11] is extended by transfer learning.…”

Section: ) Gtzan With Artist Filtermentioning

confidence: 99%

“…The scatter transform and the transfer learning have been widely used for image and audio classification [3]- [6] but rarely used for MGR in combination. Various methods for building music genre classifiers have been studied, including support vector machines (SVM) [7]- [9], Gaussian processes [10], convolutional neural network (CNN) [11] [12] [13], recurrent neural network (RNN) [14], and long short-term memory (LSTM) [15]. It is proven that CNNs learning representation features yield better performance in comparison to LSTMs and traditional methods which extract handcrafted features [16].…”

Section: Introductionmentioning

confidence: 99%

Multi-Level Local Feature Coding Fusion for Music Genre Recognition

Zeng

Wang

2020

IEEE Access

View full text Add to dashboard Cite

Music genre recognition (MGR) plays a fundamental role in the context of music indexing and retrieval. Unlike images, music genres consist of immediate characteristics that are highly diversified with abstractions in different levels. However, most representation learning methods for MGR focus on global features and make decisions from features in the same level. To remedy such defects, we intergrate a convolutional neural network (CNN) with NetVLAD and self-attention to capture the local information across levels and learn their long-term dependencies. A meta classifier is used to make the final MGR classification by learning from aggregated high-level features from different local feature coding networks. Experimental results show that the proposed approach yields higher accuracies than other state-of-the-art models on GTZAN, ISMIR2004, and Extended Ballroom dataset.

show abstract

List of Deep Learning Models

Mosavi

Ardabili²,

Várkonyi-Kóczy

2020

Lecture Notes in Networks and Systems

View full text Add to dashboard Cite

Deep learning (DL) algorithms have recently emerged from machine learning and soft computing techniques. Since then, several deep learning (DL) algorithms have been recently introduced to scientific communities and are applied in various application domains. Today the usage of DL has become essential due to their intelligence, efficient learning, accuracy and robustness in model building. However, in the scientific literature, a comprehensive list of DL algorithms has not been introduced yet. This paper provides a list of the most popular DL algorithms, along with their applications domains.

show abstract

Music genre recognition using convolutional recurrent neural network architecture

Cited by 29 publications

References 14 publications

A Globally Regularized Joint Neural Architecture for Music Classification

A Globally Regularized Joint Neural Architecture for Music Classification

Multi-Level Local Feature Coding Fusion for Music Genre Recognition

List of Deep Learning Models

Contact Info

Product

Resources

About