2021 5th International Conference on Computing Methodologies and Communication (ICCMC) 2021
DOI: 10.1109/iccmc51019.2021.9418035
|View full text |Cite
|
Sign up to set email alerts
|

Music Genre Classification using Transfer Learning on log-based MEL Spectrogram

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 12 publications
0
8
0
Order By: Relevance
“…In terms of classifying music genres, one study trained CNN models with the Mel-scaled spectrogram (MSS) as a dataset, which exhibited superior performance compared to other machine-learning techniques with different data formats in previous studies [18]. The MSS is a type of spectrogram with the Mel scale on the y-axis [17].…”
Section: Music Genre Classification With Mel-scaled Spectrogrammentioning
confidence: 99%
See 1 more Smart Citation
“…In terms of classifying music genres, one study trained CNN models with the Mel-scaled spectrogram (MSS) as a dataset, which exhibited superior performance compared to other machine-learning techniques with different data formats in previous studies [18]. The MSS is a type of spectrogram with the Mel scale on the y-axis [17].…”
Section: Music Genre Classification With Mel-scaled Spectrogrammentioning
confidence: 99%
“…In our CNN model, we strategically choose to employ the Mel-scaled spectrogram (MSS) [17] as a pivotal feature. This decision is bolstered by the MSS's proven superior performance in music genre classification tasks when used in conjunction with CNNs, highlighting its potential effectiveness for our purposes [18]. The Mel scale is specifically designed to mirror human auditory sensitivity, adeptly capturing variations in frequency and amplitude within audio signals [19].…”
Section: Introductionmentioning
confidence: 99%
“…One of the proven methods for joint time-frequency domain analysis of non-stationary sound signals is STFT [59]. The STFT spectrogram is a two-dimensional convolution of the signal and window function [72]: the X-axis represents time, the Y-axis represents frequency, and the amplitude of a particular frequency at a particular time is represented by its color in the image [73].…”
Section: Short-time Fourier Transformation (Stft)-spectrogrammentioning
confidence: 99%
“…Zeng and Tan (2021) [31] developed a large-scale pretrained model MusicBERT for four music understanding tasks, including melody completion, accompaniment suggestion, genre classification, and style classification. Mehta and Gandhi (2021) [32] compared four transfer learning architectures, Resnet34, Resnet50, VGG16, and AlexNet, for music genre classification.…”
Section: Literature Reviewmentioning
confidence: 99%