Music Genre Classification using Spectral Analysis Techniques With Hybrid Convolution-Recurrent Neural Network

Ahmad*, Faiyaz; Sahil,

doi:10.35940/ijitee.a3956.119119

Cited by 5 publications

(4 citation statements)

References 10 publications

(16 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The time duration of the data affects the accuracy of results obtained, as seen in the comparison of several studies in the duration data comparison (Table 6). Research Time duration Acc % Ahmad [7] 3 seconds 95 Lau [19] 3 seconds 81.73 Zhang [10] 3 seconds 87,4 Vita [9] 30 seconds 58 Purnama [13] 30 seconds 60 Ndou [3] 30…”

Section: Resultsmentioning

confidence: 99%

“…These features can be used to classify the type of music. The research conducted by Shah et al [6] [7]. In another study, classification using CNN with a three-second music duration feature gave 72.4% better accuracy than a thirty-second music duration feature which was only 53.50%; the spectrogram feature showed increased accuracy but with an even greater number of epochs [3].…”

Section: Introductionmentioning

confidence: 98%

See 1 more Smart Citation

Hyperparameter Optimization of CNN Classifier for Music Genre Classification

Soekarta,

Aras,

Ahmad Nur Aswad

2023

J. RESTI (Rekayasa Sist. Teknol. Inf.)

View full text Add to dashboard Cite

Playing music through a digital platform that has a large database of songs requires automated classification of music genres, highlighting the need to develop a model for music genre classification that is more efficient and accurate. This study evaluated the hyperparameters in the music genre classification process using the CNN on the GTZAN dataset with 30-second duration data optimized using the MFCC feature extraction. The model that is formed with a time of 3 (three) seconds classifies music genres in the first 3 seconds of music. This model has a high potential for error because the first 3 seconds of initial music is varied and cannot be used as a benchmark in determining music genres. This study performed hyperparameters on batch size, epoch, and split dataset variables with various scenarios. The highest accuracy result was obtained at 72% with a data split of 85%:15%, 32 batch size,s and 500 epochs

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 98%

Hyperparameter Optimization of CNN Classifier for Music Genre Classification

Soekarta,

Aras,

Ahmad Nur Aswad

2023

J. RESTI (Rekayasa Sist. Teknol. Inf.)

View full text Add to dashboard Cite

show abstract

“…[3] At present, there are few popular techniques to sort plastic and non-plastic materials. One of them is infrared hyper spectral imaging which is used for real-time instantaneous identification of plastics, where intelligent algorithms for image recognition and classification are used [6][7][8][9][10][11]. Near Infrared Spectroscopy is another technique used for the identification and selection of materials and is used for plastic segregation as proposed in [14].…”

Section: Introductionmentioning

confidence: 99%

“…Audio signals from different materials are transformed to data sets that represents the material explicitly using feature engineering. [4][5][6][7] There are many feature extraction techniques for audio signal [19] and these feature extraction methods use both spectral, and joint time-frequency signal representation. Artificial Neural Network is the classifier used to classify the feature database.…”

Section: Introductionmentioning

confidence: 99%

A Low Cost Solution for Automatic Plastic Segregation

Dharmana,

2020

IJEAT

View full text Add to dashboard Cite

Abstract: Solid waste management is a universal issue that matters to every single person in the world. The solid waste management system is fundamentally labor intensive with very little collection efficiency. The available automatic plastic segregation techniques are based on thermal imaging and electrostatic properties of materials-these methods are expensive for governments to invest upon, and also to maintain in landfills. In this paper, artificial intelligence techniques are exploited to recognize the sounds of plastics from that of other materials by designing suitable mechanism to produce sound from debris during segregation, the segregation process can be automated with relatively low-cost electronics like System on Chips and audio sensors. With 30,000 recorded samples of noisy plastic and non-plastic material sounds, ANN is trained and was able to successfully detect plastics with 93.5% accuracy in real time. Algorithms were developed in python and real time testing was done on SoC with a mic, which affirms that the proposed method is cost effective when compared to techniques involving image processing, thermal imaging and near infrared spectroscopy.

show abstract