2022
DOI: 10.3390/s22083033
Musical Instrument Identification Using Deep Learning Approach

Abstract: The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata description and a review of related works. This is followed by showing the datas…

Cited by 22 publications (10 citation statements)
References 35 publications (71 reference statements)
“…Abd-AlGalil et al [10] extract mel-frequency cepstral coefficients (MFCCs) from the signal using the auditory characteristics of the human ear and then detect the starting point based on cepstral distance, with an accuracy of 96%. Schönberger [11] records the moment of note onset based on the result of phase difference, while Blaszke and Kostek [12] first preprocess the music signal in full phase and then detect note onset using the feature of phase-difference mutation; experimental results show that this type of method is more suitable for note detection in slow-rhythm music. Alqahtani et al [13] combined wavelet-domain and time-domain features for note slicing and achieved 96% accuracy in detecting the starting point of piano music, but the number of missed notes was high, resulting in a low recall rate [14].…”
Section: Related Work
confidence: 99%
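The cepstral-distance onset detection idea cited above can be sketched roughly as follows. This is a minimal illustrative sketch, not the cited authors' method: the frame length, hop size, number of cepstral coefficients, and threshold are all assumed values.

```python
import numpy as np

def cepstral_onset_candidates(signal, frame_len=512, hop=256, threshold=2.0):
    """Flag frame indices where the cepstral distance to the previous
    frame jumps above a threshold -- a rough proxy for note onsets.
    (Illustrative only; real systems use MFCCs and adaptive thresholds.)"""
    frames = [signal[i:i + frame_len]
              for i in range(0, len(signal) - frame_len, hop)]
    onsets = []
    prev_cep = None
    for idx, frame in enumerate(frames):
        # windowed magnitude spectrum, floored to avoid log(0)
        spectrum = np.abs(np.fft.rfft(frame * np.hanning(frame_len)))
        log_spec = np.log(spectrum + 1e-10)
        # keep only the low-quefrency cepstral coefficients
        cepstrum = np.fft.irfft(log_spec)[:20]
        if prev_cep is not None:
            dist = np.linalg.norm(cepstrum - prev_cep)
            if dist > threshold:
                onsets.append(idx)
        prev_cep = cepstrum
    return onsets
```

On a signal that switches from silence to a steady tone, the first flagged frame falls near the transition, which is the behaviour the cepstral-distance criterion relies on.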
“…DNNs have been used successfully in widespread applications, from speech-based emotion recognition [45–47] to music recognition [48] to detecting emotion in music [49]. Recent work with DNNs has shown immense potential for musical instrument classification, as reviewed by Blaszke and Kostek [50], particularly for predominant-instrument recognition in polyphonic audio. To our knowledge, no previous work with DNNs has explored their application to the percussion instruments (maracas, tambourines, castanets) or sound environment (e.g.…”
Section: Discussion
confidence: 99%
“…While the classification accuracy obtained by the methods described herein appeared to provide children with a good user experience, DNNs might be a promising direction of exploration, particularly if more instrument families are added to the system. As reviewed by Blaszke and Kostek [50], the current state of the art for multiple-instrument recognition yields F1 scores around 0.64, while their DNN approach provided a substantial increase to 0.93 [50]. DNN approaches may also offer greater flexibility, allowing for more complex models for instruments that are difficult to classify and simpler, more computationally efficient models for instruments that are easily identified [50].…”
Section: Discussion
confidence: 99%
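The F1 scores quoted above (0.64 vs. 0.93) combine precision and recall over the set of instruments predicted for an excerpt. For a multiple-instrument setting, the per-excerpt computation can be sketched as follows; the instrument names in the usage example are hypothetical.

```python
def instrument_f1(true_set, pred_set):
    """F1 over one excerpt's predicted vs. true instrument label sets.
    (Illustrative sketch of the metric, not the cited evaluation code.)"""
    if not true_set and not pred_set:
        return 1.0  # nothing to find, nothing predicted
    tp = len(true_set & pred_set)
    precision = tp / len(pred_set) if pred_set else 0.0
    recall = tp / len(true_set) if true_set else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Hypothetical excerpt: one instrument missed, one falsely predicted.
score = instrument_f1({"piano", "guitar", "drums"}, {"piano", "drums", "bass"})
```

Here precision and recall are both 2/3, giving F1 ≈ 0.667, which illustrates how a handful of missed or spurious instruments pulls the score toward the 0.64 state-of-the-art figure quoted above.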
“…Judging from the consistent positive outcomes, it only makes sense to assume that in the future, AM-enhanced NNs will be extensively used for MIR. In [97], identification is performed for four instruments: bass, drums, piano, and guitar. The model architecture consists of four identical, independent sub-models, each catering to one instrument.…”
Section: Convolutional Neural Network (CNN)
confidence: 99%
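The per-instrument architecture described above — four identical, independent sub-models, each deciding the presence of one instrument — can be outlined structurally as follows. This is a sketch of the decomposition only: the linear-sigmoid scorer stands in for each CNN sub-model, and the feature dimension, threshold, and weights are all hypothetical.

```python
import numpy as np

INSTRUMENTS = ["bass", "drums", "piano", "guitar"]

class PerInstrumentDetector:
    """One independent binary detector per instrument, mirroring the
    four-sub-model design described above. Each sub-model here is a
    placeholder linear scorer, not an actual CNN."""

    def __init__(self, n_features, threshold=0.5):
        rng = np.random.default_rng(0)
        # one independent weight vector per instrument sub-model
        self.weights = {name: rng.normal(size=n_features)
                        for name in INSTRUMENTS}
        self.threshold = threshold

    def predict(self, features):
        """Return the set of instruments whose sub-model fires."""
        present = set()
        for name, w in self.weights.items():
            # sigmoid presence score from this instrument's sub-model
            score = 1.0 / (1.0 + np.exp(-(features @ w)))
            if score > self.threshold:
                present.add(name)
        return present
```

Because the sub-models are independent, any subset of instruments can be reported for one excerpt — the multi-label behaviour the paper's design targets — and individual sub-models can be made larger or smaller per instrument, matching the flexibility point made in the discussion above.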