2021
DOI: 10.13053/cys-25-2-3946
|View full text |Cite
|
Sign up to set email alerts
|

Deep Neural Network for Musical Instrument Recognition Using MFCCs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 0 publications
0
4
0
Order By: Relevance
“…These nonlinear RP embedding features are compared with two of the most popular linear spectral features, the spectrogram and Mel Frequency Cepstral Coefficients (MFCC) 68,69 . While spectrograms of size 432 × 288 are computed from the CMT dataset with 3 s siding windows, 39 dimensional (13 static, 13 delta and 13 double delta) MFCC are computed using a sliding window of 25 ms with an overlap of 10 ms.…”
Section: Choice Of Embedding Dimension and Delay Parameter For Rpmentioning
confidence: 99%
See 1 more Smart Citation
“…These nonlinear RP embedding features are compared with two of the most popular linear spectral features, the spectrogram and Mel Frequency Cepstral Coefficients (MFCC) 68,69 . While spectrograms of size 432 × 288 are computed from the CMT dataset with 3 s siding windows, 39 dimensional (13 static, 13 delta and 13 double delta) MFCC are computed using a sliding window of 25 ms with an overlap of 10 ms.…”
Section: Choice Of Embedding Dimension and Delay Parameter For Rpmentioning
confidence: 99%
“…These speaker identification experiments use a Deep Neural Network (DNN) on the 39-dim MFCC features derived from the CMT dataset for each of the three modes of speech. The architecture of the DNN used here with ReLU activation function (R) and dropout layers is 512R-1024R-512R-dropout(0.3)-128R-64R-dropout(0.2)-20S, where S is the final softmax layer 69 . The results are given in Table 4.…”
Section: Unimodal Systems With Mfccmentioning
confidence: 99%
“…MFCC is a feature obtained by simulating the auditory characteristics of the human ear [12], which has a good performance in speech recognition [13]. It is extracted in the following way.…”
Section: Instrument Feature Extractionmentioning
confidence: 99%
“…Although this view is not entirely correct, it points out that the sound quality of an instrument is the importance of harmonic amplitudes to the sound quality of musical instruments. In terms of using computer to synthesize piano sound and improving the sound quality of piano sound through computer processing simulation, literature in [10] pointed out that harmonics are an important factor of sound quality, but this paper discusses how to improve the sound quality from the perspective of harmonic amplitude and phase changing with time, piano sound. Study in [11] believes that the harmonic amplitude is an important factor that constitutes the sound quality of the piano, and then uses the method of simulating multiple strings to reasonably adjust the frequency spectrum of the piano to study how to improve the sound of the piano.…”
Section: Related Workmentioning
confidence: 99%