2021
DOI: 10.1016/j.apacoust.2020.107682
|View full text |Cite
|
Sign up to set email alerts
|

Bionic optimization of MFCC features based on speaker fast recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 14 publications
0
2
0
Order By: Relevance
“…As sound is a type of volatile signal that contains a lot of noise, direct sound recognition is inefficient. 44,45 Additionally, the 2D-CNN is best suited to extracting features from images; its ability to recognize sound pressure signals is weak. Therefore, an acoustic wave spectrograms classification network (AWCNet), which consists of MFCCs, STFTs, and a CNN, is proposed in this study, as shown in Figure 9.…”
Section: Data Preparationmentioning
confidence: 99%
“…As sound is a type of volatile signal that contains a lot of noise, direct sound recognition is inefficient. 44,45 Additionally, the 2D-CNN is best suited to extracting features from images; its ability to recognize sound pressure signals is weak. Therefore, an acoustic wave spectrograms classification network (AWCNet), which consists of MFCCs, STFTs, and a CNN, is proposed in this study, as shown in Figure 9.…”
Section: Data Preparationmentioning
confidence: 99%
“…In the current deep learning framework, audio environment hidden information feature extraction methods are mainly divided into two categories: traditional feature representation [6][7][8] and automatic learning audio feature representation based on deep network [9,10]. MFCC [11], spectrograms, acoustic event histograms [12], and gradient histograms based on time-frequency learning [13] are the most commonly used traditional methods for acoustic feature representation.…”
Section: Introductionmentioning
confidence: 99%