2021
DOI: 10.1007/s00521-020-05672-2
|View full text |Cite
|
Sign up to set email alerts
|

Deep neural network architectures for dysarthric speech analysis and recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(11 citation statements)
references
References 35 publications
0
11
0
Order By: Relevance
“…The latest attempt at the time of this writing is [24], where a comparison between MFCCs, mel-frequency spectral coefficients, and perceptual linear prediction features extraction approaches was made to develop a dysarthric phoneme recognition system. Then, another comparison was made between CNN and Long-Short-Term Memory neural architectures and benchmarked with the conventional GMM-HMM-based approaches.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The latest attempt at the time of this writing is [24], where a comparison between MFCCs, mel-frequency spectral coefficients, and perceptual linear prediction features extraction approaches was made to develop a dysarthric phoneme recognition system. Then, another comparison was made between CNN and Long-Short-Term Memory neural architectures and benchmarked with the conventional GMM-HMM-based approaches.…”
Section: Related Workmentioning
confidence: 99%
“…On the other hand, our proposed solution is not affected by these limitations, as explained in the next section. It is pertinent to note that [24] was excluded from our comparative study since a compete ASR was not proposed, and the phoneme accuracy measured was not comparable to WER or WRAtwo criteria usually used to evaluate ASR efficacy.…”
Section: Related Workmentioning
confidence: 99%
“…If the objects are rotated by an angle of α, then its corresponding polar coordinate changes to (τr, θ + α). After log-polar transformation, the mapping is as Equations ( 8)- (10).…”
Section: Log-polar Transformationmentioning
confidence: 99%
“…In recent years, deep learning has played an important role in many areas of life, such as image processing [1][2][3], object detection [4][5][6], optic imaging [7][8][9], and speech recognition [10,11]. Especially in object detection and recognition, the accuracy of deep learning models becomes increasingly important [12].…”
Section: Introductionmentioning
confidence: 99%
“…In 2021, Brahim et al [17] found that the CNN-based system using perceptual linear prediction features achieved an impressive 82% recognition rate, which represents an improvement of 11% and 32% over the LSTM-and GMM-HMM-based systems, respectively, compared to the widely used MFCC.…”
mentioning
confidence: 99%