2021
DOI: 10.1109/tnsre.2021.3076778
|View full text |Cite
|
Sign up to set email alerts
|

Speech Vision: An End-to-End Deep Learning-Based Dysarthric Automatic Speech Recognition System

Abstract: Dysarthria is a disorder that affects an individual's speech intelligibility due to the paralysis of muscles and organs involved in the articulation process. As the condition is often associated with physically debilitating disabilities, not only do such individuals face communication problems, but also interactions with digital devices can become a burden. For these individuals, automatic speech recognition (ASR) technologies can make a significant difference in their lives as computing and portable digital d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
37
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 78 publications
(37 citation statements)
references
References 27 publications
0
37
0
Order By: Relevance
“…We considered speech features presented as both MFCCs and spectrograms for the optimal setup identification since they both delivered significant results in the previous studies. While MFCCs have been widely studied in the literature, spectrograms do not appear to be thoroughly investigated in the context of intelligibility assessment, although they have outperformed other feature extraction approaches in ASR tasks [18]. We experimented with different configurations by selecting different MFCC parameters and spectrogram setups explained below.…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We considered speech features presented as both MFCCs and spectrograms for the optimal setup identification since they both delivered significant results in the previous studies. While MFCCs have been widely studied in the literature, spectrograms do not appear to be thoroughly investigated in the context of intelligibility assessment, although they have outperformed other feature extraction approaches in ASR tasks [18]. We experimented with different configurations by selecting different MFCC parameters and spectrogram setups explained below.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…Similarly, CNNs have been successfully used in speech modeling tasks where acoustic features were presented visually. In this context, CNNs have been applied for dysarthria intelligibility classification where dysarthric speech features were presented as spectrograms, for example, in [18]. A more comprehensive example is [5], in which a cross-modal framework including both video and acoustic data was proposed.…”
Section: Reference-free Approachesmentioning
confidence: 99%
“…Artificial intelligence (AI) has become very popular in the past few years because it adds human capabilities, e.g., learning, reasoning, and perception, to the software accurately and efficiently, and as a result, computers gain the ability to perform tasks that are usually carried out by humans. The recent advances in computing resources and availability of large datasets, as well as the development of new AI algorithms, have opened the path to the use of AI in many different areas, including but not limited to image synthesis [ 121 ], speech recognition [ 122 , 123 ] and engineering [ 124 , 125 , 126 ]. AI has been also employed in healthcare industries for applications such as protein engineering [ 127 , 128 , 129 , 130 ], cancer detection [ 131 ], and drug discovery [ 132 , 133 ].…”
Section: Artificial Intelligence In Medical Image Analysismentioning
confidence: 99%
“…learning, reasoning, and perception, to the software accurately and efficiently and as the result, computers gain the ability to do tasks that are usually done by humans. The recent advances in computing resources and availability of large datasets, as well as the development of the new AI algorithms, have opened the path to the use of AI in many different areas, including but not limited to Image Synthesis [117], Speech Recognition [118] [119] and Engineering [120]- [122]. AI has been also employed in healthcare industries for applications such as protein engineering [123]- [126], cancer detection [127], and drug discovery [128], [129].…”
Section: Artificial Intelligence In Medical Image Analysismentioning
confidence: 99%