Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-2903
|View full text |Cite
|
Sign up to set email alerts
|

Diagnosing Dysarthria with Long Short-Term Memory Networks

Abstract: This paper proposes the use of Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units for determining whether Mandarin-speaking individuals are afflicted with a form of Dysarthria based on samples of syllable pronunciations. Several LSTM network architectures are evaluated on this binary classification task, using accuracy and Receiver Operating Characteristic (ROC) curves as metrics. The LSTM models are shown to significantly improve upon a baseline fully connected network, reaching over 90… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
10
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 18 publications
(14 reference statements)
0
10
0
Order By: Relevance
“…For the classifier, most of the previous investigations have used support vector machines (SVMs) [5], [13], [16], [22]. In addition to SVMs, other algorithms such as artificial neural networks, decision trees, and variants of recurrent neural network (RNN) have also been used as classifiers in the study area [13], [28], [32]- [34]. A review of various techniques considered for both parts is given in [5].…”
Section: Introductionmentioning
confidence: 99%
“…For the classifier, most of the previous investigations have used support vector machines (SVMs) [5], [13], [16], [22]. In addition to SVMs, other algorithms such as artificial neural networks, decision trees, and variants of recurrent neural network (RNN) have also been used as classifiers in the study area [13], [28], [32]- [34]. A review of various techniques considered for both parts is given in [5].…”
Section: Introductionmentioning
confidence: 99%
“…For the classifier, most of the previous investigations have used support vector machines (SVMs) [6], [16], [22], [24]- [26]. In addition to SVMs, other algorithms such as artificial neural networks [22], [27], decision trees [28], linear discriminant analysis (LDA) [23], [29], and variants of recurrent neural network (RNN) [30] have also been used as classifiers in the study area. Even though existing detection studies have trained data-driven models with many different types of features, there still exists a need for novel features which are effective and robust when used with different pathological voice databases.…”
Section: Introductionmentioning
confidence: 99%
“…Detecting dysarthria involves extracting hand-crafted acoustic features and using those features as inputs to a machine learning-based classifier [18][19][20]. Deep learning approaches are also possible where the raw speech signal or a set of elementary features are fed into complex neural network architectures that automatically determine the important acoustic information and distinguish between healthy and dysarthric speech [21,22]. Deep learning approaches require less data preparation and feature engineering but may suffer from a lack of interpretability as further post-processing is often required to interpret how the speaker's speech is impaired.…”
Section: Introductionmentioning
confidence: 99%
“…Various types of acoustic features have been proposed for detecting dysarthric speech. Spectral features such as Mel Frequency Cepstral Coefficients (MFCCs) are used in References [22,23], and filter banks are utilized in long short-term memory classifiers [21] and convolutional neural networks [24]. Spectral measures of fricatives are shown to significantly differ between healthy and dysarthric speakers in Reference [25] and are used as input to a machine learning classifier in Reference [26].…”
Section: Introductionmentioning
confidence: 99%