Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-2155
|View full text |Cite
|
Sign up to set email alerts
|

A Lightly Supervised Approach to Detect Stuttering in Children's Speech

Abstract: In speech pathology, new assistive technologies using ASR and machine learning approaches are being developed for detecting speech disorder events. Classically-trained ASR model tends to remove disfluencies from spoken utterances, due to its focus on producing clean and readable text output. However, diagnostic systems need to be able to track speech disfluencies, such as stuttering events, in order to determine the severity level of stuttering. To achieve this, ASR systems must be adapted to recognise full ve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
33
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 34 publications
(39 citation statements)
references
References 25 publications
1
33
0
Order By: Relevance
“…As a result, while these methods struggle with sub-word stutters such as sound repetition or revision, they perform well for word repetition or prolongation. This can be observed in Table 3 as [11] performs better than our method by a small margin (3.2%) for word repetition. Additionally, [11] performs with a lower miss rate than ours for detection of prolongation (5.92%).…”
Section: Performance and Comparisonmentioning
confidence: 62%
See 3 more Smart Citations
“…As a result, while these methods struggle with sub-word stutters such as sound repetition or revision, they perform well for word repetition or prolongation. This can be observed in Table 3 as [11] performs better than our method by a small margin (3.2%) for word repetition. Additionally, [11] performs with a lower miss rate than ours for detection of prolongation (5.92%).…”
Section: Performance and Comparisonmentioning
confidence: 62%
“…Another model using Bi-LSTMs with condition random fields (CRFs) to get an average F-score of 85.9% across all stutter types [15]. The current state-of-the-art stutter classification method uses task-oriented finite state transducer (FST) lattices to detect repetition stutters with an average 37% miss rate across 4 different types of [11].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The focus of this paper is on detection of five stuttering event types: Blocks, Prolongations, Sound Repetitions, Word/Phrase Repetitions, and Interjections. Existing work has explored this problem using traditional signal processing techniques [15,16,17], language modeling (LM) [12,18,19,20,21], and acoustic modeling (AM) [21,10]. Each approach has be shown to be effective 1.…”
Section: Introductionmentioning
confidence: 99%