2021
DOI: 10.48550/arxiv.2106.11759
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(8 citation statements)
references
References 0 publications
0
8
0
Order By: Relevance
“…Work conducted after the closing date of this systematic review suggests that deep learning techniques may further improve automatic classification of stuttering [23] [64]. Further work is needed to confirm whether modern deep methods can allow advances in performance as seen in other fields such as computer vision [65].…”
Section: B Recommendationsmentioning
confidence: 95%
“…Work conducted after the closing date of this systematic review suggests that deep learning techniques may further improve automatic classification of stuttering [23] [64]. Further work is needed to confirm whether modern deep methods can allow advances in performance as seen in other fields such as computer vision [65].…”
Section: B Recommendationsmentioning
confidence: 95%
“…2.1.3 Dysfluent Speech Recognition. Technical work on improving speech assistants for PWS has focused on ASR models [8,23,31,35,50,51,61], stuttering detection [43], dysfluency detection or classification [22,40,42,48,56], clinical assessment [11], and dataset development [12,37,42,55]. Shonibare et al [61] and Mendelev et al [50] investigate training end-to-end RNN-T ASR models on speech from PWS.…”
Section: Overview Of Speech Recognition Systemsmentioning
confidence: 99%
“…Alharbi et al [2,3] focused on stuttered speech from kids that incorporates the structure of repetitions and other dyfluencies into an augmented language model that is better at including dysfluencies in a transcription. In our work, we focus on solutions, like Mitra et al's [51], which can be applied on top of existing recognition systems and do not require as much data as end-to-end solutions. Their VA-oriented approach was to optimize a small set of ASR decoder parameters on stuttered speech, such that the system is biased towards common VA phrases, and was effective in removing dysfluencies such as repetitions in speech.…”
Section: Overview Of Speech Recognition Systemsmentioning
confidence: 99%
See 2 more Smart Citations