ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9747227

Nonverbal Sound Detection for Disordered Speech

Abstract: Many consumer speech recognition systems are not tuned for people with speech disabilities, resulting in poor recognition and user experience, especially for severe speech differences. Recent studies have emphasized interest in personalized speech models from people with atypical speech patterns. We propose a query-by-example-based personalized phrase recognition system that is trained using small amounts of speech, is language agnostic, does not assume a traditional pronunciation lexicon, and generalizes well…

Citations: cited by 8 publications (15 citation statements)
References: 39 publications (41 reference statements)
“…Lea et al focused on intended speech transcription. They explored how people who stutter, where stuttering is characterized by an increase in disfluencies, interact with voice assistants and dictation services [26]. These services rely on ASR and Lea et al find that, for these services, individuals preferred to only see their intended speech transcribed.…”
Section: B ASR Text and Disfluency Detection
type: mentioning
confidence: 99%
“…(e.g. with voice assistants or speech dictation systems) [3], [25], [26], [31]. With these applications in mind, we present new techniques for automatic disfluency detection, categorization, and localization.…”
Section: Introduction
type: mentioning
confidence: 99%
“…Recently, Jaddoh (2021) and Lea (2022) have studied the use of nonverbal sound as a method of instruction to extend the ability of interacting with ASR systems or devices. Lea used recordings with different accents to develop a model that detects different mouth sounds, such as "pop" and "click," as inputs, while Jaddoh suggested using nonverbal sound as a technique to control virtual home assistance.…”
Section: Table 1: Summary of Speech Modalities Used in the Literature
type: mentioning
confidence: 99%