2015
DOI: 10.1109/taslp.2015.2403619
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Intelligibility Assessment of Dysarthric Speech Using Phonologically-Structured Sparse Linear Model

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
25
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 40 publications
(25 citation statements)
references
References 32 publications
0
25
0
Order By: Relevance
“… 5 If we consider the relationship between phonemes using weighted finite state transducer [54], which imposes different weights to phoneme mapping, we can obtain more reasonable phoneme alignment results even when we have a lot of phoneme errors. …”
mentioning
confidence: 99%
“… 5 If we consider the relationship between phonemes using weighted finite state transducer [54], which imposes different weights to phoneme mapping, we can obtain more reasonable phoneme alignment results even when we have a lot of phoneme errors. …”
mentioning
confidence: 99%
“…However, speaker recognition for dysarthric individuals is a challenging issue [3,4]. It is associated with unusual phonation and amplitude, especially on explosive phonemes [1].…”
Section: Introductionmentioning
confidence: 99%
“…It is associated with unusual phonation and amplitude, especially on explosive phonemes [1]. However, speaker recognition for dysarthric individuals is a challenging issue [3,4]. The human auditory system may not be able to successfully identify these people by their voices because of these unnatural variations.…”
Section: Introductionmentioning
confidence: 99%
“…A speaker-specific pronunciation dictionary was automatically generated using phoneme posterior probabilities of a deep neural network (DNN) trained on regular speech [21] or the state-specific vector of phone-cluster adaptive training-based acoustic models [49]. Weighted finite state transducers (WFSTs) using phonetic confusion matrices resulting from a regular ASR system were built to allow phonetic confusions during decoding process [22], [23], [32]. …”
Section: Introductionmentioning
confidence: 99%