2020 International Conference on Signal Processing and Communications (SPCOM) 2020
DOI: 10.1109/spcom50965.2020.9179511
|View full text |Cite
|
Sign up to set email alerts
|

Intelligibility Improvement of Dysarthric Speech using MMSE DiscoGAN

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(14 citation statements)
references
References 21 publications
0
9
0
Order By: Relevance
“…Our results cannot be directly compared to those reported in [10] and their state-of-the-art DiscoGAN model because of the differences in the denoising of the dysarthric speech that was applied in our work but not in theirs (see Section 2.2).…”
Section: Comparison Of the Results To State-of-the-art And Disco-ganmentioning
confidence: 86%
See 2 more Smart Citations
“…Our results cannot be directly compared to those reported in [10] and their state-of-the-art DiscoGAN model because of the differences in the denoising of the dysarthric speech that was applied in our work but not in theirs (see Section 2.2).…”
Section: Comparison Of the Results To State-of-the-art And Disco-ganmentioning
confidence: 86%
“…The PER results are shown for individual speakers separately and averaged over all speakers and blocked by model type (see the table caption for an explanation). Note that empty cells refer to results that were not given or specified by [10].…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The phoneme error rate (PER) calculated with a phoneme recognizer evaluates the intelligibility, which is also related to severity and naturalness (RQ1 and RQ3). We use a pre-trained Kaldi ASR model with the same specifications as the one used in [22] for phoneme recognition. The ASR was trained with the TIMIT dataset and used an HMM acoustic model.…”
Section: Phoneme Error Ratementioning
confidence: 99%
“…Rule-based transformation based on signal processing [3] is limited in that each patient needs to be individually considered. Statistical approaches adopt models ranging from Gaussian mixture models [4], exemplar-based methods [5,6] and deep neural networks [7,8,9].…”
Section: Introductionmentioning
confidence: 99%