2008
DOI: 10.1007/978-3-540-70872-8_10
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Speech Recognition Used for Intelligibility Assessment of Text-to-Speech Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2009
2009
2023
2023

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 5 publications
0
7
0
Order By: Relevance
“…Automatic speech recognition (ASR) systems are a promising but relatively unexplored solution [23, 24]. One significant limitation of ASR for this application is, however, that most approaches require a prohibitively large number of speech samples, since the approach is based on counting the percentage of correctly recognized words.…”
Section: Introductionmentioning
confidence: 99%
“…Automatic speech recognition (ASR) systems are a promising but relatively unexplored solution [23, 24]. One significant limitation of ASR for this application is, however, that most approaches require a prohibitively large number of speech samples, since the approach is based on counting the percentage of correctly recognized words.…”
Section: Introductionmentioning
confidence: 99%
“…The collected speech database consists of 300 records with mean duration of 5 seconds uttered in a neutral speaking style. Every record consists of five concatenated words with a similar phonetic sound in Czech but often totally different meaning (eg "pes", "nes", "ves" -in English: "dog", "carry", "village") usually used in the rhythm test for evaluation by the automatic speech recognition systems (ASR) [14]. These speech records were uttered by a female speaker with F0 ≈ 200 Hz, recorded at 32 kHz, and subsequently resampled to f s = 16 kHz.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…Results of the cepstral coefficient ranges and values statistical analysis are shown also in the form of histograms in a similar way as the spectral flatness ranges and values. This method can also be used for evaluation of emotional synthetic speech as a supplementary approach parallel to the listening tests [23].…”
Section: Resultsmentioning
confidence: 99%