2009
DOI: 10.1016/j.specom.2008.07.001

Modelling pronunciation variation with single-path and multi-path syllable models: Issues to consider

Abstract: International audienc

Cited by 7 publications
(2 citation statements)
References 30 publications
“…The training data contained 396 187 orthographic word tokens (37 h, 20 s of speech). The (standard) triphone training procedure is described in Hämäläinen et al., 2007 for the manual triphones, and in Hämäläinen et al., 2009 for the canonical triphones. For this study, we carried out state tying such that both sets of triphones had about 3400 physically distinct triphones.…”
Section: A Spectral Reduction Scores (mentioning)
confidence: 99%
“…There, pronunciation variation is usually modeled by adding pronunciation variants to the lexicon [3]-[6]. On the other hand, pronunciation variation can also be represented at the subword level in the acoustic model [4], [7], [8]. Here, a subword is a unit of a hidden Markov model (HMM) composing a word, and it typically corresponds to a phoneme or a triphone in speech recognition.…”
Section: Introduction (mentioning)
confidence: 99%