2007
DOI: 10.1155/2007/46460
|View full text |Cite
|
Sign up to set email alerts
|

On the Utility of Syllable-Based Acoustic Models for Pronunciation Variation Modelling

Abstract: Recent research on the TIMIT corpus suggests th at longer-length acoustic models are more appropriate for pronunciation variation modelling than the context-dependent phones th at conventional autom atic speech recognisers use. However, the impressive speech recognition results obtained with longer-length models on TIMIT rem ain to be reproduced on other corpora. To understand the conditions in which longer-length acoustic models result in considerable im provem ents in recognition performance, we carry out re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
18
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(19 citation statements)
references
References 12 publications
1
18
0
Order By: Relevance
“…To alleviate the problems of the 'beads on a string' representation of speech, several authors propose modelling the spectral and temporal variation in speech 'implicitly' by using longerlength linguistic units as the basic building blocks of speech (Ganapathiraju et al, 2001;Hämäläinen et al, 2007a;Jones et al, 1997;Jouvet and Messina, 2004;Plannerer and Ruske, 1992;. For various reasons, most of these authors (Ganapathiraju et al, 2001;Hämäläinen et al, 2007a;Jones et al, 1997;Jouvet and Messina, 2004; suggest using syllable-length models.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations
“…To alleviate the problems of the 'beads on a string' representation of speech, several authors propose modelling the spectral and temporal variation in speech 'implicitly' by using longerlength linguistic units as the basic building blocks of speech (Ganapathiraju et al, 2001;Hämäläinen et al, 2007a;Jones et al, 1997;Jouvet and Messina, 2004;Plannerer and Ruske, 1992;. For various reasons, most of these authors (Ganapathiraju et al, 2001;Hämäläinen et al, 2007a;Jones et al, 1997;Jouvet and Messina, 2004; suggest using syllable-length models.…”
Section: Introductionmentioning
confidence: 99%
“…For various reasons, most of these authors (Ganapathiraju et al, 2001;Hämäläinen et al, 2007a;Jones et al, 1997;Jouvet and Messina, 2004; suggest using syllable-length models. First, using syllables allows for a relatively compact representation of speech, while maintaining a manageable level of recogniser complexity.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Other projects of IMIX were responsible for question answering (van den Bosch et al, 2004;Tjong Kim Sang et al, 2005;Bouma et al, 2007), dialog and action management (op den Akker et al, 2005), speech synthesis (Marsi, 2004), and speech recognition (Hämäläinen et al, 2007). Work in this thesis contributed to the answer presentation module of IMIX.…”
Section: Imixmentioning
confidence: 99%