1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258) 1999
DOI: 10.1109/icassp.1999.758104
|View full text |Cite
|
Sign up to set email alerts
|

Hidden Markov models based on multi-space probability distribution for pitch pattern modeling

Abstract: This paper discusses a hidden Markov model (HMM) based on multi-space probability distribution (MSD). The H M M s are widelyused statistical models to characterize the sequence of speech spectra and have successfully been applied to speech recognition system. From these facts, it is considered that the HMM is useful for modeling pitch patterns of speech. However, we cannot apply the conventional discrete or continuous H M M s to pitch pattem modeling since the observation sequence of pitch pattem is composed o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
92
0
3

Year Published

2006
2006
2016
2016

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 203 publications
(95 citation statements)
references
References 7 publications
(4 reference statements)
0
92
0
3
Order By: Relevance
“…A five-state, left-to-right, no-skip structure was used for the HMMs. The excitation parameters were modeled with multi-space probability distributions HMMs [29] in both the proposed and conventional methods. Each state output probability distribution was modeled by a single Gaussian distribution with a diagonal covariance matrix.…”
Section: Methodsmentioning
confidence: 99%
“…A five-state, left-to-right, no-skip structure was used for the HMMs. The excitation parameters were modeled with multi-space probability distributions HMMs [29] in both the proposed and conventional methods. Each state output probability distribution was modeled by a single Gaussian distribution with a diagonal covariance matrix.…”
Section: Methodsmentioning
confidence: 99%
“…The HMMs were composed of five states with no-skip left-to-right transitions, with one Gaussian mixture for each state. The MSDHMMs [38], were used for the F 0 modelling. The Mel Log Spectrum Approximation (MLSA) filter [39] was used for the synthesis from the generated speech parameters.…”
Section: Design Of a Cross-language Mapped Synthetic Voicementioning
confidence: 99%
“…The cross valid prior distribution can be determined without tuning parameters. In the HMM-based speech synthesis, the multi-space probability distribution HMMs (MSD-HMMs) [10] have been used to model excitation. However, the cross valid prior distributions for the MSD-HMMs can be determined by using sufficient statistics of each space as equation (18).…”
Section: S S)mentioning
confidence: 99%