Tone Question of Tree Based Context Clustering for Hidden Markov Model Based Thai Speech Synthesis

Chomphan, Suphattharachai

doi:10.3844/jcssp.2010.1474.1478

Cited by 16 publications

(19 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Both male and female speech has been constructed in the speech database. The extraction tools are applied in this study (Mixdorff and Fujisaki, 1997;Chomphan and Kobayashi, 2007a;2007b). For each parameter, the frequency distribution over its range is constructed, subsequently the distributions of standard Thai are plot in a graph.…”

Section: Resultsmentioning

confidence: 99%

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises

Edgar¹

2012

American Journal of Applied Sciences

View full text Add to dashboard Cite

Problem statement: An important human speech feature is the fundamental frequency (F0) contour which represents the speech prosody. It indicates the naturalness and intelligibility of the speech. Modeling of fundamental frequency contour was an essential procedure in the natural speech processing. In speech communication, environmental noise plays an essential role in damaging the digital communication quality. The study of effects of noises on modeling of F0 contour for standard Thai is conducted. Approach: The selected modeling technique in this study was adapted from Fujisaki's model, because of its achievement in modeling of various Thai speech units. Four types of environmental noises were recorded for different levels of power. This study was proposed an analysis of some parameters of modeling of Thai speech prosody for two genders and four types of noises. The derived Fujisaki's model was covered seven parameters including baseline frequency, the numbers of phrase commands and tone commands, phrase command and tone command durations, amplitudes of phrase command and tone command. Results: In the experimental results, the standard Thai of 2 samples of 5 sentences with 5 males and 5 females was used. Four types of noises include train, factory, car and air conditioner. Five levels of each type of noise were varied from 0-20 dB. The results were showing that the different noises give the distinguished effects for most of the proposed model parameters. Conclusion: The results confirm that the effects of four types of noises are significantly different. It can be seen that the environmental noises deteriorate the model parameters empirically.

show abstract

Section: Resultsmentioning

confidence: 99%

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises

Edgar¹

2012

American Journal of Applied Sciences

View full text Add to dashboard Cite

show abstract

“…As for speaker dependent HMM-based Thai speech synthesis research, a speech synthesizer has been implemented (Chomphan and Kobayashi, 2007a). In the system, a group of contextual factors which affect spectrum, fundamental frequency (F0) and state duration, such as tone type and part of speech are taken into account especially for the purpose of producing natural sounding prosody of the tonal speech.…”

Section: Introductionmentioning

confidence: 99%

“…It has been found that it can provide speech with the better reproduction of prosody over the unit-selection-based Vaja TTS system from NECTEC (National Electronics and Computers Technology Center) (Chomphan and Kobayashi, 2007b). A decision tree with a toneseparated structure (Chomphan and Kobayashi, 2007a) presents the considerable improvement of tone correctness of the synthesized speech. However, some distortion of syllable duration is noticeable when the system is trained with a small amount of data.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis

WAN¹

2012

American Journal of Applied Sciences

View full text Add to dashboard Cite

Problem statement: Tone intelligibility in speech synthesis is an important attribute that should be taken into account. The tone correctness of the synthetic speech is degraded considerably in the average-voice-based HMM-based Thai speech synthesis. The tying mechanism in the decision tree based context clustering without appropriate criterion causes unexpected tone neutralization. Incorporation of the phrase intonation to the context clustering process in the training stage was proposed early. However, the tone correctness is not satisfied. Approach: This study proposes a number of tonal features including tone-geometrical features and phrase intonation features to be exploited in the context clustering process of HMM training stage. Results: In the experiments, subjective evaluations of both average voice and adapted voice in terms of the intelligibility of tone are conducted. Effects on decision trees of the extracted features are also evaluated. By considering gender in training speech, two core experiments were conducted. The first experiment shows that the proposed tonal features can improve the tone intelligibility for female speech model above that of male speech model, while the second experiment shows that the proposed tonal features improve the tone intelligibility for gender dependent model than for gender independent model. Conclusion: All of the experimental results confirm that the tone correctness of the synthesized speech from the averagevoice-based HMM-based Thai speech synthesis is significantly improved when using most of the extracted features.

show abstract

“…Therefore appropriate predictive schemes for path-dependent connection-holding times will be required. Hidden Markov Model (HMM), provides the solution for wide variety of applications such speech tagging and noun-phrase chunking (Chomphan, 2012), is also flexible enough to predict any type of internet traffic (Dainotti et al, 2008) and has been demonstrated for wavelength assignment in WDM networks by Johnson et al (2001). In this study, we use thresholding of HMM predicted path-dependent connection holding times and show that this connection-time impairment aware provisioning of Burst Sizes and Associated Wavelength Allocation (ABS-WA) can reduce delays and improve throughput.…”

Section: Introductionmentioning

confidence: 99%

Evaluation of Hidden Markov Model Based Adaptive Provisioning of Optical Burst Switching Networks Amenable for Upgradation to Green Flexigrid Networks

Ramesh¹,

Nair²

2014

Journal of Computer Science

View full text Add to dashboard Cite

Catering to the evolving bandwidth-on-demand applications requires flexible provisioning architectures that ensure fairness to both high end and low end users of internet. Dynamic classification of network paths into long and short, based on traffic dependent connection holding times and a wavelength allocation from different subsets for these categories is a necessity for implementing energy saving hybrid switching, loss recovery and flexi grid bit rate variable schemes. Therefore, this study evaluates such a scheme of dynamic classification based on HMM predicted connection-holding times with tightly integrated adaptive burst sizing and segregated wavelength allocation for long and short categories. Simulation study of a 28 node OBS network shows that this coupled scheme reduces delays and results in throughput improvement of 67% for long and 12% for short traffic over schemes that employ adaptive burst sizing based on number of active traffic flows and independent wavelength assignment schemes.

show abstract

Tone Question of Tree Based Context Clustering for Hidden Markov Model Based Thai Speech Synthesis

Cited by 16 publications

References 17 publications

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises

Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis

Evaluation of Hidden Markov Model Based Adaptive Provisioning of Optical Burst Switching Networks Amenable for Upgradation to Green Flexigrid Networks

Contact Info

Product

Resources

About

Tone Question of Tree Based Context Clustering for Hidden Markov Model Based Thai Speech Synthesis

Cited by 16 publications

References 17 publications

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises

Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis

Evaluation of Hidden Markov Model Based Adaptive Provisioning of Optical Burst Switching Networks Amenable for Upgradation to Green Flexigrid Networks

Contact Info

Product

Resources

About

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises