Ausdang Thangthai scite author profile

Ausdang Thangthai

5Publications

8Citation Statements Received

64Citation Statements Given

How they've been cited

How they cite others

Affiliations

National Electronics and Computer Technology Center, University of East Anglia

Publications

Order By: Most citations

Synthesising visual speech using dynamic visemes and deep learning architectures

Thangthai

Milner

Taylor

2019

Computer Speech & Language

View full text Add to dashboard Cite

This paper proposes and compares a range of methods to improve the naturalness of visual speech synthesis. A feedforward deep neural network (DNN) and many-to-one and many-to-many recurrent neural networks (RNNs) using long short-term memory (LSTM) are considered. Rather than using acoustically derived units of speech, such as phonemes, viseme representations are considered and we propose using dynamic visemes together with a deep learning framework. The input feature representation to the models is also

show abstract

A real-time Thai speech synthesizer on a mobile device

Wongpatikaseree

Ratikan

Thangthai

et al. 2009

View full text Add to dashboard Cite

Visual Speech Synthesis Using Dynamic Visemes, Contextual Features and DNNs

Thangthai¹,

Milner²,

Taylor³

2016

View full text Add to dashboard Cite

This paper examines methods to improve visual speech synthesis from a text input using a deep neural network (DNN). Two representations of the input text are considered, namely into phoneme sequences or dynamic viseme sequences. From these sequences, contextual features are extracted that include information at varying linguistic levels, from frame level down to the utterance level. These are extracted from a broad sliding window that captures context and produces features that are input into the DNN to estimate visual features. Experiments first compare the accuracy of these visual features against an HMM baseline method which establishes that both the phoneme and dynamic viseme systems perform better with best performance obtained by a combined phoneme-dynamic viseme system. An investigation into the features then reveals the importance of the frame level information which is able to avoid discontinuities in the visual feature sequence and produces a smooth and realistic output.

show abstract

A bi-lingual Thai-English TTS system on Android mobile devices

Saychum

Thangthai

Janjoi

et al. 2012

View full text Add to dashboard Cite

Speech Gesture Generation from Acoustic and Textual Information using LSTMs

Thangthai

Namsanit

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.