Say Wei Foo scite author profile

The performance of an automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of the lips, tongue, and teeth, especially in noisy environments. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov models (HMMs) to build an AdaBoost-HMM classifier. The composite HMMs of the AdaBoost-HMM classifier are trained to cover different groups of training samples using the AdaBoost technique and the biased Baum-Welch training method. By combining the decisions of the component classifiers of the composite HMMs according to a novel probability synthesis rule, a more complex decision boundary is formulated than with a single HMM classifier. The method is applied to the recognition of the basic visual speech elements. Experimental results show that the AdaBoost-HMM classifier outperforms the traditional HMM classifier in accuracy, especially for visemes extracted from contexts.
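As a rough illustration of the boosting idea mentioned in this abstract, the sketch below shows one round of generic discrete AdaBoost sample reweighting. It is not the paper's method: the biased Baum-Welch training and the probability synthesis rule are not reproduced, and the function name `adaboost_round` and the toy data are assumptions made for this example.

```python
import math

def adaboost_round(weights, misclassified):
    """One round of generic discrete AdaBoost reweighting (illustrative only).

    weights: current sample weights, assumed to sum to 1.
    misclassified: booleans, True where the current classifier erred.
    Returns (alpha, new_weights).
    """
    # Weighted error of the current component classifier.
    err = sum(w for w, m in zip(weights, misclassified) if m)
    if not 0.0 < err < 0.5:
        raise ValueError("weighted error must lie in (0, 0.5)")
    # Vote strength of this classifier in the combined decision.
    alpha = 0.5 * math.log((1.0 - err) / err)
    # Raise the weights of misclassified samples and lower the rest.
    raw = [w * math.exp(alpha if m else -alpha)
           for w, m in zip(weights, misclassified)]
    total = sum(raw)
    return alpha, [w / total for w in raw]

# Hypothetical toy data: four equally weighted samples, the last one misclassified.
weights = [0.25, 0.25, 0.25, 0.25]
alpha, weights = adaboost_round(weights, [False, False, False, True])
```

After the update, the misclassified sample carries half of the total weight, so the next component classifier is pushed toward the samples the previous one got wrong.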

FPGA Implementation of Digital Filters Synthesized Using the FRM Technique

Lim, Zheng, et al. 2003

Circuits Syst Signal Process

The effective length of a filter designed using the frequency-response masking (FRM) technique is very long and requires a very large number of delay elements. In this paper, we present some useful techniques for reducing the data transfer between the field programmable gate array (FPGA) and external memory when the random logic is implemented using the FPGA and the delay elements are implemented using an external memory such as dynamic random access memory.
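To make the point about delay elements concrete, here is a small sketch that assumes the standard FRM structure H(z) = Ha(z^M) Hma(z) + Hc(z^M) Hmc(z), where Ha is the band-edge shaping filter, Hc its complement, and Hma and Hmc the masking filters. The function names and the filter lengths used below are hypothetical and are not taken from the paper.

```python
def interpolate(coeffs, m):
    """Return the taps of Ha(z^M): M-1 zeros inserted between adjacent taps."""
    out = [0.0] * ((len(coeffs) - 1) * m + 1)
    out[::m] = coeffs
    return out

def frm_effective_length(n_a, n_ma, n_mc, m):
    """Effective length of the overall FRM filter.

    The interpolated filter has (n_a - 1) * m + 1 taps; cascading it with
    the longer masking filter adds that masking filter's length minus one.
    """
    return (n_a - 1) * m + max(n_ma, n_mc)

# Hypothetical design: 45-tap shaping filter, interpolation factor 9,
# masking filters of 41 and 33 taps.
n_a, n_ma, n_mc, m = 45, 41, 33, 9
delays_in_shaping_filter = (n_a - 1) * m
effective_length = frm_effective_length(n_a, n_ma, n_mc, m)
```

With these assumed numbers the shaping filter has only 45 nonzero coefficients but needs 396 delay elements, which is why storing the delay line in external memory, as the abstract describes, becomes attractive.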

Recognition of Visual Speech Elements Using Hidden Markov Models

Foo, Dong 2002

Detection of stress and emotion in speech using traditional and FFT based log energy features

Nwe, Foo, Silva

In this paper, a novel system for detection of human stress and emotion in speech is proposed. The system makes use of FFT-based linear short-time Log Frequency Power Coefficients (LFPC) and TEO-based nonlinear LFPC features in both time and frequency domains. The performance of the proposed system is compared with the traditional approaches, which use LPCC and MFCC features. Each approach is evaluated on the SUSAS (Speech Under Simulated and Actual Stress) and ESMBS (Emotional Speech of Mandarin and Burmese Speakers) databases. The proposed system is observed to outperform the traditional systems. Results show that the system using LFPC gives the highest accuracy (87.8% for stress, 89.2% for emotion classification), followed by the system using the NFD-LFPC feature, while the system using the NTD-LFPC feature gives the lowest accuracy.
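The abstract refers to TEO-based features. As background, the sketch below applies the standard discrete Teager energy operator, psi[n] = x[n]^2 - x[n-1] * x[n+1], to a pure tone. It does not reproduce the paper's LFPC feature pipeline, and the function name `teager_energy` and the test signal are assumptions made for this example.

```python
import math

def teager_energy(x):
    """Discrete Teager energy operator: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    The first and last samples are skipped because they lack a neighbour.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

# Hypothetical test signal: a cosine with amplitude 2 and 0.3 rad/sample.
amp, omega = 2.0, 0.3
tone = [amp * math.cos(omega * n) for n in range(50)]
psi = teager_energy(tone)
```

For a pure tone the operator returns the constant amp^2 * sin(omega)^2 at every sample, so it tracks both amplitude and frequency. This nonlinearity is what distinguishes it from a plain squared-magnitude energy measure.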

Say Wei Foo

Speech emotion recognition using hidden Markov models

Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models
