Darryl Stewart scite author profile

We present results of a study into the performance of a variety of image transform-based feature types for speaker-independent visual speech recognition of isolated digits. This includes the first reported use of features extracted using a discrete curvelet transform. The study compares several methods for selecting features of each feature type and shows the relative benefits of both static and dynamic visual features. The performance of the features is tested on both clean video data and video data corrupted in a variety of ways, to assess each feature type's robustness to potential real-world conditions. One of the test conditions involves a novel form of video corruption we call jitter, which simulates camera and/or head movement during recording.
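The abstract describes jitter only as a corruption that simulates camera and/or head movement; it does not give the procedure. As a rough illustration, and not the authors' method, the sketch below displaces each frame by a random pixel offset. The function names `shift_frame` and `add_jitter`, the `max_offset` parameter, and the zero fill for uncovered pixels are all assumptions made for this example.

```python
import random


def shift_frame(frame, dx, dy, fill=0):
    """Shift a 2D frame (list of rows) right by dx and down by dy pixels.

    Pixels shifted in from outside the frame take the value `fill`.
    """
    height, width = len(frame), len(frame[0])
    out = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            src_y, src_x = y - dy, x - dx
            if 0 <= src_y < height and 0 <= src_x < width:
                out[y][x] = frame[src_y][src_x]
    return out


def add_jitter(frames, max_offset, seed=None):
    """Apply an independent random shift to every frame (hypothetical jitter model)."""
    rng = random.Random(seed)
    jittered = []
    for frame in frames:
        dx = rng.randint(-max_offset, max_offset)
        dy = rng.randint(-max_offset, max_offset)
        jittered.append(shift_frame(frame, dx, dy))
    return jittered
```

Under this assumed model, a larger `max_offset` corresponds to a more severe corruption level; with `max_offset=0` the frames are returned unchanged.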

Robust Audio-Visual Speech Recognition Under Noisy Audio-Video Conditions

Stewart, Seymour, Pass, et al. 2014. IEEE Trans. Cybern.

This paper presents the maximum weighted stream posterior (MWSP) model as a robust and efficient stream integration method for audio-visual speech recognition in environments where the audio or video streams may be subjected to unknown and time-varying corruption. A significant advantage of MWSP is that it does not require any specific measurements of the signal in either stream to calculate appropriate stream weights during recognition, and as such it is modality-independent. This also means that MWSP complements, and can be used alongside, many of the other approaches that have been proposed in the literature for this problem. For evaluation we used the large XM2VTS database for speaker-independent audio-visual speech recognition. The extensive tests include both clean and corrupted utterances, with corruption added to the video stream, the audio stream, or both, using a variety of types (e.g., MPEG-4 video compression) and levels of noise. The experiments show that this approach gives excellent performance in comparison to another well-known dynamic stream weighting approach and to any fixed-weight integration approach, both in clean conditions and when noise is added to either stream. Furthermore, our experiments show that the MWSP approach dynamically selects suitable integration weights on a frame-by-frame basis according to the level of noise in the streams and also according to the naturally fluctuating relative reliability of the modalities, even in clean conditions. The MWSP approach is shown to maintain robust recognition performance in all tested conditions, while requiring no prior knowledge about the type or level of noise.
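The abstract does not state the MWSP formula, so the following is only one plausible reading of the name, offered as a sketch: for each frame, try a set of candidate stream weights, combine the audio and video log-likelihoods with each weight, normalize to class posteriors, and keep the weight whose largest posterior is highest. The function name `mwsp_frame`, the candidate weight grid, the linear log-likelihood combination, and the uniform class prior are assumptions, not details taken from the paper.

```python
import math


def softmax(log_scores):
    """Convert log scores to normalized posteriors (uniform prior assumed)."""
    peak = max(log_scores)
    exps = [math.exp(s - peak) for s in log_scores]
    total = sum(exps)
    return [e / total for e in exps]


def mwsp_frame(audio_loglik, video_loglik, candidate_weights):
    """Pick the stream weight that maximizes the top class posterior for one frame.

    audio_loglik, video_loglik: per-class log-likelihoods for this frame.
    candidate_weights: audio weights in [0, 1]; the video weight is 1 - w.
    Returns (best_weight, posteriors_under_best_weight).
    """
    best_weight, best_post, best_peak = None, None, -1.0
    for w in candidate_weights:
        combined = [w * a + (1.0 - w) * v
                    for a, v in zip(audio_loglik, video_loglik)]
        post = softmax(combined)
        if max(post) > best_peak:
            best_weight, best_post, best_peak = w, post, max(post)
    return best_weight, best_post


# Audio clearly favors class 0 while video is uninformative,
# so the audio stream should receive the full weight.
weight, posteriors = mwsp_frame([-1.0, -10.0, -10.0],
                                [-5.0, -5.0, -5.0],
                                [0.0, 0.5, 1.0])
```

Note that this selection uses only the model scores for the current frame, which is consistent with the abstract's point that no separate measurement of signal quality is needed to set the weights.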

Subband correlation and robust speech recognition

Mcauley, Stewart, et al. 2005. IEEE Trans. Speech Audio Process.

Agile risk management using software agents

Odzaly, Greer, Stewart. 2017. J Ambient Intell Human Comput

Lightweight Risk Management: The Development of Agile Risk Tool Agents

Odzaly, Greer, Stewart. 2017

Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos
