Y.-Q. Wang scite author profile

Y.-Q. Wang

3Publications

45Citation Statements Received

62Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of Cambridge

Publications

Order By: Most citations

Model-based approaches to handling additive noise in reverberant environments

Gales

Wang

2011

View full text Add to dashboard Cite

Model-based approaches to handle additive and convolutional noise have been extensively investigated and used. However, the application of these approaches to handling reverberant noise has received less attention. This paper examines the extension of two standard adaptation/compensation approaches to handling reverberant noise. The first is an extension of vector Taylor series (VTS) compensation, reverberant VTS, where a mismatch function representing reverberant noise is used. The second scheme modifies constrained MLLR to allow a wide-span of frames to be taken into account and "projected" into the required dimensionality. To allow additive noise to be handled, both these schemes are combined with standard VTS. The approaches are evaluated and compared on two tasks, MC-WSJ-AV, and a reverberant simulated version of AURORA-4.

show abstract

Speaker and noise factorisation on the AURORA4 task

Wang

Gales

2011

View full text Add to dashboard Cite

For many realistic scenarios, there are multiple factors that affect the clean speech signal. In this work approaches to handling two such factors, speaker and background noise differences, simultaneously are described. A new adaptation scheme is proposed. Here the acoustic models are first adapted to the target speaker via an MLLR transform. This is followed by adaptation to the target noise environment via model-based vector Taylor series (VTS) compensation. These speaker and noise transforms are jointly estimated, using maximum likelihood. Experiments on the AURORA4 task demonstrate that this adaptation scheme provides improved performance over VTS-based noise adaptation. In addition, this framework enables the speech and noise to be factorised, allowing the speaker transform estimated in one noise condition to be successfully used in a different noise condition.

show abstract

Improving reverberant VTS for hands-free robust speech recognition

Wang

Gales

2011

View full text Add to dashboard Cite

Abstract-Model-based approaches to handling additive background noise and channel distortion, such as Vector Taylor Series (VTS), have been intensively studied and extended in a number of ways. In previous work, VTS has been extended to handle both reverberant and background noise, yielding the Reverberant VTS (RVTS) scheme. In this work, rather than assuming the observation vector is generated by the reverberation of a sequence of background noise corrupted speech vectors, as in RVTS, the observation vector is modelled as a superposition of the background noise and the reverberation of clean speech. This yields a new compensation scheme RVTS Joint (RVTSJ), which allows an easy formulation for joint estimation of both additive and reverberation noise parameters. These two compensation schemes were evaluated and compared on a simulated reverberant noise corrupted AURORA4 task. Both yielded large gains over VTS baseline system, with RVTSJ outperforming the previous RVTS scheme.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Y.-Q. Wang

Model-based approaches to handling additive noise in reverberant environments

Speaker and noise factorisation on the AURORA4 task

Improving reverberant VTS for hands-free robust speech recognition

Contact Info

Product

Resources

About