Mamoru Mimura scite author profile

This paper presents a statistical method of single-channel speech enhancement that uses a variational autoencoder (VAE) as a prior distribution on clean speech. A standard approach to speech enhancement is to train a deep neural network (DNN) to take noisy speech as input and output clean speech. Although this supervised approach requires a very large amount of pair data for training, it is not robust against unknown environments. Another approach is to use nonnegative matrix factorization (NMF) based on basis spectra trained on clean speech in advance and those adapted to noise on the fly. This semi-supervised approach, however, causes considerable signal distortion in enhanced speech due to the unrealistic assumption that speech spectrograms are linear combinations of the basis spectra. Replacing the poor linear generative model of clean speech in NMF with a VAE-a powerful nonlinear deep generative modeltrained on clean speech, we formulate a unified probabilistic generative model of noisy speech. Given noisy speech as observed data, we can sample clean speech from its posterior distribution. The proposed method outperformed the conventional DNN-based method in unseen noisy environments.

show abstract

On a diffusive prey-predator model which exhibits patchiness

Mimura

Murray

1978

Journal of Theoretical Biology

178

142

View full text Add to dashboard Cite

Interface growth and pattern formation in bacterial colonies

Matsushita

Wakita

Itoh

et al. 1998

Physica A: Statistical Mechanics and its Applications

125

132

View full text Add to dashboard Cite

Higher-dimensional localized patterns in excitable media

Ohta

Mimura

Kobayashi

1989

Physica D: Nonlinear Phenomena

172

121

View full text Add to dashboard Cite

Aggregating pattern dynamics in a chemotaxis model including growth

Mimura

Tsujikawa

1996

Physica A: Statistical Mechanics and its Applications

149

120

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Mamoru Mimura

Exponential attractor for a chemotaxis-growth system of equations

Topology of Lie Groups, I and II

Reaction–diffusion modelling of bacterial colony patterns

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization

On a diffusive prey-predator model which exhibits patchiness

Interface growth and pattern formation in bacterial colonies

Higher-dimensional localized patterns in excitable media

Aggregating pattern dynamics in a chemotaxis model including growth

Contact Info

Product

Resources

About