Masahito Togami scite author profile

A dereverberation technique has been developed that optimally combines multichannel inverse filtering (MIF), beamforming (BF), and non-linear reverberation suppression (NRS). It is robust against acoustic transfer function (ATF) fluctuations and creates less distortion than the NRS alone. The three components are optimally combined from a probabilistic perspective using a unified likelihood function incorporating two probabilistic models. A multichannel probabilistic source model based on a recently proposed local Gaussian model (LGM) provides robustness against ATF fluctuations of the early reflection. A probabilistic reverberant transfer function model (PRTFM) provides robustness against ATF fluctuations of the late reverberation. The MIF and multichannel under-determined source separation (MUSS) are optimized in an iterative manner. The MIF is designed to reduce the time-invariant part of the late reverberation by using optimal time-weighting with reference to the PRTFM and the LGM. The MUSS separates the dereverberated speech signal and the residual reverberation after the MIF, which can be interpreted as an optimized combination of the BF and the NRS. The parameters of the PRTFM and the LGM are optimized based on the MUSS output. Experimental results show that the proposed method is robust against the ATF fluctuations under both single and multiple source conditions.

show abstract

Multi-channel Itakura Saito Distance Minimization with Deep Neural Network

Togami

2019

View full text Add to dashboard Cite

Basic Design of Human-Symbiotic Robot EMIEW

Hosoda

Egawa

Tamamoto

et al. 2006

View full text Add to dashboard Cite

Joint Dereverberation and Separation With Iterative Source Steering

Nakashima

Scheibler

Togami

2021

View full text Add to dashboard Cite

We propose a new algorithm for joint dereverberation and blind source separation (DR-BSS). Our work builds upon the IRLMA-T framework that applies a unified filter combining dereverberation and separation. One drawback of this framework is that it requires several matrix inversions, an operation inherently costly and with potential stability issues. We leverage the recently introduced iterative source steering (ISS) updates to propose two algorithms mitigating this issue. Albeit derived from first principles, the first algorithm turns out to be a natural combination of weighted prediction error (WPE) dereverberation and ISS-based BSS, applied alternatingly. In this case, we manage to reduce the number of matrix inversion to only one per iteration and source. The second algorithm updates the ILRMA-T matrix using only sequential ISS updates requiring no matrix inversion at all. Its implementation is straightforward and memory efficient. Numerical experiments demonstrate that both methods achieve the same final performance as ILRMA-T in terms of several relevant objective metrics. In the important case of two sources, the number of iterations required is also similar.

show abstract

Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection

Fujita

Takashima

Homma

et al. 2015

View full text Add to dashboard Cite

Scene-Dependent Acoustic Event Detection with Scene Conditioning and Fake-Scene-Conditioned Loss

Komatsu

Imoto

Togami

2020

View full text Add to dashboard Cite

Sparseness-Aware DOA Estimation with Majorization Minimization

Togami

Scheibler

2020

View full text Add to dashboard Cite

Simultaneous Optimization of Acoustic Echo Reduction, Speech Dereverberation, and Noise Reduction against Mutual Interference

Togami

Kawaguchi

2014

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Masahito Togami

Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function

Multi-channel Itakura Saito Distance Minimization with Deep Neural Network

Basic Design of Human-Symbiotic Robot EMIEW

Joint Dereverberation and Separation With Iterative Source Steering

Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection

Scene-Dependent Acoustic Event Detection with Scene Conditioning and Fake-Scene-Conditioned Loss

Sparseness-Aware DOA Estimation with Majorization Minimization

Simultaneous Optimization of Acoustic Echo Reduction, Speech Dereverberation, and Noise Reduction against Mutual Interference

Contact Info

Product

Resources

About